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Notes from the Editor 


Although my most recent annual editorial report was 
published in the January issue of PS: Political Science & 
Politics, it seems unsafe to assume that everyone who 
may be interested in how things are going at the APSR 
will have ventured into the nether regions of PS and 
there chanced upon my report. Therefore, I summarize 
here its most salient points. 

Submissions. During 2001-2, my first year as editor 
of the APSR, we experienced an extraordinary surge in 
submissions—a rise of either 44% or 56%, depending 
on how submissions are counted, in a single year. As 
we entered Year 2, we didn’t know whether that greatly 
enhanced manuscript flow would prove to be a one- 
year “spike” or would become the new norm. As it 
turned out, submissions showed no sign of abating dur- 
ing 2002-3. Indeed, they rose again, though by “only” 
about 8% over 2001-2. Obviously, then, the dramatic 
increase in submissions that we had experienced the 
year before wasn’t a one-time occurrence. A reversion 
to lower submission levels would have been disappoint- 
ing, but a continuation of such explosive growth would 
have been too much of a good thing. So I interpret the 
submission trend as good news. 

Turnaround. Our review process continued to move 


at a good pace. The median elapsed time from the day . 


we received a paper until the day I signed the decision 
letter to the author was 39 working days. Only rarely 
(certainly less than 5% of the time) did the process 
drag on for a period that could fairly be considered 
problematic. 

Intellectual diversity. One of my major goals as editor 
has been to attract a greater variety of submissions, so 
that the APSR’s pages are open in fact as well as in 
form to work representing the substantive and analyt- 
ical diversity of our discipline. In this regard, what has 
happened can be seen from either a “half empty” or 
a “half full” perspective. As indicated by the relative 
stability of the distribution of submissions across ana- 
lytical approaches and substantive fields, progress has 
been slow. On the other hand, consider, as a case in 
point, papers reporting “small-N” analyses, From 1995 
through 2000, such papers accounted, on average, for 
just 2% of APSR submissions, In 2002-3, they consti- 
tuted 5%. That probably doesn’t sound like much of 
a change, but bear in mind that the number of sub- 
missions was much larger in 2002-3 than it had been 
in earlier years, so “small-N” submissions became, in 
effect, a larger proportion of a much larger pie. In raw 
numbers, whereas the APSR received, on average, only 
about a half-dozen “small-N” submissions per year dur- 
ing the 1995-2000 period, in 2002-3 we received two 
dozen. In terms of the likelihood that “small-N” papers 
would actually be published in the APSR (and they 
were), that is a difference that matters. More gener- 
ally, given our very substantial increase in submissions 
across approaches and fields, we have actually made 
considerable progress in attracting papers that rep- 
resent approaches and/or fields previously under- 
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represented in terms of submission to, and publication 
in, the APSR. 

Rejection rates. Of every 100 first-round decisions 
that I made during 2002-3, 88 were rejections based 
on the recommendations of reviewers and, of course, 
my own assessment. Two of every 100 first-round deci- 
sions took the form of notifying the author that I was 
unwilling to open the review process because the au- 
thorship of the paper had not been rendered sufficiently 
anonymous, because the paper far exceeded our length 
limit, or because its formatting was wildly at variance 
with our guidelines; in every such instance, I invited 
the author to fix the problem and submit a corrected 
version of the paper, and that invitation invariably was 
accepted. Less than one out of every 100 decision letters 
informed the author that in my judgment the paper was 
so inappropriate for the APSR that no purpose would 
be served by sending it out for review; for the 671 sub- 
missions we received, I wrote only five such letters. The 
remaining eight of every 100 of my first-round decisions 
were “positive,” in the sense of inviting the author to 
revise the paper for further consideration, accepting it 
subject to some final conditions, or accepting it uncon- 
ditionally. Obviously, I spent a great deal of my time 
conveying bad news to authors. 

I continued to make sparing use of “revise and re- 
submit” invitations and to resist the temptation to pile 
one such invitation on top of another. I never offered a 
second “revise and resubmit” invitation, and it was not 
at all common for me to reject a revised resubmission. 
Of all the revised resubmissions on which we completed 
the review process during 2002-3, more than 85% were 
accepted. 

Compared to the distribution of submissions, the dis- 
tribution of accepted papers underrepresented purely 
quantitative submissions while overrepresenting in- 
terpretive/conceptual ones, and it underrepresented 
American politics submissions while overrepresenting 
international relations and normative theory ones. I 
have no ready explanation for this pattern, but I do 
caution against attributing it to editorial bias against 
quantitative work or research on American politics— 
for those are precisely the categories into which most 
of my own work falls. Rather, I tend to regard these 
as one-time blips that are unlikely to recur in years to 
come. 


IN THIS ISSUE 


That brings me to the current issue, the first one to 
appear during 2004. We do not publish “theme” issues. 
Occasionally, though, several of the articles that appear 
in a given issue prove, more or less serendipitously, to 
speak to the same broad theme. This is such an occa- 
sion. I hope it is not unduly procrustean to suggest that 
cooperation is a prominent focus of the articles in this 
issue (and hence the theme of this issue’s cover graphic 
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as well), for a recurrent theme of these articles is the 
acting out of shared values, goals, and policies through 
rules, procedures, and institutions. 

Cooperation is a vital element of human endeav- 
ors that involve more than one person—necessary to 
forming sustainable societies, choosing leaders, sustain- 
ing good citizenship, and maintaining peaceful rela- 
tions with other societies. It involves more than the 
absence of overt conflict, which could be a product of 
coercion or lack of opportunity. Rather, it involves a 
desire to get along with others. In contrast to views 
of human nature that emphasize competitive and ag- 
gressive instincts, Jobn Orbell, Tomonori Morikawa, 
Jason Hartwig, James Hanley, and Nicholas Allen think 
that getting along with others may, in effect, have 
been bred into us. In “‘Machiavellian’ Intelligence as a 
Basis for the Evolution of Cooperative Dispositions,” 
Orbell and his colleagues suggest that cooperative traits 
may be selected for in evolutionary processes based 
on certain cognitive mechanisms, These mechanisms 
underlie “Machiavellian intelligence”—the capacity to 
negotiate complex, competitive social environments. A 
fascinating simulation analysis shows how this capac- 
ity can provide an evolutionary foundation predispos- 
ing later generations to act cooperatively in ostensibly 
competitive settings. This evolutionary account recon- 
ciles payoff structures that seem to discourage coop- 
eration, on the one hand, and empirical findings that 
indicate widespread cooperation, on the other. 

Next comes an extreme case of the need for co- 
operation in moving a country from civil war to do- 
mestic tranquility. Political thinkers from Hobbes and 
Machiavelli to Huntington have deemed an authoritar- 
ian transition to be a necessary stage in the transfor- 
mation from civil war to democracy. In “The Paradox 
of ‘Warlord’ Democracy: A Theoretical Investigation,” 
though, Leonard Wantchekon argues that popular 
government can arise more or less directly from the 
chaos of civil war if the warring factions depend on 
the citizenry for economic growth, permit elections to 
arbitrate the new balance of power, and defer to a neu- 
tral third party to oversee the transition. Wantchekon’s 
multi-faceted analysis should help readers understand 
why, among other things, countries like El Salvador and 
Mozambique have been able to create new democ- 
racies, while the Sierra Leones of the world still find 
themselves mired in conflict. 

Cooperation can be especially difficult if the parties 
in question cannot readily identify common grounds 
or find a joint stake in mutually beneficial outcomes. 
In an age of global immigration into formerly homo- 
geneous European societies, Paul M. Sniderman, Louk 
Hagendoorn, and Markus Prior investigate the fac- 
tors that provoke animosity toward ethnic minorities. 
Such animosities have manifested themselves in pop- 
ular support for nativist political parties and policies 
on immigration, jobs, and crime. In “Predisposing Fac- 
tors and Situational Triggers: Exclusionary Reactions 
to Immigrant Minorities,” Sniderman and his associates 
use an experimental component embedded within an 
otherwise-conventional opinion survey to explore the 
impacts of economic hardship, fear of crime, and inter- 


cultural differences on tension between native-born 
Dutch and newly arrived immigrant groups. Although 
this study focuses on the Netherlands, its findings 
seem broadly applicable to other settings where once- 
homogeneous populations are becoming more diversi- 
fied and multicultural. 

As joint endeavors in which citizens collectively 
choose their leaders, elections are prime political mech- 
anisms of cooperative behavior. Three articles in this 
issue focus on electoral politics from various analyt- 
ical perspectives. In an analysis that combines empir- 
ical, normative, and prescriptive elements, Dennis F. 
Thompson considers the oft-overlooked matter of tim- 
ing. In “Election Time: Normative Implications of Tem- 
poral Properties of the Election Process in the United 
States,” Thompson highlights three characteristics of 
the timing of American elections that promote pop- 
ular sovereignty: elections occur at regular intervals 
(periodicity), voters cast their ballots at the same time 
(simultaneity), and the outcome conclusively deter- 
mines who will govern until the next election (finality). 
Thompson then uses these properties as standards to 
assess various electoral institutions. Do partisan redis- 
tricting, the use of absentee ballots, or the regulation 
of campaign-related spending enhance or detract from 
the democratic character of elections? Thompson’s an- 
swers to these questions seem certain to lead readers to 
rethink the relationship between America’s civic values 
and its public practices. 

Continuing this focus on elections as an integral com- 
ponent of democratic governance, Matt A. Barreto, 
Gary M. Segura, and Nathan D. Woods investigate the 
impact of minority-majority districts. The issue mo- 
tivating “The Mobilizing Effect of Majority-Minority 
Districts on Latino Turnout” is whether majority- 
minority districts, while enhancing the representation 
of minorities, also depress minority turnout. The idea 
here is that majority-minority districts may produce 
uncompetitive districts in which the motivation to vote 
wanes and may also produce political disenchantment 
when representation gains fail to pay off in terms of 
policy. If majority-minority districts actually undermine 
minority turnout, then minority influence on the out- 
come of “up-ballot” contests would be affected delete- 
riously. Focusing on turnout in five Southern California 
counties, Barreto and his colleagues show that Latinos 
who live in majority-minority districts are actually 
more likely to vote than Latinos who live elsewhere, 
especially as the number of embedded layers of 
majority-minority districts increase. This study makes a 
significant contribution to our understanding of this po- 
litically volatile set of issues by focusing on non-African 
American minority voters, and in so doing provides a 
firmer empirical base for policy advocates and policy 
makers to take into account when Congress considers 
reauthorizing the Voting Rights Act in 2007. 

A common criticism of formal modeling is that much 
of it betrays little immersion in the “real world” of pol- 
itics and, correspondingly, that notwithstanding their 
internal rigor, it is often difficult even to imagine how 
such models might be applied empirically or tested. 
Encouragingly, though, more and more attention. is 
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being devoted to empirical applications and tests of 
formal models. A case in point is Enriqueta Aragones 
and Thomas R. Palfrey’s experimental analysis of how 
candidate quality affects election outcomes. In “The 
Effect of Candidate Quality on Electoral Equilibrium: 
An Experimental Study,” Aragones and Palfrey put 
their prior theoretical work to the test, and in so doing 
provide evidence for the striking proposition that the 
indirect equilibrium effects of candidate quality may 
be more important in determining candidates’ policy 
positions than in producing more votes for a particular 
candidate. 

We then turn to a panel of great political thinkers for 
enlightenment about how individuals do and should 
cooperate with others. According to Aristotle, one is a 
citizen when one acts as a citizen. One’s status as free 
or slave does not depend on birth, accident, decree, or 
force, and is not immutable. Rather, it stems from the 
active and regular use of reason to make choices. Such 
a deliberative, active political life is the key to what Jill 
Frank, in “Citizens, Slaves, and Foreigners: Aristotle on 
Human Nature,” deems necessary to avoid the Scylla 
(slavery) and Charybdis (despotism) of living in a free 
democracy. Rethinking several assumptions about the 
relationship between nature and politics, Frank inter- 
prets Aristotle as holding that political institutions and 
norms can produce people accustomed to, and fit for, 
either freedom and democracy, on the one hand, or 
slavery and despotism, on the other. This admonition 
reminds modern-day democrats to be vigilant for out- 
side threats to a vigorous and reasoned democracy, and 
for the internal lethargy that can undermine it. 

The idea of action also runs through Laurence 
D. Cooper’s “Between Eros and the Will to Power: 
Rousseau and ‘The Desire to Extend Our Being.” Ac- 
cording to Rousseau, the fundamental desire of, and 
highest good for, humans is to feel existence as much 
as possible. Cooper identifies this under-noticed con- 
cept of desire as central to Rousseau’s moral and polit- 
ical philosophy. Rousseau’s relevance to the ongoing 
debate between liberals and communitarians makes 
Cooper’s analysis important reading for more than the 
relatively narrow circle of Rousseau scholars or even 
the broader community of political theorists. 

Cooperation of a different sort than considered in the 
other articles in this issue is Thomas Heilke’s concern in 
“Realism, Narrative, and Happenstance: Thucydides’ 
Tale of Brasidas.” Heilke’s point of departure is the 
frequently frustrating quest of scholars of international 
relations to develop law-like axioms and structural the- 
ories from which practically useful policy prescriptions 
can be generated. The key, Heilke holds, to bridging 
the gap and reconciling these two often-contradictory 
aspirations lies in constructive use of narrative. Tak- 
ing his cue from Hobbes that “the narration itself doth 
secretly instruct the reader, and more effectually than 
can possibly be done by precept,” Heilke uses the Spar- 
tan general Brasidas to illustrate not just how a prac- 
titioner was affected by “vaguely defined forces,” but 
also how “speech and actions” of individuals come into 
play in a “complex contextual interaction of luck and 
excellence.” 
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Like individuals, institutions and governments must 
find ways to reach common goals with one another. 
How do they do it? In an analysis that is sure to 
raise eyebrows, authors Pauline Jones Luong and Erika 
Weinthal counterpose the contractarian view, fathered 
by Hobbes and more recently championed by rational 
choice advocates, and a coercion-based perspective 
that depicts strong leaders and force as the engine of 
change. In “Contra Coercion: Russian Tax Reform, Ex- 
ogenous Shocks and Negotiated Institutional Change,” 
Jones Luong and Weinthal argue that Russia’s new 
tax regime was the product of a new social contract 
between its largest businesses and the central gov- 
ernment. The economic crisis that began in 1998 and 
crippled the Russian economy led both groups to rec- 
ognize a mutuality of interests and thus to negotiate 
an agreement that addressed the government’s needs 
for greater revenue streams and business’ needs for 
tax reasonableness, simplicity, transparency and reg- 
ularity. The negotiations were not “captured” by any 
group, nor were the outcomes dictated by a coercive 
leader or by concerns for international aid and invest- 
ment. Rather, goaded by their earlier failures and a 
desire for economic recovery, bolstered on either side 
by the confidence-building acts, Russia’s major oil con- 
cerns and its political leaders cooperated to achieve key 
goals. 

Some similar themes recur in Christina L. Davis’s 
“International Institutions and Issue Linkage: Building 
Support for Agricultural Trade Liberalization.” Why 
do international trade negotiations turn out the way 
they do? Interests and power, the usual suspects, tell 
only part of the story, and Davis argues that much 
of the action lies in the agenda, rules, and procedures 
that structure the negotiation. Linking agricultural and 
industrial issues in a supportive institutional environ- 
ment, Davis finds, can effectively surmount domestic 
political opposition. Though specifically focused on in- 
ternational trade, Davis’s analysis should be of interest 
to political scientists interested in preference aggrega- 
tion, agenda setting, and the relationship between in- 
stitutions and policy outcomes. For issues ranging from 
the price of wheat to the size of the federal budget, the 
ways in which agendas are shaped, rules are manipu- 
lated, and procedures formed, profoundly affect policy 
outcomes. 

Davis’s focus on trade liberalization provides a su- 
perb lead-in to Beth A. Simmons and Zachary Elkin’s 
analysis of “The Globalization of Liberalization: Pol- 
icy Diffusion in the International Political Economy.” 
Liberal economic ideas and policies have spread un- 
evenly over time, in fits and starts, and across the globe, 
skipping over some areas and concentrating in others. 
What accounts for this clustering? Exploring an array 
of geographical, sociological, and political influences, 
Simmons and Elkins identify competition for capital 
(indicative of altered economic payoffs) and shared cul- 
ture (indicative of cross-cultural learning and borrow- 
ing) as ially important factors in nations’ decisions 
to follow the lead of their liberalizing neighbors. The 
implication of the “shared culture” factor in particular 
is that the scope of theory and research on comparative 
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and international political economy must remain broad 
enough to encompass underlying cultural as well as 
more traditional economic considerations. 

Finally, let me assure readers who experience a sen- 
sation of déjà vu when they encounter the final article 
in this issue that they are not hallucinating. Gary King, 
Christopher J.L. Murray, Joshua Salomon, and Ajay 
Tandon’s “Enhancing the Validity and Cross-cultural 
Comparability of Measurement in Survey Research” 
was published in our November 2003 issue. Unfortu- 
nately, through no fault of the authors, that version of 
the article contained several important printing errors. 
Consequently, with apologies to Messrs. King, Murray, 
Salomon, and Tandon, we print the corrected version of 
the article in this issue. This should be regarded as the 
definitive version of the article, and future references 
to it should cite this version rather than the one printed 
in our last issue. 


INSTRUCTIONS TO CONTRIBUTORS 


General Considerations 


The APSR strives to publish scholarly research of 
exceptional merit, focusing on important issues and 
demonstrating the highest standards of excellence 
in conceptualization, exposition, methodology, and 
craftsmanship. Because the APSR reaches a diverse 
audience of scholars and practitioners, authors must 
demonstrate how their analysis illuminates a significant 
research problem, or answers an important research 
question, of general interest in political science. For the 
same reason, authors must strive for a presentation that 
will be understandable to as many scholars as possible, 
consistent with the nature of their material. 

The APSR publishes original work. Therefore, au- 
thors should not submit articles containing tables, 
figures, or substantial amounts of text that have already 
been published or are forthcoming in other places, or 
that have been included in other manuscripts submitted 
for review to book publishers or periodicals (includ- 
ing on-line journals). In many such cases, subsequent 
publication of this material would violate the copyright 
of the other publisher. The APSR also does not consider 
papers that are currently under review by other journals 
or duplicate or overlap with parts of larger manuscripts 
that have been submitted to other publishers (including 
publishers of both books and periodicals). Submission 
of manuscripts substantially similar to those submitted 
or published elsewhere, or as part of a book or other 
larger work, is also strongly discouraged. If you have 
any questions about whether these policies apply in 
your particular case, you should discuss any such pub- 
lications related to a submission in a cover letter to the 
Editor. You should also notify the Editor of any related 
submissions to other publishers, whether for book or 
periodical publication, that occur while a manuscript is 
under review by the APSR and which would fall within 
the scope of this policy. The Editor may request copies 
of related publications. 


If your manuscript contains quantitative evidence 
and analysis, you should describe your procedures 
in sufficient detail to permit reviewers to understand 
and evaluate what has been done and, in the event 
that the article is accepted for publication, to permit 
other scholars to carry out similar analyses on other 
data sets. For example, for surveys, at the least, sampling 
procedures, response rates, and question wordings 
should be given; you should calculate response rates 
according to one of the standard formulas given 
by the American Association for Public Opinion 
Research, Standard Definitions: Final Dispositions 
of Case Codes and Outcome Rates for Surveys 
(Ann Arbor, MI: AAPOR, 2000). This document is 
available on the Internet at <http:/Avww.aapor.org/ 
default.asp? page=survey_methods/standards_and_best 
-practices/standard_definitions>. For experiments, 
provide full descriptions of experimental protocols, 
methods of subject recruitment and selection, subject 
payments and debriefing procedures, and so on. 
Articles should be self-contained, so you should 
not simply refer readers to other publications for 
descriptions of these basic research procedures. 

Please indicate variables included in statistical anal- 
yses by capitalizing the first letter in the variable 
name and italicizing the entire variable name the first 
time each is mentioned in the text. You should also use 
the same names for variables in text and tables and, 
wherever possible, should avoid the use of acronyms 
and computer abbreviations when discussing variables 
in the text. All variables appearing in tables should have 
been mentioned in the text and the reason for their 
inclusion discussed. 

As part of the review process, you may be asked 
to submit additional documentation if procedures are 
not sufficiently clear; the review process works most 
efficiently if such information is given in the initial sub- 
mission. If you advise readers that additional informa- 
tion is available, you should submit printed copies of 
that information with the manuscript. If the amount 
of this supplementary information is extensive, please 
inquire about alternate procedures. 

The APSR uses a double-blind review process. You 
should follow the guidelines for preparing anonymous 
copies in the Specific Procedures section below. 

Manuscripts that are largely or entirely critiques or 
commentaries on previously published APSR articles 
will be reviewed using the same general procedures as 
for other manuscripts, with one exception. In addition 
to the usual number of reviewers, such manuscripts will 
also be sent to the scholar(s) whose work is being crit- 
icized, in the same anonymous form that they are sent 
to reviewers. Comments from the original author(s) to 
the Editor will be invited as a supplement to the advice 
of reviewers. This notice to the original author(s) is 
intended (1) to encourage review of the details of 
analyses or research procedures that might escape 
the notice of disinterested reviewers; (2) to enable 
prompt publication of critiques by supplying criticized 
authors with early notice of their existence and, there- 
fore, more adequate time to reply; and (3) as a courtesy 
to criticized authors. If you submit such a manuscript, 
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you should therefore send as many additional copies 
of their manuscripts as will be required for this 
purpose. 

Manuscripts being submitted for publication should 
be sent to Lee Sigelman, Editor, American Political 
Science Review, Department of Political Science, The 
George Washington University, 2201 G Street NW., 
Room 507, Washington, DC 20052. Correspondence 
concerning manuscripts under review may be sent to 
the same address or e-mailed to apsr@gwu.edu. 
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ing text, all tables and figures, notes, references, and 
appendices. This page size guideline is based on the 
U.S. standard 8.5 x 11-inch paper; if you are submitting 
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accordingly. The font size must be at least 11 points for 
all parts of the paper, including notes and references. 
The entire paper, including notes and references, must 
be double-spaced, with the sole exception of tables 
for which double-spacing would require a second page 
otherwise not needed. All pages should be numbered 
in one sequence, and text should be formatted using 
a normal single column no wider than 6.5 inches, as is 
typical for manuscripts (rather than the double-column 
format of the published version of the APSR), and 
printed on one side of the page only. Include an ab- 
stract of no more than 150 words. The APSR style of 
embedded citations should be used, and there must be a 
separate list of references at the end of the manuscript. 
Do not use notes for simple citations. These specifi- 
cations are designed to make it easier for reviewers 
to read and evaluate papers. Papers not adhering to 
these guidelines are subject to being rejected without 
review. 

For submission and review purposes, you may place 
footnotes at the bottom of the pages instead of using 
endnotes, and you may locate tables and figures (on 
separate pages and only one to a page) approximately 
where they fall in the text. However, manuscripts ac- 
cepted for publication must be submitted with end- 
notes, and with tables and figures on separate pages 
at the back of the manuscript with standard indications 
of text placement, e.g., [Table 3 about here]. In deciding 
how to format your initial submission, please consider 
the necessity of making these changes if your paper 
is accepted. If your paper is accepted for publication, 
you will also be required to submit camera-ready copy 
of graphs or other types of figures. Instructions will be 
provided. 

For specific formatting style of citations and refer- 
ences, please refer to articles in the most recent issue 
of the APSR. For unusual style or formatting issues, 
you should consult the latest edition of The Chicago 
Manual of Style. For review purposes, citations and 
references need not be in specific APSR format, 
although some generally accepted format should be 
used, and all citation and reference information should 
be provided. 


Speclflc Procedures 
Please follow these specific procedures for submission: 


1. You are invited to submit a list of scholars 
who would be appropriate reviewers of your 
manuscript. The Editor will refer to this list in 
selecting reviewers, though there obviously can be 
no guarantee that those you suggest will actually 
be chosen. Do not list anyone who has already 
commented on your paper or an earlier version of 
it, or any of your current or recent collaborators, 
institutional colleagues, mentors, students, or 
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2. Submit five copies of manuscripts and a diskette 
containing a pdf file of the anonymous version of 
the manuscript. If you cannot save the manuscript 
as a pdf, just send in the diskette with the word- 
processed version. Please ensure that the paper 
and diskette versions you submit are identical; the 
diskette version should be of the anonymous copy 
(see below). Please review all pages of all copies 
to make sure that all copies contain all tables, 
figures, appendices, and bibliography mentioned 
in the manuscript and that all pages are legible. 
Label the diskette clearly with the (first) author’s 
name and the title of the manuscript (in abridged 
form if need be), and identify the word processing 
program and operating system. 

3. To comply with the APSR’s procedure of double- 
blind peer reviews, only one of the five copies sub- 
mitted should be fully identified as to authorship 
and four should be in anonymous format. 

4. For anonymous copies, if it is important to the 
development of the paper that your previous pub- 
lications be cited, please do this in a way that does 
not make the authorship of the submitted paper 
obvious. This is usually most easily accomplished 
by referring to yourself in the third person and 
including normal references to the work cited in 
the list of references. In no circumstances should 
your prior publications be included in the bibli- 
ography in their normal alphabetical location but 
with your name deleted. Assuming that text refer- 
ences to your previous work are in the third per- 
son, you should include full citations as usual in the 
bibliography. Please discuss the use of other proce- 
dures to render manuscripts anonymous with the 
Editor prior to submission. You should not thank - 
colleagues in notes or elsewhere in the body of 
the paper or mention institution names, web page 
addresses, or other potentially identifying infor- 
mation. All acknowledgments must appear on the 
title page of the identified copy only. Manuscripts 
that are judged not anonymous will not be 
reviewed. 

5. The first page of the four anonymous copies 
should contain only the title and an abstract of 
no more than 150 words. The first page of the 
identified copy should contain (a) the name, 
academic rank, institutional affiliation, and 
contact information (mailing address, telephone, 
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author who will receive correspondence; (c) any 
relevant citations to your previous work that 
have been omitted from the anonymous copies; 
and (d) acknowledgments, including the names 
, of anyone who has provided comments on the 
manuscript. If the identified copy contains any 
unique references or is worded differently in any 
way, please mark this copy with “Contains author 
citations” at the top of the first page. 
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individuals in socially desirable directions, but by now well-established laboratory results show 


H to promote cooperative behavior is classically solved by incentives that lead self-interested 


that people often do act cooperatively, even at significant cost to themselves. These results suggest 


that cooperative dispositions might be an evolved part of human nature. Yet such dispositions appear 
inconsistent with the “Machiavellian intelligence” paradigm, which develops the idea that our brains 
have evolved, in substantial part, for capturing adaptive advantage from within-group competition. 
We use simulation to address the evolutionary relationship between basic Machiavellian capacities and 
cooperative dispositions. Results show that selection on such capacities can (1) permit the spread of 
cooperative dispositions even in cooperation-unfriendly worlds and (2) support transitions to populations 
with high mean cooperative dispositions. We distinguish between “rationality in action” and “rationality 
in design” —the adaptive fit between a design attribute of an animal and its environment. The combination 
of well-developed Machiavellian intelligence, modest mistrust, and high cooperative dispositions appears 


to be a rational design for the brains of highly political animals such as ourselves. 


individuals have strong incentives against such 

action is perhaps the fundamental problem of 
political theory (Axelrod 1981; Ostrom 1998). Clas- 
sically spelled out by Hobbes ([1651] 1947) but now 
usually framed in terms of the Prisoner’s Dilemma, the 
issue is how to persuade individuals to act in the inter- 
ests of the collectivity (to “cooperate”) when there is a 
clear incentive for them to do otherwise (to “defect”). 
The incentive structure means that there is only one so- 
lution for rational and self-interested individuals: The 
incentives they confront must, somehow, be changed 
so that cooperation rather than defection offers the 
greater return. We now understand that this does not 
necessarily imply a centralized Hobbesian Leviathan, 


H: to promote group-benefiting action when 
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and decentralized mechanisms might be sufficient—for 
example, by the existence of “altruistic punishment” 
(Boyd et al. 2003). Nevertheless, without additional or 
“selective” (Olson 1965) incentives, rational and self- 
interested individuals will defect in Prisoner’s Dilemma 
situations. 

But that does not mean that real people con- 
fronting real PD-like situations will always defect. In 
fact, as Field (2001) has recently emphasized, there is 
now strong laboratory evidence documenting humans’ 
frequent willingness to cooperate even in one-shot 
Prisoner’s Dilemmas where the incentive to defect 
is substantial and unambiguous (e.g., Caporael et al. 
1989; Orbell and Dawes 1993; and Ostrom, Walker, 
and Gardner 1992). The incidence of such cooperation 
varies between studies and experimental conditions, 
but people often do cooperate in prisoner’s dilemmas, 
sometimes in large numbers. The extensive literature 
on ultimatum and dictator games, although not ad- 
dressing cooperation per se, supports the same general 
conclusion: People are frequently prepared to carry sig- 
nificant private costs to benefit of others (Camerer and 
Thaler 1995). 

Taking such findings seriously has implications for 
how we address classic problems in political science, 
notably normative issues surrounding the design of in- 
stitutions. Certainly, it is foolish to design institutions in 
such a way that socially desirable outcomes depend on 
people consistently acting against their private interests 
(G. Hardin 1977). But “incentive compatibility” and co- 
operative dispositions might interact. On the positive 
side, perhaps awareness that “compatible” incentives 
are in place will increase players’ expectations that oth- 
ers will cooperate, increasing thereby the likelihood of 
their own cooperative dispositions being expressed in 
their own behavior. This might happen, for example, 
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when responses are governed by fear of being suckered 
rather than by greed for the free rider’s payoff (Orbell 
et al. 1986). On the negative side, Frohlich and Oppen- 
heimer (1996) have shown that incentive compatibility 
can, sometimes at least, undermine socially productive 
behavior that would otherwise result from innate co- 
operative dispositions. Perhaps the social capital rep- 
resented by such dispositions would be substantially 
lost if we were to organize social life exclusively as a 
response to private incentives. In general, the now well- 
documented’ existence of innate cooperative disposi- 
tions means that it is almost certainly an error to base 
recommendations about institutional design exclu- 
sively on “incentive compatibility’—or on the assump- 
tion that cooperative dispositions are universal and in- 
fallible in their production of cooperative behavior. 

We do take those findings seriously and believe, with 
Field (2001),'that it is appropriate to address the prob- 
lem in biological—that is to say, evolutionary—terms. 
Doing so, of course, goes against the “standard social 
science model” (Tooby and Cosmides 1992) that seeks 
explanations,of human behavior exclusively in cultural 
or environmental terms, but drawing a strict dichotomy 
between “culture” and “biology” is now widely rec- 
ognized as erroneous (e.g., Boyd and Richerson 1985 
and Dunbar, Knight, and Power 1999). Humans are 
certainly cultural animals, and culture certainly has a 
lot to do with human behavior, including cooperation. 
But accepting these facts only poses further evolution- 
ary questions, for example, How and why have hu- 
mans evolved the capacity for culture? What are the 
constraints that evolved human nature places on the 
substance of culture? and What are the constraints 
that evolving:culture places on human nature? Granted 
that the issue is human nature, biology is the ultimate 
source of understanding—and, as Dobzhansky (1973) 
famously pointed out, “Nothing in biology makes sense 
except in theilight of evolution.” 

Several evolutionary arguments about cooperative 
dispositions are well known in the life sciences. The 
theory of kin selection (Hamilton 1964) shows how a 
disposition te assist others with whom one is related 
can rebound to one’s own genetic advantage even if 
adaptive costs are involved in the helping act, sug- 
gesting that our cooperative responses toward nonkin 
might be founded, at least in part, on an evolved dis- 
position to cooperate toward kin. The theory of reci- 
procity (Axelrod 1984; Trivers 1971) provides an “Pl 
scratch your back if you scratch mine” explanation for 
cooperative dispositions that does not require genetic 
relatedness, but only that individuals (even members 
of different species) encounter each other through an 
iterated sequence of Prisoner’s Dilemma-like games. 
A further model, the one favored by Field (2001), is 
group selection, now staging a comeback (Sober and 
Wilson 1998); after many years of having been widely 
discounted (Dawkins 1976; Williams 1966). Although 
still hotly disputed (Reeve 2000), group selection’s ca- 
pacity to promote cooperative dispositions requires 
that a group’s survival prospects be increased by mem- 
bers’ cooperative choices, with the cooperator’s fitness 
gains from the group’s success being greater than the 


fitness costs that individuals incur as a result of their 
cooperative choices! 

Notably absent from this list is any model of how 
cognitive capacities designed by natural selection for 
addressing prior problems might have provided a suf- 
ficient, even necessary, basis from which cooperative 
dispositions could subsequently evolve. As is widely 
recognized, natural selection must build from existing 
structures—in Dennett’s (1995) terms, using “cranes” 
rather than “skyhooks”—and any significant coopera- 
tive dispositions must have evolved in the context of 
prior adaptations. In this spirit, the broad question we 
address is, Might cognitive capacities originally designed 
to address other adaptive problems have made possible 
the evolution of cooperative dispositions such as those 
now strongly suggested by the empirical data? 

In Tinbergen’s (1963) famous distinction, our in- 
terest is in “ultimate” rather than “proximate” pro- 
cesses—that is, we address adaptive pressures across 
many generations as opposed to particular mecha- 
nisms that might have evolved to capture adaptive 
gains in the here-and-now. This does not detract, of 
course, from the interest attaching to proximate mecha- 
nisms. At the conclusion of his original paper, for ex- 
ample, Trivers (1971) laid out a formidable research 
agenda for psychology by proposing that moralistic ag- 
gression, gratitude, sympathy, guilt, friendship, gos- 
sip, cheating, and the ability to detect cheating might 
have evolved as proximate mechanisms for capturing 
the gains available from reciprocity. But in evolution- 
ary theory, ultimate selective pressures and proximate 
mechanisms are distinct issues, and we do not address 
the latter here. 

Our particular interest is in cognitive mecha- 
nisms fundamental to the “Machiavellian intelligence” 
(Byrne and Whiten 1988; Whiten and Byrne 1997) 
hypothesis.” In its broadest terms, this proposes that 
group living selects strongly for whatever cognitive ca- 
pacities facilitate an individual’s successful negotiation 
of the competitive and highly complex social environ- 
ment of the group. A seminal paper by Humphrey 
(1976; see also Jolly 1966) pointed to the complexity, 
fluidity, and recursiveness of social relations within pri- 
mate groups as posing a particularly difficult adaptive 
problem for members of those groups—far more dif- 
ficult, Humphrey argued, than the standard adaptive 
problems of gathering resources and avoiding preda- 
tors: 


Once a society has reached a certain level of complexity, 
then new internal pressures must arise which act to increase 
its complexity still further. For... an animal’s ‘adversaries’ 
are members of his own breeding community. If intellectual 





1 We use “fitness” in the standard (although sometimes debated) 
evolutionary sense of an individual’s relative success in populat- 
ing subsequent generations with its descendants Adaptations— 
complex structures produced by natural selection in response to 
challenges in a species’ ancestral “environment of evolutionary adap- 
tation"(Bowlby 1969)}—necessarily involve fitness costs as well as 
fitness benefits, with the presumption being that an adaptation would 
not have been selected for if the net were not poative 

? Known also as the “social” (Brothers 1997) and “political” (Boehm 
1997) mtelligence hypothems. 
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prowess is correlated with social success, and if social suc- 
cess means high biological fitness, then any heritable trait 
which increases the ability of an individual to outwit his fel- 
lows will soon spread through the gene pool. And in these 
circumstances there can be no going back: an evolutionary 
‘ratchet’ has been set up, acting like a self-winding watch 
to increase the general intellectual standing of the species. 
(311) 


Although there are substantial adaptive advantages 
to group living—most obviously, better protection 
from predators, improved success as predators, and 
more ready access to mates—there is also genetic 
competition between group members, meaning that 
those attributes best equipped to win the never-ending 
Machiavellian games of “social chess” or “plot and 
counter plot” will be positively selected. 


Thus social primates are required by the very nature of the 
system they create and maintain to be calculating beings; 
they must be able to calculate the consequences of their 
own behavior, to calculate the likely behavior of others, to 
calculate the balance of advantage and loss—and this all in 
a context where the evidence on which their calculations 
are based is ephemeral, ambiguous and liable to change, 
not least as a consequence of their own actions. (309) 


By this hypothesis, evolutionary competition be- 
tween group members selects for capacities that allow 
individuals to deceive and exploit other group mem- 
bers, while simultaneously avoiding being deceived and 
exploited by others. Employing the terms introduced 
by Dawkins and Krebs (1978), the two fundamental 
“Machiavellian” capacities are (1) Sender’s capacity to 
persuade another group member to accept as true what 
it is in Sender’s interest to have it believe is true—viz, 
manipulation>—and (2) Receiver’s capacity to pene- 
trate to the truth underlying messages from potentially 
manipulative others—viz, mindreading. More specifi- 
cally, therefore, our research question is, Could evolved 
“Machiavellian” capacities for manipulation and min- 
dreading be a basis for the evolution of cooperative dis- 
positions among social animals such as ourselves? 

This has the air of paradox. On the one hand, the 
Machiavellian intelligence hypothesis seems to dove- 
tail quite nicely with the “harder” models of rational 
action, implying a brain designed to facilitate the pur- 
suit of private welfare by whatever means necessary. 
On the other, cooperative behavior involves, by def- 
inition, rejecting a dominant incentive, an alternative 
that is superior for the acting individual regardless of 
what another individual might choose. If our brains are 
designed for “Machiavellian” purposes, should we not 
expect evolved dispositions to defect—at least when we 
can get away with it—not to cooperate? 

We use evolutionary simulation to show how, un- 
der specified parameters, it is not only possible that 
Machiavellian capacities provide an evolutionary foun- 
dation for cooperative dispositions, but in fact quite 


3 As Trivers (1985) has commented, “One of the most important 
things to realize about systems of anımal communication is that they 
are not systems for the dissemination of the truth. An animal se- 
lected to signal to another animal may be selected to convey correct 
mformation, nusinformation, or both” (395). 
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likely. If we are, by nature, Machiavellian animals ben- 
efiting from group living but also exploiting other group 
members when that is possible, our findings show how 
Machiavellian capacities could be a critical underpin- 
ning for evolutionary selection on cooperative dispo- 
sitions capable of motivating us, perhaps quite often, 
to act cooperatively—thus helping to resolve collec- 
tive action problems absent appropriate selective in- 
centives. 


DESIGN OF THE SIMULATION 


We use simulation for two reasons. First and most ob- 
viously, the processes that interest us occurred over a 
period of perhaps hundreds of thousands of years and, 
because they involve social relationships, could leave 
only little by way of a directly observable fossil record 
(Wynn 2002). Although careful inference from the ex- 
isting fossil record can help us understand when partic- 
ular cognitive structures evolved (e.g., Mithen 1996), 
such work is necessarily informed by plausible models 
of process, and simulation is one way of constructing 
such models. Second, simulation can be a tool for dis- 
covery, letting the researcher explore the consequences 
of diverse parameter settings and, perhaps, identify 
theoretically interesting processes that might not have 
been considered using standard analytic techniques. 
This has been our approach—meaning that a first order 
of business here is to specify the design decisions made 
in constructing the simulation. 

A simulation for studying the relationship between 
Machiavellian capacities and cooperative dispositions 
requires modeling decisions at several levels of analysis: 

Individual attributes. We incorporate two disposi- 
tions and two cognitive (information processing) capac- 
ities. The dispositions are, first, to act in a cooperative 
manner within joined Prisoner’s Dilemma games‘ and, 
second, to “mistrust” others’ willingness to do so (thus 
its converse, to “trust” them). The cognitive capacities 
are, first, the capacity to manipulate the persuasiveness 
of what the individual communicates—true or false— 
to others and, second, the capacity to mindread the 
truth underlying others’ efforts at such manipulation 
(Dawkins and Krebs 1978). At the ultimate or func- 
tional level at which we operate, of course, we can 
sidestep the always fascinating proximate questions 
about particular mechanisms facilitating such manip- 
ulation and mindreading. 

Interindividual communication: Because we are in- 
terested in the evolution of cooperation, when two 





4 Although much recent theoretical and empirical work has focused 
on cooperation within multiple-person Prisoner's Dilemma games, 
the problem of predicting others’ behavior in such games is com- 
putationally formidable, perhaps requiring simplifying heuristics of 
one kind or another (Orbell and Dawes 1993), whereas the problem 
of choice and incentives remains basically the same in two-person 
and n-person PDs (although see Dawes 1975). Accordingly, we focus 
exclusively on the two-person PD here—defined, as usual, by a game 
with (1) a binary choice between “cooperation” and “defection”; 
(2) the standard payoff notation being £ = lone defection or free 
nding, c = mutual cooperation, d = mutual defection, and s = lone 
cooperation or “being suckered”; and (3) t > c>d> sand 2c > (t+5) 
> 2d 
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individuals encounter each other they must choose be- 
tween entering and not entering a potentially cooper- 
ative game. In this context, Machiavellian intelligence 
requires that each sends messages to the other about 
his or her intentions within any such game (subject to 
manipulation) and that each evaluates the truth-value 
of such messages received from the other (a task per- 
formed by mindreading). 

Individual decision-making: Individuals must make 
“social” decisions—whether to enter a particular po- 
tentially cooperative game and, if they do, whether to 
cooperate or'defect—and those decisions must be in- 
formed, at least in part, by their estimates of what the 
other individual is likely to do. Importantly, the option 
of not entering such a game implies the availability of 
an alternative course of action that has some value, thus 
that the individual’s play vs. not play decision will be 
based on its comparison between what it expects to get 
from playing a cooperative game and what it expects 
from such an alternative. There are, of course, many dif- 
ferent games that social animals play—Rapoport et al. 
(1976) identified 78 logically distinct two-by-two games, 
and many of those have multiple-person variants—but 
modeling a social ecology with all that complexity, if 
indeed possible, would detract from our primary goal 
of understanding how the evolution of Machiavellian 
cognitive capacities and cooperative dispositions might 
be related, ough, particularly interesting alterna- 
tives are (1) making a “hawk” challenge, indicating a 
willingness to fight over some resources otherwise in 
control of another individual (Orbell, Morikawa, and 
Allen 2002), and (2) acting in a solitary, “go it alone” 
manner, extracting resources from the environment ab- 
sent any interaction with one’s own species—an en- 
tirely “nonsocdial” course of action. 

Natural selection: In an evolutionary model there 
must be a process by which individuals’ actions during 
their lifetimes are translated into relative reproductive 
success, thus that allows for selective retention in the 
population of whatever attributes produced that suc- 
cess (Dennett 1995). As is well recognized, population 
attributes change by drift as well as by relative 
reproductive success; in fact, we will show that drift 
might have played an important role in the evolution 
of cooperative dispositions. In either case, however, 
evolution requires a source of variation that is ran- 
dom with respect to the individual’s success but that 
makes it possible for some individuals to be reproduc- 
tively more successful than others. We recognize that 
variation in the natural world can be produced by, for 
example, recombination as well as by mutation, but for 
simplicity in what follows we use the term “mutation” 
to include all' sources of variation. Critically for our 
purposes, mutations must happen on the dispositional 
and ae RAO E that bear on an individual’s 
success in complex social environments. 


| 
Dispositional and Cognitive Attributes 


We model an individual’s disposition to cooperate as a 
probability between 0.0 and 1.0 (its probability of coop- 
erating, or PC). Each decision it makes is determined 


by a random draw from its PC; it will cooperate if the 
draw falls at or below its PC and defect if it falls above 
that value. Notice that such decisions are not appropri- 
ately thought of as “rational.” Granted, an individual 
with a low PC might be classified as more “rational” 
than one with a higher such value; cooperation is, after 
all, dominated. But that has no particular bearing on 
our evolutionary model in which such an individual is 
simply “undisposed to cooperate.” If such dispositions 
are adaptive, then they will be positively selected and 
will spread throughout the population. If they are not, 
then they will be selected against and, probably, will 
vanish—“rational” or not. 

We assume that every agent, on encountering an- 

other, makes the claim “I will always cooperate” as it 
makes no strategic sense—certainly in “Machiavellian” 
terms—to claim anything less than perfect coopera- 
tiveness. The total communication is modeled as 100 
messages, some of which (PC * 100) are true (reflecting 
the true probability of cooperating) and the remainder 
of which ({1 — PC} * 100) are false (reflecting the true 
probability of noncooperation). But the sender of any 
such communication confronts the discrete manipula- 
tive problems of being convincing in both its true and its 
false messages. Each individual is therefore equipped 
with separate manipulative capacities for its true mes- 
sages (Manipulate True) and for its false ones (Manip- 
ulate False), in each case varying between 0.0 and 1.0. 
Agents in our simulation might, therefore, be good 
truth-tellers and good liars, good truth-tellers but poor 
liars—and so on. Particular truths and particular lies are 
not all equally persuasive but are normally distributed 
around the agent’s mean manipulative capacity in each 
case. 
Correspondingly, individuals are modeled as having 
a mindreading capacity that varies between 0.0 and 1.0. 
At the low extreme of mindreading, an individual has 
no capacity to penetrate the truth of a potential part- 
ner’s “I will always cooperate” claim. While at the high 
extreme it has a perfect capacity to do so, being (poten- 
tially) able to recognize all Sender’s lies as lies and all its 
truths as truths. Notice that we do not model mindread- 
ing capacities specialized separately for responding to 
truths and lies. That would imply a cognitive apparatus 
equipped with a priori knowledge of which is which, 
thus that the problem of mindreading has already been 
solved.° 

“Trust” is a concept that has attracted much inter- 
est in recent years (e.g., Fukuyama 1995 and Nesse 
2001), and it plays an important role in the simulation. 
As specified above, mindreading lets agents address 
whether or not they should “trust” a particular other to 
act on it’s “I will always cooperate” claim (cf. R. Hardin 


5 Of course, an individual could be better at (for example) detecting 
true statements than at detecting lies, and as our results will show, 
populations can evolve such differential abilities Similarly, individ- 
uals might or might not be aware that they have such drfferential 
abilities, if they do But our present modeling concern 1s with the 
structure of cognitive architecture, and a mechanism designed for 
mindreading—for detecting whether or not another individual is 
telling the truth—cannot be constructed on the assumption that it 
“knows” the answer to that question in the particular case. 
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1991 and Orbell, Dawes, and Schwartz-Shea 1994). But 
a distinct issue—related to the concept of social cap- 
ital (Putnam 2000)—concerns trust as a generalized 
disposition toward others, a willingness to accept oth- 
ers’ “I will always cooperate” messages independent 
of expectations that mindreading might have produced 
about particular individuals, and we have incorporated 
this into the simulation. We model generalized “mis- 
trust” as a value between 0.0 and 1.0, with individuals 
at the low end of that distribution being disposed to 
accept most or all of another individual’s messages— 
whether true or false, and operationally independent 
of what their mindreading tells them they should ac- 
cept from “this” particular individual—and individu- 
als at the high end being disposed to reject most or 
all such messages. Only messages whose persuasive- 
ness, modified by Sender’s manipulation and Receiver’s 
mindreading, places them above Receiver’s mistrust 
threshold are believed by Receiver and incorporated 
into its decision-making. 


Individual Decision-Making 


An encounter between two individuals obliges each to 
decide between entering a PD game with the other 
vs. taking the alternative, non-PD course of action 
(which we call “ALT”). By assumption, individuals 
know their own cooperate vs. defect choice, and they 
also know the payoffs available from a joined PD game 
and from ALT.° The only additional information they 
need before making a decision, therefore, is the prob- 
ability with which the other individual will cooperate 
in a joined PD game. A decision-maker’s estimate of 
this will be a function of (1) the other individual’s 
actual PC, (2) the other individual’s capacity to ma- 
nipulate both its true and its false messages, (3) the 
decision-maker’s own mindreading capacity, and (4) 
the decision-maker’s own level of generalized mistrust. 
Figure 1 uses illustrative values to show how the PD vs. 
ALT decision is made. 

Sender is attempting to manipulate Receiver into 
believing its 100 “I will always cooperate” messages 
(although, of course, both individuals play both roles 
simultaneously). In this example, Sender’s PC is .75, 
meaning that it is sending 75 true messages and 25 
lies, with each set of messages distributed (normally) 
around Sender’s manipulative capacities for true and 
false messages, respectively; in the example, those ca- 
pacities are .40 for true messages and .30 for false ones, 
making our example Sender somewhat better at per- 
suading with respect to truths than to lies The best 
outcome for Receiver would be to reject all Sender’s 


6 In natural circumstances there 1s likely to be some doubt about 
all these values, and an adaptive problem in our ancestral past— 
as now—is to evaluate accurately just what vs at stake in various 
games, as well as what particular game is a possibility here. Indeed, 
it seems likely that humans and other social animals have evolved 
special-purpose, domain-specific cognitive mechanisms for facilitat- 
ing accurate such evaluations (Gigerenzer 1997). Given our mterest 
in the relationship between Machiavellian intelligence and coopera- 
tive dispositions, however, we can make the simplifying assumption 
of perfect knowledge ın these respects. 
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FIGURE 1. An Example of How nitive and 
Dispostitional Attributes Interact In Recetver's 
Declslon-MakIng 


Sender's values PC= 75, Mean ManpolaieF aise = 3, Moan MeanrpuleteTroe = 4 
Recetver’s valnes bistrost = 60, Recerver’s Mindreeding = 5 
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Note: Although this Sender's 75 true statements are somewhat 
more persuasive than Its 25 false ones (.40, In comparison with 
.30), with no mindreading being exercised no message In @- 
ther rte true or Its false sets falls above Receivers .60 mistrust 
threshold. Recerver would, then, assess Sender's PC as zero. 
After mindreading ls exercssed, most—although not all-—of this 
Sender's true statements fall above Recetver's Mistrust thresh- 
old. Those that do are used to define Receivers estimate of 
Sender's PC; In this case, that estimate is somewhat less than 
the true .75 Of course, each of these attributes ts likety to vary 
among individuals, with varying consequences for the accuracy 
of such estimates. 





lies while accepting all its truths, but this receiver’s mis- 
trust threshold is set at .60, meaning that it will only 
accept messages that are above that threshold (absent 
mindreading, in the example, none). Absent mindread- 
ing, therefore, this receiver would underestimate this 
sender’s PC, believing it to be zero rather than the true 
value of .75. 

In general, any positive mindreading capacity on Re- 
ceiver’s part will lead it to read Sender’s true messages 
as more believable than otherwise (graphically, farther 
to the right) and to read its lies as less believable than 
otherwise (farther to the left). In the Figure 1 example, 
Receiver’s mindreading capacity of 5 leads it to read 
Sender’s 75 true messages as having a mean believabil- 
ity of .7 (.5 of the distance between their manipulated 
persuasiveness of .4 and being fully believable at 1.0) 
and will lead it to read Sender’s 25 lies as .15 believable 
(.5 of the distance between their manipulated persua- 
siveness of .3 and being fully unbelievable at 0.0). A per- 
fect mindreader would have done better still, reading 
all Sender’s true messages as 1.0 believable and all of its 
lies as 0.0 believable. In the example, Receiver’s level of 
generalized mistrust is set at .60, modestly high but not 
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completely mistrustful. After Receiver has exercised its 
(also modest) mindreading capacity, its mistrust thresh- 
old is low enough that it accepts as true perhaps 60 
of Sender’s 75 true messages, thus estimating Sender’s 
PC as .6. Of course, any combination of the various 
attributes is possible. 

The bottom line, however, is the decision that Re- 
ceiver makes using the information that, for better or 
worse, it has now gathered. Using standard Prisoner’s 
Dilemma notation (footnote 4) and defining estPC as 
the number of Sender’s 100 mindreading-adjusted mes- 
sages falling above Receiver’s own mistrust threshold, 
if Receiver draws cooperation from its PC it will enter 
a PD when 


estPC(c) + (1 — estPC)(s) > ALT, 


but otherwise will choose ALT. And should it draw 
defection, it will enter a PD when 


estPC(t) + (1 — estPC)(d) > ALT, 


but otherwise will choose ALT. In the particular case, 
of course, Receiver’s choice will depend importantly 
on the PD payoffs and the value of ALT. 

Whether or not Receiver’s decision will be profitable 
(adaptive) depends on the quality of the information it 
has gathered in this encounter and on the decision that 
the other individual—in its role as receiver—makes at 
the same time. Notice, however, that Receiver’s deci- 
sion is modeled as a rational one, granted the quite 
probably imperfect information (Simon 1985) it has 
gathered about Sender’s intentions. Within an evolu- 
tionary frame where payoffs are in units of fitness, this 
is entirely appropriate since “rationality” equates with 
“fitness maximization,” and individuals whose choices 
are not geared in that direction will not last long. But 
is it inconsistent to model individuals’ decisions be- 
tween entering PDs and playing ALT as rational but 
to model the decisions of those same individuals within 
PD games as produced by “dispositions” with no basis 
in rationality? 

It is not—granted that our concern is with the con- 
sequences of natural selection on such dispositions. As 
emphasized above, within an evolutionary framework, 
the importance of such dispositions is whether or not 
they contribute to an individual’s adaptive success in 
the context of decisions about whether or not to play 
particular games, Rationality generally assumes that in- 
dividuals have no innate dispositions. toward either co- 
operation or defection, basing their decisions entirely 
on recognizable gains and losses. But whether cooper- 
ative dispositions will grow and prosper or be selected 
against and vanish within a Machiavellian world is pre- 
cisely what we are investigating here. We will return to 
the issue of rationality insofar as it bears on evolution- 
ary modeling later in the paper. 

l 


Selection 


In the simulation, members of a population encounter 
each other during a generation, with the gains or losses 
from those encounters functioning as “units of evolu- 


tionary fitness.” All the members of a generation die 
after the specified number of encounters has been com- 
pleted, but the more successful among them reproduce, 
with their offspring carrying their various cognitive and 
dispositional attributes into the next generation, sub- 
ject to mutation. Specifically, individuals whose wealth 
falls below zero at the end of their generation die with- 
out reproducing, those whose wealth is positive but 
below twice the median reproduce once, those whose 
wealth is between twice the median and below three 
times the median reproduce twice—and so on. Carry- 
ing capacity of the ecology is limited; should the num- 
ber of offspring exceed that capacity, a random lottery 
determines which particular agents populate the next 
generation.’ Generations can involve any number of 
encounters, but in the simulations we report below each 
agent encounters each other agent twice in their partic- 
ular generation, once as “Alpha,” who chooses whether 
or not to offer a PD game, and once as “Beta,” whose 
choice is limited to accepting vs. not: accepting a PD 
offer, should one be made. If Alpha chooses not to 
offer a PD, or if Beta rejects such an offer, both make 
ALT. 

On reproduction, mutation on all a parent’s disposi- 
tional and cognitive attributes—PC, mindreading, mis- 
trust, Manipulate True and Manipulate False—is a pos- 
sibility. Because all those attributes are modeled as 
probabilities, mutation of fixed magnitude could eas- 
ily have reached a limiting point beyond which further 
mutation in a particular direction would not be pos- 
sible. Our solution was to define each attribute from 
two “constituent” integers above zero, one facilitating 
and one inhibiting the attribute in question. Thus, for 
example, PC is constructed from the integers “cooper- 
ate plus” (a positive disposition toward cooperation) 
and “cooperate minus” (a negative such disposition), 
with the individual’s actual PC being defined as the 
proportion 


cooperate plus/(cooperate plus + cooperate minus). 


Similarly, “mistrust” is constructed from integers defin- 
ing a disposition not to believe that others have coop- 
erative intentions (“skepticism”) and a disposition to 
believe that they have such intentions (“credulity”), 
with the attribute itself being defined as the proportion 


skepticism/(skepticism + credulity). 


A similar approach is taken with respect to mindread- 
ing and the two manipulative capacities. Mutation on 
each of the integers from which a parent’s attributes 
are constructed can thus affect the values of attributes 
an offspring inherits, with potential consequences— 
neutral, adaptive, or maladaptive—for that offspring’s 
success within its own generation’s social environment. 


7 Carrying capacity is thus a constraint preventing population ame 
from increasing indefinitely—as could happen if natural selection on 
social success were the only limiting factor. The random lottery 1s, 
thus, only conducted among individuals who are already successful 
in terms of natural selection In natural circumstances, of course, 
immigration to unpopulated areas might have been possible for many 
of our early ancestors. 
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Mutation happens on such “constituent” attributes 
with the parameters “probability” and “magnitude.” 
That is, there is some fixed probability with which any 
given parental attribute will mutate on being transmit- 
ted to an offspring, and some fixed magnitude (positive 
or negative) of those mutations that do happen. In the 
simulations to be reported, probability is set at 0.05 and 
magnitude at 5.0— meaning that either of the two inte- 
gers from which each of an offspring’s several attributes 
is constructed could differ from those of its parent with 
a probability of 0.05, and by a magnitude of plus or mi- 
nus 5.0.8 Note that although the number of offspring is 
influenced by a parent’s relative success in its own gen- 
eration, the only source of change in attributes passed 
from a successful parent to its offspring is mutation thus 
specified. The model does not, therefore, involve any 
parent-to-offspring learning and, certainly, no Lamar- 
ckian inheritance. 


Experimental Design 


In these terms, the objective is to observe how 
population-level values of the several individual at- 
tributes change through successive generations and the 
selective processes by which such changes happen. In 
particular, we are interested in the possibility that se- 
lection on manipulation and mindreading influences se- 
lection on cooperative dispositions—and, of course, the 
frequency of whatever cooperative behavior follows. 
We start each simulation with a population of 
individuals whose cognitive and dispositional at- 
tributes militate strongly against PD games being 
played—specifically, PC=.10, Mistrust=.90, Ma- 
nipulateTrue = .10; ManipulateFalse = 10, and Min- 
dread =.10. At least until mutation has had a chance 
to change population attributes, agents will send each 
other “I will always cooperate” messages that are 
mostly lies; they will have little capacity to be persua- 
sive (either in their many lies or in their few truths), 
they will have little capacity to penetrate to the truth of 
particular messages; and they will be so generally mis- 
trustful that even very persuasive claims will seldom be 
accepted. Agents in this initial generation are, in fact, 
substantially nonsocial insofar as they lack the ability to 
engage in collective action, and they are also substan- 
tially nonpolitical insofar as their Machiavellian capac- 
ities are underdeveloped. Any cooperative equilibria 
that might emerge from this (perhaps “Hobbesian”) 
world will have to be explained, but so will the process 
by which any initial “leap” out of this world happens. 


FINDINGS 


Our initial exploratory runs suggested two possibilities. 
(1) Selection on Machiavellian intelligence can both 


8 Given that there are 10 components on which mutation might 
happen, these parameters mean that there 1s a 599 probability of 
any given offspring having no mutation on any of the components 
it inherits from its parent Sensitivity testing shows that increasing 
or decreasing these parameters does have predictable effects on the 
speed with which mean population attributes change but not on the 
basic pattern of findings we report. 


produce and sustain transitions from such a “coopera- 
tion and PD unfriendly” founding population to later 
populations with very high mean PC values and very 
frequent mutually cooperative PD games, (2) Such “co- 
operative transitions” happen almost exclusively under 
the parameters 0 < ALT < c. Our subsequent analysis 
focused more systematically on exploring these possibi- 
lities and on identifying the processes underlying them. 

Our approach was to run multiple simulations 
with PD parameters constant (t = 15; c = 5; d = —5; 
s = —15) but varying ALT. We first confirmed the 
absence—more accurately, very low incidence—of co- 
operative transitions outside the parameter range 0< 
ALT < c. In retrospect, the reasons for that pattern are 
clear. When the payoff from ALT is higher than from 
mutual cooperation, only an individual who intends 
defection and is convinced that a potential partner will 
cooperate will rationally enter a PD game in preference 
to ALT. Although one competent mindreader might 
make that assessment, both parties must agree before 
a PD is joined, and the probability of both deciding the 
same thing about the other, though not zero, is slim— 
and becomes still more so as ALT approaches ¢. 

At the opposite extreme, when ALT < 0, even a low 
estimate of the other’s PC might return an expected 
value for entering a PD that is higher than ALT, and 
because cooperation is dominated by defection, such 
an estimate will be particularly likely for intending 
defectors—probabilistically, individuals with low PC 
values. Estimating the other’s PC at 0.4, for example, 
an intending defector would calculate the EV of a PD 
as 3, whereas an intending cooperator would calculate 
it at —7. Trapped between two alternatives both offer- 
ing a loss, the intending cooperator—probabilistically, 
an individual with a high PC value—cannot prosper, 
meaning negative selection on high PC. But a popula- 
tion of low PC agents produces, for the most part, either 
PDs played to mutual defection (a negative) or ALT 
(also a negative in this range) and will rapidly die out.’ 

With this understanding, we ran 90 simulations, each 
consisting of 20,000 generations, each with the same 
PD payoffs but including 10 at each 0.5 interval across 
the range 0 < ALT < 5. The results are reported in 
Table 1. First, the data confirm that transitions from 
overwhelmingly ALT choices to overwhelmingly PD 
games played with mutual cooperation are very fre- 
quent within that parameter range. Although there are 
slightly fewer transitions toward the lower end of that 
range, across that whole range transitions fail to happen 
in only 9 of the 90 simulation runs. Within each of the 
nine ALT categories, there is considerable variation as 
to when cooperative transitions begin, but such tran- 
sitions normally do take several thousand generations 
to get under way (column 3). Nevertheless, and despite 
occasional lapses that are reversed with the passage of 





9 With different PD parameters the population might not die out; if 
mutual defection were a positive, for example, a population of very 
low PC types could survive indefinitely even with a negative ALT. The 
important point is that, with a negative ALT, cooperative dispositions 
will be strongly selected against, and cooperative behavior will be 
extremely rare—at best. 
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TABLE 1. 


Cooperative Transitions within the Parameter Range 0 < AlT<c 























ALT ‘Number of Cooperative Mean Generation Where Mean PC Value After Predicted 

Value Transitions Transition Starts the Transition Threshold PC 
(1) _(2) (3) (4) _(5) 
4.5 9/10 6535 0.981 0.975 
4.0 10/10 5923 0.965 0.950 
3.5 10/10 5983 0.940 0.925 
3.0 9/10 5924 0.926 0.900 
2.5 9/10 6046 0.898 0.875 
2.0 10/10 4474 0.897 0.850 
1.5 8/10 3398 0.858 0.825 
1.0 8/10 4040 0.833 0.800 
0.5 8/10 5566 0.815 0.775 























Note Based on 10 simulations for each value of ALT. 





time, the 81 cooperative transitions all produced quite 
high mean PC values (column 4) that persist in appar- 
ent equilibrium until the simulations end at the twenty 
thousandth generation.!° 

Further, those high PC values are reflected in actual 
behavior. We illustrate this with an example when ALT 
was set at 4. Figures 2a, 2b, and 2c show, respectively, 
mean levels of PC across the 20,000 generations of this 
simulation; the incidence of ALT outcomes vs. joined 
PD games; and, among the PD games that were joined, 
the incidence of mutual defection, mutual cooperation, 
and the outcome in which one defecting individual ex- 
ploits the other. From Figure 2a, the transition to high 
levels of PC begins at generation 5,319 after what ap- 
pear to be a couple of “false starts” but—after a thou- 
sand or so generations of some fluctuation—has settled 
down to a basically stable state around PC = .97 with 
only minor fluctuations from generation to generation. 
After the transition, in other words, this population is 
very disposed to cooperation, granted not perfectly so. 

From Figure 2b, we see that these cooperative dis- 
positions are paralleled by a willingness to enter PD 
games and a corresponding decline in the incidence of 
ALT outcomes.” There is some instability from gener- 
ation to generation and one occasion (around genera- 
tion 15,000) when the preponderance of PD games in 
the ecology is:temporarily challenged by ALT, but the 
reversal in the overall pattern of game choices after the 
transition is clear. 

Finally, from Figure 2c we see that the PD games 
that dominate relationships among individuals after the 
transition have, overwhelmingly, mutually cooperative 
outcomes. The one fluctuation that does occur is as- 
sociated with the incidence of PD games, not with any 
major fluctuation of within-game behavior, and mutual 
cooperation remains the characteristic outcome to PD 


10 More extensive testing showed that the general pattern of cooper- 
ative transitions happening only within the specified parameter range 
was robust—when, for example, the mitial world was cooperation- 
friendly and across wide variation in the absolute values of PD 
yoffs. 

[i The lack of a perfect negative match between PD and ALT out- 
comes is explained by there two ways the latter outcome can 
happen—when Alpha (the first mover) rejects the PD option and 
when Beta (the second mover) rejects a PD offer from Alpha. PD 
games, on the other hand, require both parties to agree. 






games throughout this period. Importantly, notice that 
the “trickle” of PD games that we observe after about 
generation 1,000 (Figure 2b) is identified here as being 
games played almost exclusively to mutual defection. In 
fact, only two mutually cooperative games were played 
during the 5,318 generations before the cooperative 
transition got under way. We will return to the signifi- 
cance of this “trickle” of mutually defecting PD games 
shortly. In general, however, these data show that the 
rapid change in cooperative dispositions is associated 
with a corresponding change in behavior from over- 
whelmingly ALT choices to overwhelmingly joined PD 
games and, within those games, frequent cooperation. 
This granted, the problem is to explain (a) the origin 
of such cooperative transitions and (b) the processes 
by which they are maintained at what appears to be 
equilibrium. 


Orlgins 


To find answers, we examine mean levels of the three 
Machiavellian attributes (ManipulateTrue, Manipu- 
lateFalse, Mindread), the generalized disposition to 
mistrust, and cooperative dispositions, as those varied 
with PD games played. Although population changes 
are the ultimate concern of evolutionary analyses, ag- 
gregate data do not speak directly to the selective pres- 
sures on individuals. Therefore, we will also report cor- 
relational data at the individual level during adaptively 
important periods. 

Of greatest initial interest, Figure 3 shows, for the 
case being followed, a substantial step-level, upward 
change in mean population mindreading values at 
around generation 2,000, a change that is plausibly a 
response to the costs of becoming involved with PD 
games when the mean PC is very low, thus the proba- 
bility of taking a negative from such a game is high— 
whether as a (rare) cooperator or as a (more frequent) 
defector. A strong test of that hypothesis is provided by 
observing the relationship between individuals’ min- 
dreading scores (independent variable) and the pro- 
portion of PD games that, if offered to an individ- 
ual, were accepted, across the 100 generations between 
1,990 and 2,090 when the increase in mindreading was 
most marked and the variance in mindreading was also 
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FIGURE 2. ACoo 


Transition When ALT = 4—Mean PC and Behavioral Outcomes: (a) Mean 


Levels of PC, (b) ALT vs. PD Outcomes to Encounters, and (c) Outcomes to Jolned PD games 


(a) 








highest.!? The result was 6 = —0.1593 (p < .001), sup- 
porting the hypothesis that selective pressures in this 
nasty period were favoring individuals with mindread- 
ing capacities sufficient to keep them from accepting 
invitations to enter PD games—that, if entered, would 
have resulted in loss. 


12 Data points are each of the 50 individuals in each of the 100 gen- 
erations, for an 7 of 5,000. 
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Notice that positive selection on PC values in this 
early, cooperation-unfriendly stage is not likely because 
so few PD games are being played—with those that 
are played resulting in costly mutual defection. But 
what happens to individuals whose PC does happen 
to mutate upward? We can predict, first, that such “co- 
operative mutants” will be spotted as potential victims 
by mindreading-equipped low-PC types, and the data 
support this: In the case being followed, for example, 
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FIGURE 2—Continued. 
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the relationship between agents’ own PC values (in- 
dependent variable) and the number of PD games 
offered to them by others for the 100 generations im- 
mediately prior to the start of the cooperative transi- 
tion was $ = 36.3824 (p < .0001). The same mindread- 


ing capacity that evolved to address the prophylactic 
function of keeping agents out of costly PD games in 
a cooperation-unfriendly world also equips them to 
recognize and try to exploit occasional high PC types. 
But mindreading cuts two ways, and Table 2 reports 








FIGURE 3.; A Cooperative Transition When ALT = 4—Mindreading 
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TABLE 2. Multiple Regression Beta Weights: 
Proportion of Offered PD Games Accepted by 
Mistrust, Mindreading, and PC 











Coefficient 
Intercept 1.2216 
Mistrust —0.6821* 
Mindreading —0.9414* 
Probability of cooperating —0.0641* 
Note: One hundred generations prior to the cooperative trans+ 


tlon. A? =.557; * P < .001; N=4,978. Observatons are on 50 
agents in each of the 100 generations; missing cases had zero 
offers. 








coefficients from a multiple regression predicting, from 
the PC, mindreading, and mistrust scores of such tar- 
get agents (independent variables), the proportion of 
PD offers that, once made, were accepted by agents. 
Clearly, the mindreading capacities of such occasional 
“cooperative mutants” makes it possible for them to 
recognize and avoid such traps—thus, for their rela- 
tively higher cooperative dispositions to be passed on 
to any offspring they might have. 

Granted, sustained upward drift of PC values will re- 
quire low-probability upward mutations on the PC val- 
ues of at least several members of a lineage, something 
that is not likely for any particular lineage. But min- 
dreading does make such “cooperative drift” at least 
possible and, with the passage of many generations, 
quite likely on at least some lineages. And this is what 
sets the stage for the observed cooperative transitions. 

We have followed the individual-by-individual de- 
tails across several different values of ALT and—with 
due allowance for the variation resulting from the many 
stochastic elements in the simulation—the basic pat- 
tern is always the same. At some point, two relatively 
cooperative agents, both equipped with high mindread- 
ing, each recognize the other as a good bet for a PD 
game, and with one offering and the other accepting, 
both capture the mutual cooperate payoff With other 
agents all rejecting PD games in favor of ALT (where 
ALT <c), these two enjoy relative reproductive suc- 
cess. Their offspring, inheriting the high PC and high 
mindreading that made their parents successful, have 
similar success in their own generation until, quite 
rapidly, the whole population is descended from this 
original pair. 

Because this process relies on randomly drifting PC 
values, such cooperative transitions are not inevitable 
within this parameter range. Quite possibly, no two 
individuals will ever reach high enough PC levels to 
be seen as attractive partners by each other. In fact, 
as we have pointed out, transitions failed in nine of 
the 90 simulation runs we conducted. By the same 
token, some transitions might take many more gen- 
erations to happen than others. But the process we 
have described makes cooperative transitions likely to 
happen—sooner or later. 

Returning to Table 1, it is apparent that the level at 
which PC equilibrates varies monotonically with ALT, 
as ALT becomes higher within the critical parame- 
ter range, mean PC values after the transition also 
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become higher. Because natural selection can be as- 
sumed to “discover” attribute levels that are optimal 
for particular environments, it appears that optimal lev- 
els of cooperative dispositions are positively related to 
ALT. This is perhaps surprising, as one might expect 
cooperative dispositions to be higher as the relative 
advantage of mutual cooperation over other ways of 
making a living becomes greater, but just the opposite 
happens. 

The issue is resolved by recognizing that two crite- 
ria must be met in order to maximize returns in this 
world. First, an agent must persuade another to enter a 
PD game rather than to play ALT; and, second, under 
that constraint, it must maximize its own returns within 
any PD game that might be joined. Given the potential 
victim’s (now) probably high mindreading capacities, 
meeting the first criterion requires the agent to have 
genuinely high cooperative dispositions; when ALT is 
high, a potential victim can afford to be fussy about 
the cooperative dispositions of would-be victimizers. 
But the would-be victimizer’s cooperative dispositions 
should be no higher than minimally necessary to get 
the potential victim into play. Being any more coop- 
eratively disposed will result in lost opportunities for 
profitable free riding, placing the agent at an evolution- 
ary disadvantage relative to others who do manage to 
enter PD games with cooperative individuals but who 
do, nevertheless, defect. 

This analysis suggests: Natural selection will “dis- 
cover” levels of cooperative dispositions that are simulta- 
neously high enough to ensure that prospective partners 
assess the expected value of entering PD games as greater 
than ALT, but low enough to maximize the possibility of 
exploiting partners should a PD game be joined. For a 
population of perfect mindreaders, this logic predicts a 
threshold PC for each value of ALT (just) above which 
a population’s PC values will equilibrate—specifically 
where PC(c) + (1 — PC)(s) = ALT. In these terms, col- 
umn 5 in Table 1 specifies such thresholds for the nine 
values of ALT that we have examined across the param- 
eter range 0 < ALT <c, and Column 4 shows that, as 
predicted, the mean levels at which cooperative disposi- 
tions equilibrate after transitions are (1) monotonically 
related to values of ALT within this range and (2) only 
slightly above the threshold in each case—the slight “er- 
ror” in each case being, presumably, a consequence of 
minor mistakes in mindreading that still do happen.’* 


13 Of course, in this situation two individuals encountering each other 
are both would-be victrmizers and potential victims. 
14 Although transitions do frequently happen across all values of 
ALT within this parameter range, case-by-case and generation-by- 
generation micro analyses of the process by which they happen show 
that there are many more “false starts” at the lower values of ALT 
where several lineages normally appear about to dominate the entre 
population but che out before doing so. Whereas one normally does, 
sooner or later, do so, the reason for such false starts 1s the permussive- 
ness that low values of ALT provide for relatively defecton-inclined 
individuals who are more likely to attempt exploitation of each other 
than their (necessarily) more cooperative counterparts when ALT is 
y, the permissiveness provided for defection at 
very low values of ALT makes cooperative transitions, when they do 
happen, more unstable than at higher such values, in a very small 
number of cases allowing them to collapse altogether. 
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FIGURE 4., A Coo 
“Mindreading-Adjusted” Truths and Lies 
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What, then, sustains such cooperative transitions once 
they have happened? From Figure 4, we observe a rapid 
drop in mistrust as the cooperative transition is getting 
under way (generation 5,319). This is explained in terms 
of competition among individuals in the newly cooper- 
ative population. In this “nice” world, advantage will go 
to those who enter more PD games than their adaptive 
competitors, and with mindreading already high, this 
means selective advantage to those who have relatively 
low levels of mistrust. The result is that, for a few thou- 
sand generations, the posttransition population is not 
only highly disposed to cooperate, but also very low in 
mistrust. 

But such fine dispositions in combination with 
high PC values make this population doubly ripe for 
exploitation by low PC types, thus for an eventual re- 
version to generally low PC values. The reason why 
such a reversion does not happen involves the relation- 
ship between mean levels of mistrust and mindreading. 
First, from 4 we can see that the drop to very 
low levels ofimistrust only lasts a few thousand gen- 
erations after the transition has happened. Although 
that drop did facilitate individuals’ entering now gen- 
erally cooperative relationships, it also opened the door 
for relatively, less cooperative individuals to prosper 
and spread, with that, in turn, selecting for individuals 
with somewhat higher mistrust, as reflected in the mean 
values in Figure 4. The adaptive problem is to find an 
optimal level of mistrust, given widespread mindread- 
ing capacities but also the substantial gains that are 
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now available from entering PD games in this generally 
cooperative world. 

The solution “discovered” by natural selection is a 
balance between mistrust and mindreading, with mis- 
trust high enough to ensure that lies are rejected but 
low enough not to endanger acceptance of the true mes- 
sages that predominate in this posttransition, generally 
cooperative world. Figure 4 illustrates how this hap- 
pens using the case we have been following. Granted 
the elevated mindreading that is now normal, receivers 
accept senders’ true messages with a higher probabil- 
ity than the persuasiveness of those messages would 
imply, but they also accept Senders’ lies with a lower 
probability than the persuasiveness of those messages 
would imply. Critically, mean levels of mistrust evolve 
upward to a point that is high enough to ensure that 
Senders’ lies are rejected, thus precluding invasion by 
lower PC types. There are two brief periods where the 
mean “mindreading-adjusted persuasiveness” of lies 
moves above mean mistrust, and the consequences of 
that are visible in the brief drop in the incidence of 
cooperatively played PD games (Figures 2b and 2c). 
But in both cases this drop is only temporary and the 
cooperative equilibrium returns. 

The reason for this equilibrium is that the most adap- 
tive configuration of cognitive and dispositional at- 
tributes is one that will maintain frequent access to PD 
games played with cooperators. But in general, advan- 
tage will go to those whose mistrust is high enough, in 
conjunction with their mindreading capacity, to ensure 
rejection of most, if not all, lies that are directed toward 
them, while being low enough to ensure acceptance of 
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most, if not all, the truths that are also directed toward 
them. Those who best strike such an optimal balance 
will be best positioned to prosper within a population 
of individuals who are, in general, strongly disposed to 
cooperative behavior but who nevertheless sometimes 
do defect. 

In summary, in the initial cooperation-unfriendly 
world, individuals can occasionally be led to enter PD 
games that are played to costly mutual defect outcomes, 
thus selecting for high mindreading capacities. Once in 
place, those capacities permit upward drift on coopera- 
tive dispositions. Individuals with such dispositions are 
the frequent targets of exploitative attempts by those 
who are more defection-inclined, but their mindread- 
ing allows them to survive (via ALT choices) no dif- 
ferently from others in the population. At some point, 
however, two such high PC individuals are likely to play 
a PD game with each other and to do so cooperatively— 
and, given that ALT < c, they will prosper by compar- 
ison with the rest of the population. Quite rapidly, the 
descendants of these individuals populate the entire 
ecology. The equilibrium level of cooperative disposi- 
tions after such a transition is high enough to attract 
skilled mindreaders into PD games but low enough 
to extract some gains from exploiting them once they 
have entered. Transitions are sustained by selection in 
favor of individuals who “discover” an optimal mixture 
of mindreading and mistrust—sufficient to ensure that 
they accept the true “I will always cooperate” messages 
characteristic of this environment but low enough to 
ensure that they reject the false messages that are, nev- 
ertheless, still being sent. 


DISCUSSION 


As is often pointed out, humans’ ancestors were almost 
certainly cooperative animals well before the point at 
which our line diverged from that of other large pri- 
mates (e.g., Caporael et al. 1989), meaning that our 
“cooperation-unfriendly” starting world is best under- 
stood as a convenient analytic device for showing how 
cooperative dispositions could evolve despite adaptive 
pressures to the contrary. And if cooperative transitions 
resembling those we have discussed ever did happen, 
there can be no implication that they happened with 
anything approximating the speed across natural gen- 
erations that we observe in the “generations” of our 
simulation. Any natural cooperative transitions could 
well have taken hundreds of thousands of years. 

All we claim to have shown is that dispositions to 
cooperate can evolve and be sustained at equilibrium 
as a direct consequence of selection on Machiavellian 
capacities for manipulating the content of messages 
sent to others and for mindreading to the truth un- 
derlying others’ attempts at manipulation. As sketched 
above, the literature on social evolution has identi- 
fied a number of plausible evolutionary paths to co- 
operative behavior, the best known being kin selection 
(Hamilton 1964), reciprocity (Trivers 1971), and group 
selection (Sober and Wilson 1998). And the broad 
idea that humans and other highly social animals have 
evolved “Machiavellian” capacities supporting their 
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capacity to successfully negotiate the competitive chal- 
lenges of group life is a widespread and peculiarly fer- 
tile one. To our knowledge, however, there has been 
no model of how our cooperative dispositions and our 
Machiavellian capacities for manipulation and min- 
dreading might be functionally related, and that is the 
gap we have attempted to fill here. 

Although the problem of cooperation has a long 
analytic history in political science, the idea of 
Machiavellian intelligence—that we are a “political an- 
imal” in our cognitive design as well as in the fact that 
we relate to each other in groups—is less well known, 
and we believe that it deserves greater currency. This is 
certainly the case if, as our data suggest, Machiavellian 
intelligence and cooperative dispositions have an inter- 
twined evolutionary history. 

Our model is based on natural selection acting on 
individual cognitive and dispositional attributes, but 
the processes it identifies could interact with cul- 
tural evolution as modeled, for example, by Boyd and 
Richerson (1985). As we have shown, cooperative tran- 
sitions occur when a pair of agents with high cooper- 
ative dispositions and well-developed mindreading ca- 
pacities recognize each other as offering a good PD bet 
and, joining such a game, both cooperate and prosper 
accordingly. In our pared-down world, however, agents 
do not have the capacity to learn from others’ expe- 
rience, to adapt their behavior by observing others’ 
success, but such a capacity certainly does exist among 
humans and, indeed, among some other primates (see, 
e.g, de Waal 2001). Incorporating that capacity, we 
might see the success of cooperative behavior among 
others being recognized by individuals not personally 
disposed in that direction, with the behavior spreading 
by imitation—independent of the cognitive and dispo- 
sitional evolution that concerns us here. 

This raises many further possibilities. For example, 
populations could evolve to include some individu- 
als whose cooperation is a consequence of genetically 
based cooperative dispositions and others whose coop- 
eration is a consequence of social learning coupled with 
a “strategic” recognition that cooperative behavior will 
work for them as it has for others. More interesting, 
perhaps, we might see individuals appearing whose co- 
operative behavior is a product of both innate dispo- 
sitions and such social learning. Social learning could 
also reveal opportunities for Machiavellian exploita- 
tion that would not be recognized in its absence, fur- 
ther accelerating the evolutionary arms race between 
manipulation and mindreading. And benefits received 
as a consequence of social learning could select for 
the (presumably genetically based) capacity for social 
learning—perhaps in an upward spiral involving that 
capacity, cooperative dispositions, and Machiavellian 
manipulation and mindreading. But these are specula- 
tions, taking us well beyond our more modest present 
concern with the evolutionary relationship among just 
the latter two attributes. 

The idea of rationality is presently under dispute 
in the discipline (Green and Shapiro 1994), but the 
evolutionary model proposed here allows us to dis- 
tinguish between two versions of rationality, perhaps 
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contributing'to a reconciliation between the idea of 
rationality and the empirical data on cooperation— 
mentioned above—that has featured in that dispute. 

Rationality in action: This is the standard way in 
which rationality is employed in political science and 
related disciplines, notably economics, Essentially, indi- 
viduals choose so as to maximize their private welfare— 
under a variéty of constraints, most importantly, on in- 
formation. Within our simulation, this is how we have 
modeled individual actors choosing between entering 
PD games and playing ALT. Their choices are con- 
strained by others’ success at manipulation and by their 
own limits with respect to mindreading, but they do the 
best they can. And, we believe, that is how individuals 
should be modeled in an evolutionary simulation such 
as ours. As Alchian (1950) long ago pointed out, the 
idea that individuals choose as if they were rational 
in this classic sense can be employed as a profitable 
fiction as long as interest resides in equilibria that are 
produced through an evolutionary process that selects 
for fitness maximizing behavior. Remembering that 
payoffs in our evolutionary simulation are “units of 
fitness,” individuals who do not act in such a manner 
would rapidly be selected out, making it reasonable 
to start from this model of action, even if it were in 
no way parallel to the actual deliberative or cognitive 
processes that ancestral populations employed in their 
social decision making. 

Yet, as we have pointed out, this is not how we 
have modeled choices by those same individuals within 
any PD games that might be joined. Those are mod- 
eled as random draws from a probability distribu- 
tion in ordér to capture the idea of cooperative 
dispositions that have no necessary basis in the indi- 
vidual’s calculation of self-interest or, indeed, in its 
actual self-interest. By this emphasis on dispositions 
unrelated to: calculations of current interest, we are 
departing from the tradition in political science and 
elsewhere that seeks solutions to the “problem of coop- 
eration” in processes that are founded in “rationality in 
action.” 

Rationality in Design: Here rationality refers to the 
adaptive fit between some designed apparatus and the 
environmental problems that apparatus is intended to 
solve—an idea usefully captured within an evolution- 
ary context by Tooby and Cosmides’ (1992) metaphor 
of an appropriately designed key being one that opens 
a particular lock. As these authors propose, a focus 
on design requires asking a series of engineering ques- 
tions, most importantly about the correspondence be- 
tween the problem to be solved and the mechanism 
“designed” by natural selection to solve it. Granted that 
a particular adaptive problem was a repetitive part of 
the ancestral environment, what design solutions has 
natural selection produced in response? And thinking 
as a “reverse engineer” (Dennett 1995), What adaptive 
pressures in the ancient environment are most likely to 
have led natural selection to “design” particular com- 
plex structur¢s—presumed “adaptations” —that we ob- 
serve today? 

This “rationality in design” approach lets us address 
the empirical fact of frequent cooperative behavior that 
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is anomalous in terms of rationality in action. Our sim- 
ulation suggests that the most adaptive configuration of 
cognitive and dispositional attributes—the most rational 
design response by natural selection to the problems of 
group living—is strongly but not perfectly cooperative 
dispositions, a modest but not particularly high level of 
mistrust, and a substantial ability to mindread as well 
as to manipulate. Such a model not only is consistent 
with the laboratory data about cooperation, but also 
has the advantage of explaining cooperative disposi- 
tions squarely within the broader intellectual (and in- 
terdisciplinary) enterprise seeking an evolutionary un- 
derstanding of humans’ cognitive adaptations for social 
life, including Machiavellian capacities. 

This model is not inconsistent with some individu- 
als having only very weak dispositions to cooperate, 
just as it is not inconsistent with some having unusu- 
ally strong, even perfect, such dispositions. Individ- 
ual differences aside, however, the model addresses 
species-typical attributes, suggesting that, as a default, 
we should expect social animals—most interestingly, 
of course, humans—to be quite strongly disposed to 
cooperative behavior, as well as modestly trusting and 
reasonably adept at both manipulation and mindread- 
ing. How individuals’ cooperative dispositions might be 
undermined or reinforced by the incentive attributes of 
the present situation is, of course, a different matter. 

The availability of an alternative to playing prisoner’s 
dilemma games as a way of gathering resources is criti- 
cal to our model, and our finding that cooperative tran- 
sitions only happen within the finite range 0 < ALT <c 
invites further study. Whereas we believe that the logic 
by which transitions are so confined is clear, the more 
difficult question concerns the natural world circum- 
stances that might embody that logic—or, more ac- 
curately perhaps, might have embodied that logic in 
the ancient environment. Most interesting, we think, is 
the finding that cooperative dispositions evolve to their 
highest level when the payoff from alternative ways of 
gathering resources most closely approaches the payoff 
from mutual cooperation without actually exceeding it. 
This appears counter-intuitive at first; one might expect 
cooperation to evolve to its highest levels when there is 
the most to be gained from mutual cooperation relative 
to alternative courses of action. But as the argument 
about particular equilibria of cooperative dispositions 
suggests, a greater distance between ALT and c just 
allows more “room” for noncooperative dispositions 
to evolve. 

We recognize, of course, that the ancestral environ- 
ment that shaped our modern cognitive apparatus did 
not involve a single set of payoff parameters (as do 
particular runs of our simulation), but a substantial di- 
versity of particular parameters, making that environ- 
ment best thought of as a statistical average across that 
diversity (Daly and Wilson 1999; Tooby and Cosmides 
1990). Nevertheless, our model does provide a basis for 
hypothesizing that sociality itself—a willingness to en- 
ter PD-type games coupled with a strong disposition to 
play such games in a cooperative manner—evolved 
toits highest levels when there were only marginal gains 
to be had from jointly cooperative action in comparison 
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with “going it alone.” The problems associated with 
sociality became more acute as relatively higher pay- 
offs became available from cooperative activities, thus 
permitting the evolution of at least somewhat more am- 
bivalent cooperative dispositions. 
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The Paradox of “Warlord” Democracy: A Theoretical Investigation 
LEONARD WANTCHEKON New York University 


olitical theorists from Machiavelli to Huntington have denied the possibility of popular government 

arising out of the chaos of civil war, instead prescribing an intermediate stage of one-man rule 

by a Prince, Leviathan, or a military dictator. Based on recent empirical evidence of post-civil 
war democratization in El Salvador, Mozambique, and elsewhere, I show that democracy can arise 
directly from anarchy. Predatory warring factions choose the citizenry and democratic procedures over 
a Leviathan when (1) their economic interests depend on productive investment by the citizens, (2) citizens’ 
political preferences ensure that power allocation will be less biased under democracy than under a 
Leviathan, and (3) there is an external agency (e.g., the United Nations) that mediates and supervises 
joint disarmament and state-building. Ultimately, I discuss the implications of this argument for the basic 
intuitions of classical political theory and contemporary social theory regarding democratization and 


consensus is emerging among political scien- 

tists that democracy can originate from several 

types of social conditions (Przeworski, Alvarez, 
and Cheibub 2000; Shapiro 2001). Democracy can be 
a by-product of modernization (Lipset 1959). It can be 
the result of the emergence of the bourgeoisie (Moore 
1966) or an organized working class (Rueschmeyer, 
Stephens, and Stephens 1992). However, the least stud- 
ied and seemingly most unlikely source of democrati- 
zation is large-scale civil war, even anarchy. Indeed, 
vicious African and Central American warring fac- 
tions that obviously have no normative commitment to 
democratic ideals have created democracies. Democ- 
racy came about in an environment in which there is 
no political culture of tolerance, the state institutions 
have badly failed or even collapsed, the civil society 
is weak, and political actors profoundly distrust each 
other. 

The claim that democracy can in fact arise from 
civil war is backed by hard empirical evidence. Using 
measures of democracy provided by Jaggers and Gurr 
(1998) and civil war data from Licklider (1993), I find 
that nearly 40% of all civil wars that took place from 
1945 to 1993 resulted in an improvement in the level 
of democracy.” Civil wars gave birth to relatively stable 
democracies in Mozambique, El Salvador, Guatemala, 
and Nicaragua among others (Bermeo 2003, 160-61). 
The most spectacular improvements were experienced 
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I am responable for any remaining errors 
1 See Rotberg 2003 for a definıton and characteration of failed, 
collapsed, or weak states. 

2 See also Wantchekon and Nickerson (1999). They measure the 
change in democracy from just before the confhct began to five years 
after the cessation of conflict Following the standard procedure in 
international relations, they subtract the Polity 98 autocracy score 
(—10 to 0) from the democracy score (0 to +10) to create a democ- 
racy rating ranging from 0 to 20 They find that the most 

changes took place in Mozambique and El Salvador (14 pomts each), 
followed by Nicaragua (11 points), then Malaysia (10 points). 


in Mozambique and E] Salvador (14 points on the Polity 
98 scale.) 

Post-civil war democratization is particularly puz- 
zling because it occurs in a context in which most po- 
litical theorists not only predict but also recommend 
dictatorships. For instance, writing against the back- 
drop of English civil wars and at a time when royalists 
were wondering whether they had to resist a conquer- 
ing republican regime, Hobbes ({1968] 1985) contends 
that they (the royalists) should surrender their political 
rights to the rulers so long as they could govern and put 
an end to the civil war. Schumpeter (1942) argues that 
his minimalist or procedural conception of democracy 
would emerge only where there is political tolerance 
and a strong state. Huntington (1968) considers any 
form of liberal democracy simply unattainable in the 
absence of political order. In his study of the histori- 
cal experience of the emergence of European states, 
Tilly (1990) suggests that warlords can create a state. 
But he is quick to add that the European experience in 
state creation is not replicable in post-WWII develop- 
ing countries. In addition, there is no evidence in Tilly’s 
works to suggest that European warlords have ever cre- 
ated democratic states. Nevertheless, this emergence 
of democratic regimes from chaotic situations sounds 
like the confirmation of an old Kantian intuition. In 
his essay “Perpetual Peace,” published in 1795 (1995), 
Immanuel Kant contends that 


The problem of the formation of the (republican) state, 
hard as it may sound, is not insoluble even for a race of 
devils, granted that they have intelligence. 


He then Jays out the following theoretical puzzle. 


Given a multitude of rational beings who, in a body, require 
general laws for their own preservation, but each of whom, 
as individual, is secretly exempt himself from this restraint: 
how are we to order their affairs and how to establish for 
them a constitution such that, although their private dis- 
positions may be really antagonistic, they may yet so act as 
checks upon one another, that is, in their public relations 
the effect is the same as if they have no such evil sentiments. 
Such a problem must be capable of solution. (582) 


But Kant’s awareness of this matter has not led 
to a systematic explanation of how a republican 
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constitution can arise from an interaction between “ra- 
tional beings? and “evil sentiments.”? Exploring Kant’s 
remark in the light of post-civil war democratization 
strategies, I present a theory of democracy as an ar- 
bitration mechanism that stands as a solution to the 
Kantian paradox of a republican constitution framed 
by “a multitude of rational beings” without moral com- 
mitments to democratic values. It aims to offer a theory 
of post-civil war democratization. 


WHAT IS POST-CIVIL WAR 
DEMOCRATIZATION? 


Post-civil war democratization is different from the 
kind of political liberalization that followed the break- 
down of authoritarian governments in Europe and 
Latin America, and which has received much atten- 
tion in the literature on transitions to democracy. First, 
post-civil war democratization is primarily motivated 
by the need for political order. Citizens adhere to this 
form of democracy because it generates social order as 
a by-product'and offers protection against large-scale 
theft and illegal expropriation carried out by predatory 
warring factions. Citizens prefer this form of democ- 
racy because it allows them to change governments 
peacefully. In other words, in a post-civil war democ- 
racy, Popperian (1962) methods for transferring power 
without violence become preponderant. The need to 
control the ability of competing political elites to re- 
vert to political violence, at least temporarily, may 
dominate the need for popular representation or even 
public accourtability. Thus, a post-civil war democracy 
is Schumpeterian at best. It is essentially a tool for 
elite cooperation in the process of creating political 
order. 

To better understand transitions to democracy, it is 
essential to present parsimoniously the key differences 
between democratization following an authoritarian 
breakdown and democratization following civil wars. 
Polish and South African transitions, for example, were 
qualitatively different from those in Mozambique and 
El Salvador. Whereas in the former cases, citizens were 
fighting for a more open political process and basic po- 
litical rights, in the latter, citizens wanted to end a civil 
war and to create a sociopolitical order. The process 
of democratization following an authoritarian break- 
down typically involves an authoritarian government 
facing a more or less democratic opposition that is 
supported by a burgeoning civil society. Democracy 
comes into existence in such an environment when 
the balance of power within the government shifts in 
favor of the more moderate elements, who successfully 





3 Glaucon’s challenge to Socrates in the Republic (see 358B—360C) 
can be seen as a yariation of the Kantian paradox He wrote, “ But we 
could perceive that those who practice justice do so unwillingly and 
because they cannot do mjustice, 1f we could put a case in imagination; 
let us grant license to each, both just and unjust, to do whatever he 
wishes, and let us follow this up by seeing where his desire will lead 
each” (Plato 1999). Variations of Kantian paradox can be found in 
Hume's crrcumstances of justice and Machiavelli’s Prince. But m my 
view, among political theorists, Kant is the first to state quite clearly 
the strategic problem facing warlords mvolved in a civil war. 
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negotiate a political compromise with the democratic 
opposition. 

In contrast, the process of democratization in a war- 
torn society sometimes involves predatory warring fac- 
tions with no normative commitment to democracy, 
facing a very weak, collapsed civil society. Democracy 
in such a context has come about as the consequence of 
a peace agreement. Moreover, a key component of the 
democratization process in a war-torn country such as 
El Salvador has been the demilitarization of the war- 
ring factions and the reconstruction of the state. The 
electoral process was a tool for the creation of political 
order.* In sharp contrast with El Salvador, in South 
Africa and Poland the purpose of holding elections was 
to secure the representation of previously excluded po- 
litical or social groups. State-building efforts were not 
central to the transition process. 

The fact that civil wars in countries such as 
Mozambique and Nicaragua have been preceded ‘by 
authoritarian rule may explain why the transition litera- 
ture does not clearly distinguish between the authori- 
tarian breakdown and post-civil war democratization. 
In fact, the choice of democracy in these countries 
might have been partly rooted in the politics of exclu- 
sion that took place before the civil war. However, the 
war itself has such a profound effect on the government 
that post-civil war democracy is more an institutional 
response to civil war than to pre-civil war authorita- 
rian rule. In Mozambique and Nicaragua, the civil war 
almost annihilated the authoritarian political situation 
that led to war, whereas in postauthoritarian situatio 
many features of the previous regimes have pervaded. 

Not only are postauthoritarian democratization and 
post-civil war democratization empirically distinct po- 
litical categories, but their normative justifications ap- 
peal to very different traditions of democratic theory. 
Whereas political liberalization following authoritarian 
breakdown is typically in line with the liberal tradi- 
tions of Locke ([1689] 1967) and Hamilton, Madison, 
and Jay (1788), post-civil war democratization essen- 
tially appeals to the Hobbesian and Schumpeterian 
traditions, with an emphasis on order and conflict 
resolution.® The liberal element in post-civil war de- 
mocratization stems essentially from its promise of pro- 
tection of citizens’ rights to property. Thus, although 
a defense of the minimalist and Schumpeterian con- 
ception of democracy might be extremely relevant in 
a civil war-torn society, it might not be sufficient in a 
postauthoritarian society. In such a society, issues of 
accountability and representation, not sociopolitical or- 
der, are the top priorities of the democratic opposition 
and various groups in civil society. 


4 Catholic emancipation m the United Kingdom (1829) and the disso- 
lution of the Sonderbund ın Switzerland represent interesting cases of 
major political change or democratization in Europe resulting from 
a threat of civil war or actual civil war. 

5 See Diamond, Linz, and Lipset 1986 and O'Donnell and Schmitter 
1986. See also Manning (1997) and Montgomery (1995), who present 
excellent stucies of democratrzation in Mozambique and El Salvador, 
respectively. 

é See Dahl 1956 for a review of the Madisonian version of liberal 
democratic theory. 
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A THEORY OF POST-CIVIL WAR 
DEMOCRATIZATION 


Why would warring factions agree to democratize? 
Why would they defer to the citizenry the role of deci- 
ding which faction has control over the government? 
To address these questions, I describe the strategic situ- 
ation created by the civil war. I present the players and 
their strategies and predict the equilibrium outcome of 
their strategic interaction. An equilibrium in the game 
is defined as the strategy profile that is self-enforcing, 
in which no player can gain by choosing another 
strategy. 

The key players of the game are two warring factions 
and the citizenry.’ The two factions are involved in a 
violent conflict for political supremacy. To support the 
war effort and enjoy the economic benefits of political 
control, they also engage in extortion and expropriation 
of citizens’ wealth.® Thus, the warring factions might en- 
joy political supremacy for its own sake, but for the pur- 
poses of the present analysis, they want such supremacy 
because it provides economic benefits, particularly the 

_Tight to “expropriate” the fruits of the citizens’ labor 
or control the extraction of raw materials in the areas 
under their control.? 

While warring factions are motivated by power, citi- 
zens are motivated by security concerns (Kalyvas 2002). 
The citizenry is divided into three groups: those affili- 
ated with one of the two factions, such as political sup- 
porters or political activists, and the rest, who are not 
affiliated with either faction.’ The affiliated citizens are 
protected and enjoyed the full benefit of their invest- 
ment, while the nonaffiliated citizens are not protected 
and can be expropriated by warlords. 

The two warring factions have been involved in a 
costly and inconclusive conflict for control over the 
government. The conflict has reached a stalemate in 
which the probability of victory by either side has be- 
come very low, and a large portion of the citizenry has 
either fled the country or hidden their assets due to 
a wild expropriation in the hands of the warring fac- 
tions and their members." The stalemate has led the 


7 In some concrete civil war situations, the game might include more 
protagonists. A game’s having more than two factions will not signif- 
icantly affect the results. 
8 As shown by the recent civil war in Sierra Leone and the ongoing 
conflict m the Democratic Republic of Congo, precious raw mate- 
rials extraction may also support the war effort. Also note that m 
the model presented here, civil wars are not categorized as “justice- 
seeking” or “loot-seelang” (Collier 2000). It is assumed that rebels 
could have political motrves, but they use different forms of expropri- 
ation to finance the war. This assumption 1s consistent with evidence 
presented ar Kalyväs 2001. 

See Ross 2003 for case studies on the effect of resource wealth on 
civil wars. 
10 Affiliated citizens could mclude members of the Diaspora, who 
` tend to play a significant role m sustaining the civil war (P. Collier 
and Hoeffler 2000). For instance, in El Salvador, affiliated citizens 
are supporters of the government and the ARENA and the FMLN. 
Nonaffiiated citizens are ordinary citzens caught in the crossfire 
between the two warring factions. 
1 Doyle and Sambanis (2000) show that the probabilrty of resolution 
of the civil war and the success of peace-building operations is higher 
10 longer wars. The results are consistent with Fearon and Laitm 2002, 
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factions to consider the possibility of signing a peace 
agreement and creating a new government. A long- 
term peace agreement entails bargaining over future 
spoils of office, which is nearly impossible. So factions 
decide to call upon an independent third party to ar- 
bitrate the process of creating the new government. 
The third party or the arbitrator can be a single player, 
whom we will loosely refer to as Leviathan (e.g., a 
foreign power or military leader). It can also be a large 
set of individual players such as the citizenry of the 
country. 

There are three meaningful stages to the game. In 
the first stage, the two factions simultaneously choose 
whether to invite Leviathan, democratize, or maintain 
the status quo. If the two factions agree on their choice, 
that choice is implemented. Otherwise, the status quo 
is maintained. If they choose Leviathan, it picks one of 
the factions to run the government, and together they 
eliminate illegal expropriation and provide ordinary 
citizens with the security. If the factions were to choose 
democracy, they would then ban illegal expropriation 
and set up an electoral process. In that process, each fac- 
tion proposes a tax rate. Then the electorate observes 
the proposal and chooses its preferred candidate. The 
faction with the most votes wins and implements its tax 
policy.” Figure 1 is a sketch of the full game. 

The sole source of wealth in society is the citizens’ 
investment. Investment is assumed to be costly, but to 
generate private wealth for whoever invests, but it also 
has a positive effect on the wealth of the society as 
a whole (including the warring factions). The warring 
factions’ payoffs depend on the productive investment 
made by the residents of the country, an investment 
that generates a positive externality. This assumption 
is likely to be violated when the warring factions con- 
trol natural resources such as diamond (as in Sierra 
Leone or Angola), are involved in drug trafficking (asin 
Colombia), or are heavily sponsored by a foreign coun- 
try. In this case, the residents’ investment has very lim- 
ited effect on the warring factions’ political decisions.“ 
My goal is to show that a key difference between civil 
wars that do result in democracy (e.g., in Mozambique 
and El Salvador) and civil wars that do not (Sierra 
Leone, Angola) resides in the degree of economic de- 
pendence of the factions on citizens’ investment. The 
result would indicate that democratization is less likely 
when the factions depend heavily on foreign aid or na- 
tural resource wealth. 


Licklider 1993, Mason and Fett 1996, and Walter 1999. For the case 
of ethnic wars, see Kaufman 1996 and Rothschild 1997. 

12 As in standard game theoretical models, I am assuming that war- 
ring factions are intelligent in the sense that they know everything 
that we know about the (cvil war) game and that they can make any 
mference about the civil war sttuation that we can make (Myerson 
1991) Such an assumption may be unrealistic However, it would 
be reasonable to think that warring factions would learn from past 
experience and from interacting with the UN. Asa result, they would 
eventually be as knowledgeable of the civil war game as we are. 

13 Tn fact, the continuation of war might in fact be more profitable 
for the factions than its resolution. Obviously, everything else being 
equal, a faction that has to give up a lucrative drug business during the 
transition to democracy will be leas willing to embrace democracy. 
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FIGURE 1.' The Decision Tree 








The Status Quo 


The status quo situation is a “state of belligerence” that 
is characterized by a great deal of lawlessness. The state 
of belligerence is distinguishable from Hobbes’s under- 
standing of the state of nature. It is not a fictitious ac- 
count of a prepolitical condition of humanity. Instead, 
it is an illustration of concrete civil wars where factions 
are involved in costly battles. Members of both warring 
factions illegally expropriate part of the fruit of the 
citizens’ labor. Taking the factions’ expropriation levels 
as given, unaffiliated individuals choose how much to 
invest to maximize their retained output minus cost.!4 





14 Thus, in the model, players are assumed to be rational in the sense 
that they make “decisions conmstently in pursue of therr objectives, 
that us, to maxmmre the expected value of therr payoffs” (Myerson 
1991, 2). However, the state of belligerence (my version of the 
Hobbesian state ‘of nature) is partly caused by passion for power or 
polrtcal , which may cloud reason. The question 1s, How 
would the civil war game hold up if one introduced emotions or 
passions? In light of recent literature in economics, it appears that 
expressions of emotions are compatible with rationality (see Brams 
1997 for a survey) Frank (1988) writes, “Passions often serve our 
mterests very well indeed. The apparent contradiction arises because 
we face important problems that simply cannot be resolved by ratio- 
nal action. The common feature of these problems is that to solve 
them, we must commit ourselves to behave in ways that may later 
prove contrary to our interest” (4). As I show below, passion for 
power and the economic benefits associated with it enable warrmg 
factions to commnt to democracy. 














The following result is the first and (quite intuitive) 
equilibrium prediction of the game under the status 
quo. 


Result 1: Citizens’ investment decreases when the 
anticipated level of expropriation increases. 


That is, citizens will tend to work less when they an- 
ticipate more expropriation from warlords, As a result, 
wealth-maximizing warlords will want to curtail their 
greed in the face of the grim reality of a shrinking pie. 
This very simple result is quite important because it 
makes the cost of the conflict endogenous. 


Leviathan 


The second alternative that may be available to the 
warring factions is to invite an external enforcer or 
Leviathan that, for a “price,” will create a new govern- 
ment, eliminate illegal expropriation, and uphold an 
authoritarian rule of law. The external enforcer does 
two things: (1) tax citizens and (2) decide which fac- 
tions will constitute the government. The external en- 
forcer wants to leave each faction guessing about its real 





15 Formal proofs of the results (1 to 4) are provided in Wantchekon 
and Neeman 2002. 
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preference and may favor either faction. The equilib- 
rium prediction under Leviathan is given by: 


Result 2: There is more expropriation under 
Leviathan than under democracy. However, cit- 
izens are better off under Leviathan than un- 
der the status quo. In addition, both factions will 


never agree to invite Leviathan. 


The result is driven by the fact that the situation be- 
tween Leviathan and its favored faction is quite similar 
to the situation between the two warring factions un- 
der the status quo. In any case, the outcomes, as far 
as unaffiliated individuals are concerned, are almost 
identical. Under Leviathan, the citizens do not control 
who will be in charge of the government and what the 
government does. The tension between the desire to 
expropriate more (because it increases the direct pay- 
ment to the government) and the desire to expropriate 
less (because of its effects on investment) is resolved 
in favor of more expropriation since Leviathan and its 
agent “compete” over expropriation as under the status 
quo. 

The faction that would not be favored by Leviathan 
would clearly be worse off than under the status quo 
and democracy. Therefore it can never be the case that 
both factions would choose to invite Leviathan. Even 
if each faction receives a positive signal to indicate that 
Leviathan favors it, it will never be the case that, in equi- 
librium, both factions agree to invite the Leviathan. The 
reason is simple. Even if one faction obtains a favorable 
signal, it realizes that the other faction will agree to 
invite Leviathan only if it observes a favorable signal 
too. Since this is impossible, it can never be the case 
that the two factions will coordinate and agree to invite 
the Leviathan. 

As a final note, suppose that instead of creating the 
new government with only one faction, Leviathan es- 
tablishes a power-sharing government that would draw 
resources from both factions. Under this scenario, cit- 
izens would still invest less than they would under the 
status quo. This is because competition over expropri- 
ation would then involve three players (Leviathan and 
the two factions) instead of two players (Leviathan and 
one faction). As a result, both factions and the citizens 
would be worse off than under the status quo. There- 
fore, they will not agree to invite Leviathan. 


Democracy 


Democracy is modeled through the agreement of both 
factions to abide by the results of democratic elections. 
Such elections would allow one of the factions to form 
a government and eliminate illegal expropriation. The 
probability that Faction 1 assigns to winning in demo- 
cratic elections is denoted g, and the probability that 
Faction 2 assigns to winning, 1 — q. These probabilities 
measure the level of support that they enjoy or that 
they expect to enjoy among unaffiliated individuals, as 
well as on the number of their affiliated supporters. 
The probabilities summarize the level of optimism of 
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the warring factions in the future of democracy in the 
country.!6 

Democracy payoffs may be not enforced, unless 
there is an effective and impartial police force and other 
state institutions. But “Who will control the coercive 
powers of the state once the central authority has been 
reconstructed?” (Hartzell 1999, 7). I assume that the 
process of creation of the state involves transfers of mil- 
itary and human resources from each faction. The pro- 
cess will be mediated and supervised by neutral exter- 
nal agencies such as the United Nations (UN) through 
the UN peacekeeping operations.’” Thus, internal secu- 
rity measures or state-building processes take the form 
of an integration of the antagonists’ armed forces or 
a decentralized and federated command structure for 
the armed forces (Walter 1999).18 

I assume that when the state institutions are properly 
created, democracy (if chosen) will be enforced and 
the electoral process will be free of violence.” In other 
words, when citizens are concerned about postelection 
violence and the warring factions contribute to the state 
building effort, democracy payoffs will be realized. The 
factions will choose democracy if the payoffs associated 
with it are at least as high as the status quo payoffs and 
the payoffs under Leviathan. Since the payoffs under 
each scenario depend on the citizens’ investment, which 
in turn depends on the anticipated levels of expropri- 
ation, we need to compare the level of expropriation 
under democracy to its level under Leviathan and the 
status quo. The following equilibrium outcome com- 
pares the degree of expropriation under the various 
scenarios: 


Result 3: There is less expropriation un- 
der democracy than under the status quo or 
Leviathan. 


The intuition behind this result is that individuals 
who know that they will be expropriated will work less, 


16 In the democratic subgame, the warring factions seek to maximize 
therr respective payoffs under the constraint that these payoffs are 
higher than those under the status quo. We assume that players can- 
not commit themselves to future actions but are all rational agents. 
This mplies that the two factions and ordinary or unaffiliated citizens 
behave in a sequentially rational manner The equilibrium outcome i8 
solved by backward induction. That is, the factions choose democracy 
looking ahead to the election outcomes and their enforcement and 
assessing the credibility of the negotiated settlements. 

17 This was the case in all successful cases of post-civil war state build- 
ing such as Mozambique, Nicaragua, and El Salvador. For a large-N 
study of the likelihood of success of peace these operations depend 
on a variety of factors analyzed by Doyle and Sambanıs (2000). 

18 Also see Stedman (2001) for further analysis of conditions of suc- 
cessful implementation of post-civil war peace agreements. Yashar 
(1999) discusses the importance of joint control of the armed forces 
1n the process of democratization. 

19 A major obstacle to enforcement of the democratic payoffs resides 
in the “spoiler problem.” Spoilers are factions that believe that “the 
emerging peace threatens their power, world view, and interests, and 
who use violence to achieve it” (Stedman 2000, 178). This was the 
case in Angola when Jonas Savimbi refused to abide by the outcome 
of the 1994 UN-monitored post-civil war elections. The country was 
then plunged mto a new civil war m which more than 300,000 people 
died. In the presence of a spoiler problem, peacekeeping operations 
must be robust. 
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and a democratic government deals with this reduc- 
tion in productivity better than warring factions would 
under the status quo. Under the status quo, there is a 
“race to bottom” in the expropriation game between 
the warring factions since a dollar that is not stolen by 
one faction could be stolen by the other. The competi- 
tion leads to an overexpropriation. 

Now consider the situation under democracy. First, 
the factions have disarmed and deprived themselves 
of their power to illegally expropriate. Second, citizens 
prefer lower taxes and would work harder if the factions 
could commit to such a strategy. The factions would not 
mind choosing such a strategy ifit did not lead to a lower 
revenue than the status quo. But, in fact, parties are 
to some extent forced to lower taxes to improve their 
chances of winning the elections. Indeed, to get more 
votes, the parties will be willing to lower taxes to the 
point where their expected revenues under democracy 
are equal to their expected revenues under the status 
quo. Thus, bécause there is a lower rate of expropri- 
ation, citizens work harder. Both the winning faction 
and the unaffiliated individuals are better off compared 
with the situation under the status quo. The faction that 
loses the election may be worse off because it loses its 
ability to expropriate, but its loss is moderated by the 
fact that citizens work harder and therefore generate a 
higher level of well-being for the whole society. 

The bottom line is, If parties can commit to lowering 
the level of expropriation (which they do at the state- 
building stage of the game by disarming themselves), 
then citizens will work harder and invest more. As re- 
sult, a level of expropriation that is lower than that 
under the status quo can be an equilibrium outcome. 
This leads to the following: 


Result 4: There is a threshold probability q* that 
is less than one-half, such that, whenever the fac- 
tions think that they have at least g* chance of 
winning the election, they will choose to democ- 
ratize. ` 


Because the rules of the game are such that if one 
faction insists on maintaining the status quo, there is 
nothing that the other faction can do about it, there 
is always a chance that the status quo is maintained. 
However, if both factions assess a high enough chance 
of winning in future democratic elections, there also 
exists a more interesting scenario where both parties 
choose democracy. Democratization occurs when the 
distribution of political preferences of the citizens is 
such that factions assess a reasonable chance of winning 
the first and subsequent democratic elections. In other 
words, q* reflects the diversity of political interests in 
the current and future electorate. 

The economic dependence of the warring factions 
on citizens’ productive investments and that they are 
divided explain in large part why they choose to insti- 
tutionalize the power of the citizens by democratizing. 
In countries such as Angola and Sierra Leone, where 
there are lootable resources (diamonds), democratiza- 
tion is unlikely since the warring factions do not depend 
on citizens’ investments to maximize their revenues, 


22 


and anarchy is highly profitable in the short run. In 
a country such as El Salvador, not only do the war- 
ring factions depend on domestic investment during the 
civil war, but also their own economic interests become 
less conflictive. As Wood (2000) shows, the democratic 
pact between the Farabundo Marti National Liberation 
Front (FMLN) and the National Republican Alliance 
(ARENA) in EI Salvador was facilitated by the fact 
that ARENA became less reliant on agriculture. 

The great challenge of a post-civil war democracy 
is that its institutions play both the (Hobbesian) role 
of securing the protection of ordinary citizens against 
illegal expropriation and the (Popperian) role of se- 
curing a peaceful power sharing between the factions. 
This is because warring factions need to convince each 
other and the citizens that they are ready to take turns 
in ruling the country. They also need to convince cit- 
izens that they are serious about eliminating political 
banditry. They need to simultaneously develop law en- 
forcement institutions, such as an effective police force 
and a criminal justice system, as well as political insti- 
tutions, such as a depoliticized judiciary and electoral 
commission. Without those institutions, the promise of 
security of property for citizens or political rights for 
the warring factions will not be realized, and the choice 
of democracy will not be validated. 


Illustrative Example: El Salvador 


The civil war in El Salvador during the 1980s was 
largely the result of gross inequality and repression by 
the military government. The inequality dates back to 
the mid-1800s, when government policies concentrated 
land into the hands of the “14 families” to produce 
coffee “efficiently.” This high concentration of wealth 
and power led to a series of peasant and worker up- 
risings, culminating in the movement led by Augustin 
Farubundo Marti in the 1920s (Paige 1997; Torres-Rivas 
2001). In 1932, the insurrections were stamped out by 
a number of military officers, who then seized power 
through a coup d’etat. The next 50 years were char- 
acterized by cycles of reform and repression, but in- 
equality and military domination remained constant 
(Montgomery 1995; Paige 1997). In 1980, a coalition 
of 18 leftist and far-leftist groups formed the Revolu- 
tionary Democratic Front (FDR) and later the FMLN, 
the military affiliate of the FDR. A civil war broke out 
between government forces and the FMLN. 

Both factions were predatory. For instance, the cot- 
ton industry was an easy target for FMLN fighters be- 
cause of its susceptibility to fire and bug infestations. 
Fighting also led to the abandonment of farms, which 
were subsequently appropriated by the peasants affili- 
ated with the guerillas. As predicted by Result 1, com- 
petitive predatory behavior led to underinvestment, 
especially in the agricultural sector. Coffee production 
decreased by nearly 50%, the GDP per capita fell by 
28% between 1978 and 1982, and there were a mas- 
sive emigration of the economic elite in the 1980s. The 


2 See Renner 2002 for an extensive empiical study of this question. 
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contribution of agricultural export to domestic produc- 
tion fell from 25% to well below 5%, a very sharp 
decline that Wood (2000) attributes to increased inse- 
curity over land property and to a shift in the structure 
of economic production away from export agriculture 
57). 

l The war was thus extremely destructive and in- 
conclusive. By 1984, both sides realized that an out- 
right military victory was extremely unlikely; however, 
neither side trusted the other enough to begin ne- 
gotiations. The government and military feared that 
the FMLN would use the talks to build up military 
strength while pushing for reductions in the size of the 
government army. They feared that the rebels would 
leave the negotiations after obtaining a decisive mil- 
itary edge. The rebels feared that negotiations would 
expose their clandestine organization and enable the 
military to eliminate leaders through a massive “dirty 
war.” In 1990, both warring factions finally agreed to 
invite the UN Secretary General to mediate further 
negotiations.” To be sure, the assassination by the mil- 
itary of the six Jesuits during the 1989 offensive had 
led to a sharp decrease in U.S. support for the military 
under the status quo. Without strong U.S. support of 
their war effort, the military had no choice but to ne- 
gotiate. The final settlement, reached in 1992, included 
the following terms: the disbanding of rebel forces in- 
crementally over a nine-month period; the government 
purchase of land for redistribution in rural areas, with 
the beneficiaries repaying the government; the purging 
of the government officers’ corps; the absorption into 
the regular army of the national guards and the treasury 
police; the dissolution of the military intelligence and 
civil defense units; and the creation of new police forces 
(Montgomery 1995). 

More importantly, the two warring factions agreed 
to hold elections. The agreement was possible because 
(1) each faction assessed a high enough chance of 
winning either the first or subsequent post-civil war 
elections, and (2) each faction thought that its eco- 
nomic interests would be protected under democracy. 
In particular, the FMLN was looking well beyond 
the first election and thought that democracy would 
protect their political rights as well as the economic 
interests of their constituents.” At the same time, 





21 Wood (2000) also attributes the decline of the contnbution of 
export agriculture to the GDP to the nationalization of financial 
and export-marketing sectors and the government economic policies 
from 1980 to 1989 (57-58). 

2 The UN's role is best understood as that of a (third-party) “arbitra- 
tor” rather than an “enforcer” or Leviathan. Although the UN made 
several “binding” recommendations and supervised the carrots and 
sticks that kept the implementation of the peace agreement going, 
unlike the “enforcer,” ıt was capable of establishing a reputation to 
act impartially 

D Joaquim Villalobos, one the leaders of the FMLN, said, “Our polit- 
ical forces will be participating with the aim of preventing the talang 
of land from the peasants, the reversal of judicial reform and the 
politicrzation of the training of the new police force. ..The ques- 
tion of majority or mmority electoral support does not matter. In 
El Satvador, it is important that we continue to reach an agreement 
whether we are in the majority or ın the minority The confrontation 
ended only months ago. Perhaps, once it is further behind us we can 


according to Paige (1997), the ARENA and its con- 
stituents (the coffee plantation owners) believed that 
there was a high enough chance that they would win the 
first post-civil war election and control the government 
(215). 

However, the stated goal of the parties was the cre- 
ation of a political system in which real power is al- 
located by elections. The winners must be guaranteed 
that they will obtain genuine control over government 
decisions. The losers must be guaranteed that oppor- 
tunities for their political participation will continue 
and will not subsequently be repressed and eliminated 
(Gibb and Smyth 1999, 1). 

In summary, the outcome of the civil war in El 
Salvador illustrates the main point of the theory out- 
lined above: Democratization can be forced by the eco- 
nomic cost associated with a breakdown of civil order. 
In other words, the rationale for democratization stems 
in part from its ability to generate the rule of law asa by- 
product. The El Salvadorian case also sheds light on the 
difference between inviting an external enforcer (such 
as Syria in Lebanon) and inviting an arbitrator (such as 
the UN in El Salvador). Once the factions have decided 
to settle on democracy, an external arbitrator can fa- 
cilitate its implementation by providing the resources 
for successful negotiations and transparent transition 
to democracy. 


POST-CIVIL WAR DEMOCRATIZATION AND 
CLASSICAL POLITICAL THEORY 


Are there aspects of classical political theory that could 
help explain or predict the emergence of democracy 
from civil wars? What are the connections (if any) be- 
tween the theory developed here and classical political 
theory? 

I claim that, despite some possible similarities, clas- 
sical political theory does not fully explain or predict 
that political order, and democracy could simultane- 
ously arise from civil war. For instance, the Hobbesian 
and Lockean political theories unequivocally preclude 
warlord democratization, and Machiavelli considers 
warlords not as creators of Italian republics but, in- 
stead, as their worst enemies. In addition, Machiavelli 
quite explicitly favored the authoritarian solution dur- 
ing the transition period between anarchy and popular 
rule.> However, my theory of post-civil war democ- 
racy echoes crucial aspects of Machiavellian republi- 
canism inspired by popular rule in Rome (McCormick 
2001). Machiavelli presents the people or the plebes in 
the Roman republic as the ultimate arbitrator of the 
competition for political power among the nobles. Fi- 
nally, I find that even though standard theories of state 
formation (Huntington 1968; Tilly 1990) preclude the 
creation of democratic states from a situation of civil 





embark upon a path of more democratic norms” (Gibb and Smyth 
1990, 1) 

M Thus is not surprising. Machiavelli ([1531] 1997) always considered 
warlords to be the worst enemies of Italian city republics. 

3 He wrote, “In order to establish the constitution of a republic one 
needs to have the sole power” ([1531] 1997, 110) 
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war, they provide useful building blocks to develop a 
theory of post civil war democracy. 

Below, I examine some aspects of Hobbesian and 
Lockean political theories to illuminate the differences 
between their understanding of political order and the 
theory of warlord democracy. In order to emphasize 
my demarcation, I explore one of the major concepts 
of modern political thought, the “archaic notion” of the 
state of nature as Nozick (1974) calls it, which is used 
by Hobbes and Locke as well. 


The Hobbesian Theory of Polltical Order 


The theory ‘presented here is partly grounded in a 
Hobbesian theory of social order. The theory’s con- 
text is the British Islands plagued by a decade-long 
civil war, which generated lawlessness. Hobbes ([1968] 
1985) models the civil war environment as one in which 
people are assumed to be in relentless pursuit of power. 
Hobbes explains that the equality of ability in people to 
accumulate power leads to an equal hope among them 
for achieving all their desired ends. So if two people 
want the same thing, which they cannot both have, they 
will probably end up destroying each other (87). People 
feel secure only if they see that no other power is great 
enough to endanger them (88). This produces a cyclical 
effect, so that as some try to accumulate power, others 
are forced to do the same. 

If people constantly try to dominate others, it creates 
a zero-sum game where acquiring power means mak- 
ing others powerless. The game also generates chaos 
in society and the fear of violent death in each indi- 
vidual. However, because people can exercise reason, 
they can agree on the principle of self-preservation. 
Each individual would give up rights to all things as 
long as everyone else does the same. The making of 
such a contract poses the problem of mistrust. Hobbes 
([1968] 1985) notes that “he who performeth first, has 
no assurance that the other will perform later” (96). 
He cites people’s ambition, avarice, and anger as po- 
tential causes for the breakdown of the contractual ar- 
rangement. Thus, anarchy and chaos threaten people 
at all times. To prevent chaos, Hobbes suggests that 
people assign the sovereign of the authority to balance 
their desire to accumulate power against their need for 
peace. The sovereign must have coercive power to com- 
pel persons equally to perform their covenants, using 
the terror of some punishment, which must be greater 
than the benefit they expect in breaching the contract. 
In other words, all rights are abdicated to the sovereign, 
and by implication, all resources are put at its disposal 
to enforce the law. 

Hobbes ([1968] 1985) also stressed the fact that al- 
though the authority of the sovereign must be self- 
perpetuating, undivided, and ultimately absolute, it is 





2 In Anarchy, State, and Utopia, Nozick (1974) resuscitates the idea 
of a state of nature ın an attempt to legitimate the minimal state as 
“the only morally tolerable one” (333) In contrast, I do not mtend 
to offer another theory of state. My interest ın the state of nature is 
based solely on its fruitfulness as a demarcation point to highlight 
the differences between my approach to warlord democracy and the 
classical conceptions of political order. 
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an authority conferred by the people (227-28). Thus, 
the sovereign’s right to command and the subjects’ 
obligation for obedience are the result of consent. In 
other words, the people rule through the sovereign 
(Held 1996, 77). In a more forceful endorsement of 
“Hobbesian liberalism,” Hampton (1994) argues that, 
“implicit in Hobbes’s own contract is the idea not only 
that law can be the ultimate governor in a political so- 
ciety, but also that the ruling of law can and should 
have ... a certain ‘democratic’ content” (14). 

In contrast with Hampton’s interpretation, my view 
is that Hobbesian liberalism is a political fiction. Cit- 
izens are heterogeneous and are likely to have con- 
flicting interests. As a result, they cannot unanimously 
agree to create an undivided and absolute authority. In 
addition, even if the Hobbesian sovereign were created 
and could commit not to abuse his or her authority, 
unless such authority is divided and reflects the diver- 
sity of citizens’ political or economic interests, he or 
she cannot commit to being impartial. As the model 
above suggests, some individuals could even have a 
lower payoff under the sovereign than under anarchy.’ 
Therefore, the consent of all citizens to surrender their 
rights to an all-powerful sovereign is simply impossible. 
The only way the Leviathan can come to life is by being 
im 


Thus, the most natural interpretation of Hobbesian 
political theory is that it justifies authoritarian rule as 
a mean to create social order. As Skinner (1972) notes, 
Hobbes himself presented Leviathan as a justification 
for Cromwell, who was an active player in the En- 
glish civil war, not a solution to it. More recently, Leo 
Strauss (1965) turned to Hobbes to find a stable so- 
lution to the chaos and disorder in Weimar Germany. 
But as McCormick (1997) argues, this attempt led to 
the strengthening of the military, the repression of the 
Left, and, ultimately, the collapse of the republic and 
the emergence of authoritarianism. 


Polltical Order In a Post-Civil War Soclety 


The state of belligerence in war-torn countries is char- 
acterized by lawlessness. Political order in such an en- 
vironment requires a contract of governance between 
political actors and citizens. Following the Hobbesian 
logic, such a contract of governance requires an en- 
forcer or a sovereign. The role of the sovereign is to 
create and maintain political order and to protect the 
rights that political order makes possible and, most im- 
portantly, the right to hold political office or the rights 
to secure private property. However, in contrast to the 
Hobbesian logic, the role of the sovereign is also to 
choose a government and, therefore, allocate political 
offices. In addition, instead of reducing the alternatives 
available to the political actors to only Leviathan, i.e., a 
single powerful player such as a military leader or a 
foreign government, I assume that the enforcer can 
also be a small group of players such as the clergy, the 


77 For instance, Hobbes was concerned about the potentially subver- 
sive nature of religion. He stressed the need to monitor public opinion 
and to subject religious pastors to the sovereign (Ngomo 2003). 
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armed forces, or a very large group of players, such as 
the citizenry. As does Hobbes, I also assume that the 
sovereign’s rule depends on the consent of all (if not 
most) political actors. However, I assume as in Locke 
([1689] 1967) that the actors’ consent is conditional and 
depends on whether they perceive the sovereign to be 
as effective as Leviathan but also fair and neutral. For 
example, if a political actor believes that the enforcer 
is likely to be biased against him or her, this actor has 
little incentive to participate to the process of creating 
the new government. 

Based on the above premises, I argue that if the 
sovereign were Leviathan, i.e., a single powerful ac- 
tor, at least one political actor will choose either not 
to participate in the process of the creation of the new 
government or to walk away from the political contract 
in the future. Leviathan could be effective in creating 
and enforcing political order, but it cannot commit to 
allocate political power fairly and neutrally between 
groups. It has incentives to collude with one of the 
warring factions for short-term gains.” However, if the 
warring factions choose to join their forces for the cre- 
ation of the new institutions of the state (army, police, 
and court) and then grant the citizenry (a very large 
set of small-scale “sovereigns”) the authority to fill the 
offices of the new government, the warring factions 
will abide by the contract, and political order will be 
created. This is because a large set of small sovereigns 
is less likely to coordinate and collude with one group 
at the expense of the other. They are more likely to 
commit to being neutral. One particular citizen might 
have clear political preferences. But the preferences of 
the citizenry as a whole will tend to be fuzzier and more 
unpredictable. 

Furthermore, governments under the sovereignty of 
citizens (i.e., under democracy), generate a lower level 
of expropriation and higher investment than govern- 
ment under Leviathan. Since by assumption invest- 
ments generate wealth for investors but also have a 
positive effect on the well-being of the society as a 
whole, all factions are better off under democracy than 
Leviathan. To summarize, political actors might settle 
on democracy because (1) the process of power allo- 
cation under democracy is less likely to be biased and 
(2) economic gains under democracy are higher than 
under Leviathan. 

Post-civil war democratization partly originates from 
the state of belligerence, a form of the Hobbesian state 
of nature, but how does it connect with more liberal 
traditions, in particular, with Lockean liberalism? 


The Lesson of Lockean Liberallsm 


Locke derives his conception of political order from 
an analysis of the natural condition of humankind. In 
contrast to Hobbes, Locke asserts that “one cannot as- 
sume that a priori that men are in state of war with each 
other .... Rather, an incident of war is a declaration of 





28 As mentioned earlier, this was the case with Cromwell in the 
English civil war (Skinner 1972). 


design to commit injury, because men are presumed 
not to harbor attitudes of ‘enmity and destruction’ 
towards each other” (Ashcraft, 1968, 901).% Locke’s 
([1689] 1967) state of nature is not the state of anar- 
chy characterized by famine and war. Instead, it is a 
state in which individuals endowed with reason believe 
in the law of nature, which “teaches all mankind... 
that being all equal and independent, no one ought to 
harm another in his life, health, liberty and possession” 
(12). The central problem with the law of nature is that 
individuals must enforce it. Individuals are partial to 
or biased toward themselves. As a result, justice ren- 
dered by an individual will tend to be disproportionate. 
The aggravated party might then retaliate, and society 
could quickly sink into the Hobbesian state of nature. 
However, continuous interactions between individuals 
in the process of decentralized law enforcement could 
lead an equilibrium in which markets, money, and even 
a civil society emerge, things that Hobbes explicitly 
ruled out.” 

In summary, Locke has less pessimistic assumptions 
about human nature and develops a notion of nat- 
ural rights that sets moral limits on government. In 
addition, Lockean government presupposes the exis- 
tence of social order and a civil society. As opposed to 
the Hobbesian government, which creates the society, 
Lockean government is created by the society. Its role 
is to protect individuals’ natural or God-given rights. 

The historical context of Lockean political philos- 
ophy (Locke’s association with radical opponents to 
the policies of Charles I, the Glorious Revolution, 
but also the English Civil War) indicates that he may 
well be proposing a solution to both government un- 
certainty and civil war. In other words, Locke’s po- 
litical theory ([1689] 1967) could well be a response 
to Filmer’s (1651) ideological defense of royalist ab- 
solutism, but also to Hobbes’s (1951) solution to the 
English Civil War. However, he explicitly conceded 
that original commonwealths or “young societies” have 
generally been “under the government and the admini- 
stration of one man,” “nursing fathers” who led them 
all the way through “the weaknesses and infirmities of 
their infancy” (101, 110, 162).3! Thus, the immediate 
conclusion that one could draw from Lockean political 
theory is that democracy is not possible in the absence 
of natural rights or in a state of anarchy. 





D Locke ([1689] 1967) makes a clear difference between the state 
of nature and the state of war, in contrast to Hobbes, who equates 
it to that of war Locke mentions the “plain difference between the 
state of nature and the state of war some men have confounded” 
(30). According to him, they “are as far distant as the state of peace, 
good-will, mutual assistance, and preservation, and a state of enmity, 
malice, violence, and mutual destruchon are from another” (32). 
There is no doubt that this passage pinpoints Hobbes’s approach 
to the state of nature. 

30 Ashcraft (1968) wntes, “In a state of nature, it is possible for men 
to behave both well and badly. As a state extended over a period of 
tme, a state of nature will be both tranquil and violent, and always 
because of specific social circumstances, it will be esther one or the 
other” (903). 

31 Locke obviously did not conmder England as an origimal common- 
wealth or a young society. 


The Paradox of Warlord Democracy 


Post-civil war democracy is to some extent Lockean 
because it allows for popular control of government 
officials. Indeed, because the citizens have been granted 
not only the rights to private property but also the rights 
to allocate political offices between the factions, the 
citizens have the power to alter the government if they 
are wronged. The government is set up to protect the 
political rights of all factions as well as the private prop- 
erty of the citizens and will be illegitimate and voted out 
of office if it fails in this regard. 

A threat of a revolution might not be credible 
enough to deter serious infringement of political rights, 
given the fact that civil society is not usually strong 
in post-civil war countries. However, such a risk will 
be minimized when state institutions are properly de- 
signed. The design of the institutions takes into account 
Hamilton, Madison, and Jay’s (1788) concerns about a 
tyranny of the majority, i.e., the winning faction. 


Post-Civil War State Bullding 
Machiavelli [1531] 1997) contended that the power 


of all governments (whether they are new, long es- 
tablished, or between the two) is based on good laws 
and good armies. He stressed that there could not be 
good laws where there were not good armies, and where 
there were good armies there must be good laws (39). 
Machiavelli’s remarks are particularly valid today in 
the context of post-civil war democracies. Commitment 
to democratization is achieved through demilitariza- 
tion, the creation of an effective police force as well as 
an independent judiciary under the supervision and the 
technical assistance of the UN. 

The first crucial step in post-civil war state building is 
the formation of the new armed forces. These include 
their size, their composition, and locations for train- 
ing and command structure. The new army should be 
strong enough to wield enough enforcement power but 
not so strong that it could remain under civilian control. 
Furthermore, the army’s composition and command 
structure should reflect the political and perhaps eth- 
nic makeup of the country. Finally, in order to secure 
the abidance of the democratic principles by the new 
army, it has to be purged of the most notorious human 
rights violators and the rank-and-file elements have to 
be properly trained. 

In Mozambique, the Rome Accords, signed on 
October 4, 1992, set the size of the new joint armed 
forces at 24,000, with each side contributing 12,000 per- 
sonnel, consisting of the air forces at 4,000, with 2,000 
from each side, and the navy at 2,000, with 1,000 from 
each side, The parties also agreed to the formation of 
the Joint Corhmission for the Formation of the Mozam- 
bican Defense Force (CCFADM, CSC). The UN’s 
involvement served as a guarantee to either side that 
the other side would not renege on its part of the 
agreement.“ 

Judicial reforms represent the other crucial step in 
the process of state building in post-civil war democra- 





32 See Wantchekon and Jensen 2002. 


26 


February 2004 


cies. Since Montesquieu’s ([1748] 1952) groundbreak- 
ing work on the separation of powers, the judiciary has 
been perceived as the main cornerstone of democracy. 
Montesquieu’s chief concern was the development of 
a positive legal system to regulate public and private 
life and to prevent abuse of power. The reason was 
that “every man invested with power will attempt to 
abuse it, and to carry its authority as far it will go” 
(69). Thus, one should check power with power. One 
should design governments in such a way that “no man 
shall be compelled to do things to which the law does 
not oblige him or not force him to abstain from things 
which the law permits” (69). In addition, political power 
should be distributed and organized so that those who 
attempt to abuse it finds legal restraints in their way 
(Plamenatz 1963, 292-93). The evidence suggests that 
the three main aspects to the design of a newjudiciary in 
post-civil war democracies are human rights provisions, 
Supreme Court appointments, and changes to the penal 
code (Wantchekon and Jensen 2002). 

For instance, a national council on the judiciary was 
to be set up in the aftermath of the El Salvadorian Peace 
Accords to serve independently as the Supreme Court. 
The council would take over the appointment of the 
lower court judges from the Supreme Court, as well 
as nominate half the Supreme Court candidates, with 
the other half to be nominated by Salvadoran bar as- 
sociations. The Assembly would elect Supreme Court’ 
justices by two-thirds majority for “staggered terms of ` 
nine years, instead of by a simple majority to serve only 
a single five-year term.” 


Machlavelllan Republicanism 


Democracy as an exit option from civil war is essentially 
a modern phenomenon. In medieval Europe, warlords 
or warring factions were never associated with the cre- 
ation of democracies or republics. They were instead 
associated with their decline and destruction. When 
invited as mercenaries by city-republics across Italy 
to provide order and security, they turned themselves 
against democracy and republicanism. Machiavelli 
({1531] 1997) described warlords as one of the chief rea- 
sons behind Italy’s ruin (38). He presented them as am- 
bitious, ill disciplined, and treacherous. He contended 
that when a republican government was lucky enough 
to hire competent mercenary commanders (warlords), 
they would increase their power at the government’s 
expense, They would simply ruin the government if it 
were incompetent (39). 

However, when defined as an arbitration mecha- 
nism, post-civil war democracy connects quite well with 
Machiavellian republicanism.” As in the current analy- 
sis, the historical background of Machiavellian political 
theory was intense competition and even war within 
and between Italian city-states. Machiavelli endorsed 
a pluralistic approach to politics in which competing 
social groups could promote their own interests, with 





B See McCormick 2001 for a thorough analyms of “Machiavelhan 
democracy.” 
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the outcome being decided according to the balance 
of power among the different branches of government. 
Using Rome as an example, Machiavelli argued that 
only popular participation in political life and mixed 
government comprised of an executive, the aristocracy, 
and the “people” could help overcome factional strife 
and generate individual freedom as well as peace and 
Tity. 

In his analysis of Roman republic, Machiavelli 
claimed that the Roman people (the plebes) were 
guardians of liberty and were more trustworthy and 
less power-hungry than the nobles and the great 
(McCormick 2001). As arbitrators of the competition 
for power between the nobles, the people were more 
likely to be fair and exhibit good judgment. They were 
more able to select the best policy proposals put forth 
by the elites and were better than elites at distributing 
political offices. Machiavelli ([1531] 1997) claimed that 


a prudent man ought to never depart from the popular 
judgments especially concerning distribution of ranks and 
dignities, for ın this only does the people does not deceive 
itself. If it does deceive itself at some time, it so rare that the 
few [i.e., the nobles] who have to make such distribution 
will deceive themselves more often. (1.47) 


The logic of Machiavelli’s thesis is in many ways sim- 
ilar to that of the one developed in this article. As in my 
setup, the alternative to using the people as arbitrators 
is to use a unitary actor, a prince, or a foreign power. 
Machiavelli argued that whereas the people base their 
decisions on the candidates’ good reputations, the uni- 
tary actor is more likely to fear those candidates with 
good reputations. In other words, the unitary arbitra- 
tor tends to view the good candidate as a threat to his 
or her power and tries to eliminate him or her. For- 
eign arbitrators also tend to exacerbate domestic con- 
flicts to take advantage of the chaos that would result 
from it. 

Like post-ctvil war democracies, Roman republican- 
ism was generated by indirect domestic pressure for 
political change. Whereas the former was created by 
the warring factions in response to a drastic decrease 
in citizens’ productive investment, the latter was cre- 
ated following a massive emigration of the plebes in 
the aftermath of the power struggle among the king, 
the nobles, and themselves (the plebes). As Machiavelli 
contends, Rome developed into a republic when the 
plebes and the Senate joined forces to expel the kings. 
However, after the expulsion of the kings, the plebes 
left Rome in Jarge numbers in 494 BC, in response to 
different forms of abuse at the hands of the nobles. The 
nobles feared the city’s economic and military decline 
and accepted the establishment of the tribunes. Accord- 
ing to Machiavelli ({1531] 1997) the plebes created the 
tribunes “to hold back the insolence of the nobles and 
thus preserve the free life of the republic” (1.3, 11.11). 

A key difference between Machiavelli’s account of 
Roman republicanism and post-civil war democracies 
is that whereas the former was created by a combina- 
tion of indirect economic pressure and a direct political 
pressure from the people, the latter have been created 
almost entirely through indirect economic pressure. Al- 


though in Rome the plebes took over the role of arbi- 
trator and played a more active role in the creation 
of the republic, in Liberia or Mozambique the people 
were essentially invited by warlords to be arbitrators 
and played a less active role in the democratization 
process. As a result, the mode of popular participation 
in the democratic process and the mechanisms for elite 
accountability described in McCormick 2001 seemed 
to be stronger in Rome than in modern post-civil war 
democracies. This difference is not surprising since war- 
lords clearly had much more power in Mozambique or 
Liberia than did the nobles in the city of Rome, who 
relied on the population for soldiers. 

The choice between the people and the prince or a 
foreign power as highlighted by Machiavelli is a choice 
between one powerful arbitrator and a large number of 
small arbitrators. Machiavelli (1994, 96) preferred the 
people but added that the people could not play their 
role as arbitrators unless unified by political institu- 
tions such as tribunes and popular assemblies. Without 
the support of these institutions, the people would be 
isolated and, as a result, weak and cowardly, thinking 
only about their fears. In modern-day, post-civil war 
democracies, state institutions such as the judiciary and 
political institutions, such as the parliament, support 
the people’s actions. 

Two characteristics of popular arbitration that 
Machiavelli did not mention are its fairness to all par- 
ties and its neutrality. Not only will the people tend to 
show good judgment in the process of selection of the 
good candidates and good policies, but also they are 
less likely to collude with one candidate at the expense 
of the other. The choice between the people and a for- 
eign power involves a trade-off between enforceability 
and neutrality: Popular arbitration of an elite’s political 
contract is superior to external arbitration because the 
people’s action (the electoral outcome) is more likely to 
be perceived as fair and neutral by the political parties 
and less likely to be subject to ex-post renegotiation. 
However, popular arbitration is inferior to external ar- 
bitration in that, in contrast to an external arbitrator, 
the people may lack the power to enforce their own 
decisions. In other words, when civil society is weak, 
voters cannot stop either party from altering elec- 
toral outcomes. Thus, only when parties are disarmed 
and a party cannot alter electoral outcomes can pop- 
ular arbitration or democracy generate elite coopera- 
tion. 

The search for an impartial and neutral tool to miti- 
gate the disruptive effect of factionalism was an impor- 
tant feature of political life in Italian city republics. As 
Waley (1991) maintains, the political scene in medieval 
Italy was characterized by factionalism fueled by in- 
tense competition for political office. The citizens were 
driven by an ardent desire to obtain the “honors and 
benefits” of office (Manin 1995). To overcome factional 
strife, most Italian communes adopted the institution 
of podesta, a foreigner endowed with judicial and ad- 
ministrative powers. The podesta was usually hired for 
a year and played the role of military leader, judge, and 
administrator. An important attribute of the podesta 
was that he had to be a foreigner so that he could 
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be neutral to the internal “discords and conspiracies” 
(Waley 1991, 37). 

Besides the podesta, which was essentially an admin- 
istrative tool for mediating factional strife, the lottery 
system was used to allocate political offices. The attrac- 
tive feature of lots was to shift the allocation of offices 
to a procedure that was not subject to human influence. 
According to Waley (1991) this procedure was chosen 
to prevent “the domination of city republics by cliques 
who might prolong their control by securing the choice 
of members of their own factions” (37). But the practice 
of the lottery became unpopular, and many intellectu- 
als asked for its replacement by elections. Among them 
was the political theorist Leonardo Bruni, who argued 
that when citizens must compete in an election and 
openly put their reputation on the line, they have an 
incentive to behave well (cited in Manin 1995). In ad- 
dition, Guicciardini ([1521] 1994) proposed extending 
the electorate to include a greater number of citizens, 
to combine the lottery’s neutrality and impartiality 
effects with the incentive for good behavior provided by 
elections™ Manin (1995) states that, “Guicciardini’s 
proposal is remarkable for its rather unexpected 
justification of the extension of voting rights, but more 
importantly in its search for neutral institutions that 
could mitigate the divisive effects of competition for 
office” (54). Thus, democracy in the city-republic of 
Florence can be defined essentially as a competitive 
oligarchy,*5 

Even though Machiavelli, with Guicciardini, favors 
the people as arbitrators of political disputes, he clearly 
discounted their ability to found a republic. In that re- 
gard, he is well in line with Hobbes. Machiavelli argues 
that only one man can create a republic (e.g., Romulus 
in Rome). In The Prince (Machiavelli 1994), he also 
described how Cesare Borgia established courts and 
representative institutions in the anarchic Romagna re- 
gion controlled by warring nobles only after he brought 
order to the region. Order was brought to Romagna by 
a repressive authoritarian rule. Referring to Romulus 
in the Roman case, Machiavelli contends that a radical 
reform of the republic or a kingdom requires that one 
person alone must make all the key decisions. He wrote, 
“A wise legislator, when establishing a republic ... 
should make every effort to ensure that all the power 
` lies in his own hands” (108). 





™ From “Del modo di ordinare il governo popular” (Guicciardim 
([1521] 1994), cited m Manin 1995. Note that, in contrast to 
Guicciardini, for whom elections are the full extent of popular partic- 
ipation, for Machiavelli, elections are just one of several mechanisms 
of popular government and elite control (see McCormick 2001). 

35 Democracy as competitive oligarchy is known in the con 

theory literature as Schumpeterian democracy. Schumpeter (1942) 
defines democracy as a mechanism to manage power relations be- 
tween political partes, in this case the former warmng factions. To put 
it differently, Schumpeterian democracy 1s an institutional arrange- 
ment for arriving at political decisions by vestmg m certam political 
leaders the power to rule as a consequence of their successful quest 
for the people's votes (269). I should acknowledge that elections in 
medieval Europe were local or regional and were not part of na- 
tionwide democratic practices, at least not untl 1760 or the early 
nineteenth century. 
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A Defense of the “Minimalist” Nature of 
Post-Clvil War Democracy 


Post-civil democracy is Schumpeterian at its core be- 
cause of its emphasis on leadership and the fact that po- 
litical office holders (the former warlords) are elected 
rather than appointed by Leviathan. Post-civil war 
democracy is also Popperian because it is aimed at 
avoiding the return to civil war and anarchy by pre- 
venting tyrannical rule by whoever wins the election.» 
Thus, post-civil war democracy is essentially procedu- 
ral, minimalist. Is this form of democracy defensible? 

Przeworski (1999) justifies Schumpeterian democ- 
racy using Popperian standards, He shows that even if 
democracy does not appropriately aggregate citizens’ 
preferences or adequately control politicians or help 
reduce economic inequalities, the very fact that it helps 
change governments without bloodshed can help gen- 
erate a peaceful resolution of conflicts. As in Result 4, 
this holds true if parties assess a high enough chance 
of winning the elections so that the short-term net gain 
of not accepting electoral outcomes is outweighed by 
the long-term gain of abiding by them. Thus, competi- 
tion for offices generates compliance with democratic 
rule, and “bloodshed is avoided by the mere fact that, 
a la Aristotle, the political forces are expected to take 
turn” (47). Democracy also works like a lottery, a coin- 
flipping operation. However, a coin flip differs from 
voting or democracy because election outcomes en- 
able competing factions to assess each other’s military 
strength. “Voting constitutes flexing muscles: a reading 
of chances in an eventual war” (48). But what if the 
eventuality of war is unlikely because the state has al- 
ready monopolized the means of coercion so that vot- 
ing cannot provide a reading of chances in a violent 
conflict? 

Przeworski (1999) argues that, at the very least, elec- 
tions reveal some information about the cost of an 
eventual rebellion. In a post-civil war democracy, a 
party that contemplates subverting democracy will face 
the resistance of both the warring factions and the in- 
direct resistance of ordinary citizens through underin- 
vestment. 


The Democratic Transition Literature 


More generally, the theory developed here comple- 
ments the “top-down” theories of democratization, 
which explain the process of democratization in the 
absence of a strong civil society. The fact that democ- 
racy emerges in nations with weak civil societies has 
led many scholars to downplay the importance of civic 
culture in transitions to democracy and emphasize 





% Popper’s (1962) conception of democracy emphasizes the major 
differences between democracy and tyranny. Whereas democracy de- 
rives its legitrmacy from the people and 1s the only regime that makes 
It possible for the ruled to dismiss a given government without blood- 
shed (by means of elections), tyranny can only be dismissed through a 
revolution and political violence. A political system us not democratic 
because the majority rules; it 1s democratic because the institutions 
are designed to prevent the rise of a totalitarian government, which 
can be dismissed only by means of violent revolution. 
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elite bargaining (Diamond, Linz, and Lipset 1989; 
O’Donnell and Schmitter 1986; Przeworski 1991).7’ 
Weingast (1997) argues that citizens’ democratic val- 
ues and elite interests are complementary aspects 
of democratization and that the study of democratic 
civic culture is necessary to understand political elites’ 
strategies during the process of democratization. How- 
ever, Diamond, Linz, and Lipset (1989) assert, “In 
Latin America, the choice of democracy by political 
elites clearly preceded the presence of democratic val- 
ues among the general public” (12). O’Donnell and 
Schmitter (1986) claim that transition to democracy is 
often the outcome of a division between hardliners or 
softliners, within the authoritarian regime and between 
radicals and moderates within the opposition (15-16). 
And Rustow (1970) hypothesizes along the lines argued 
here that democratization is set off by a prolonged and 
inconclusive political struggle followed by a “deliberate 
decision on the part of political leaders to accept the 
existence of diversity in unity and, to that end, to insti- 
tutionalize crucial aspects of democratic procedures” 
(355). 

Neither Rustow (1970) nor O’Donnell and Schmitter 
(1986), however, present a theory of why political 
forces involved in an “inconclusive” conflict would se- 
ttle precisely on democracy and not on other forms 
of power sharing. So why do political groups in conflict 
ever settle on democracy? Przeworski (1999) addresses 
this question, first by defining democracy as an incom- 
plete power-sharing contract, with the ultimate or resi- 
dual power changing hands with positive probability. 
This arrangement is contrasted with dictatorship, where 
the residual power never changes hands. Przeworski 
then argues that a dictatorial contract is not an attrac- 
tive option for political groups in conflict because it 
would give one group a decisive advantage in the event 
of an open conflict. 

Przeworski’s (1999) argument is a helpful point of 
departure but it does not mention the interests and 
the role of ordinary citizens in democratization, which 
is crucial in helping to understand why democracy in- 
volves voting, not simply coin-flipping. In the context of 
post-civil war society, citizens benefit from democrati- 
zation because it generates social order and protection 
against illegal theft and political banditry. I argue that 
democracy enables citizens to control expropriation 
and warring factions to commit to respecting property 
rights. I present an alternative approach to the choice 
of democracy by defining democracy as an arbitration 
mechanism. I show that in a situation of stalemate, the 
warring factions looking for an exit option from the 
civil war will tend to design a power-sharing contract 





37 Note, however, that Ruth Coller (1999) argues that more recent 
transitions were less elite-driven and involve more social mobihza- 
tion and popular protest. In my judgment, this was not the case in 
most rf not all post crvil war transitions 

38 In the context of El Salvadorian avıl war, Woods (2000) argues 
that the warring factions reached a bargam in which the “Left agreed 
to a democratic political regime and a capitalist economy with only 
limited socio-economic reform, and the Right agreed to the Lefts 
participation in a democratic regime along with some socio-economic 
reform” (85) 


and delegate some power to a third party that acts as 
the ultimate arbitrator and enforcer of such a contract 
(e.g., Leviathan). I argue that this delegation of author- 
ity may jeopardize the contract, and cooperation may 
fail to materialize. This may happen if, for example, 
the arbitrator is suspected by one of the parties to be 
biased toward the other party. I derive the rationale 
for democracy by explicitly analyzing the third-party 
arbitrator’s incentives. 

There have been several historical examples of a 
contract of governance arbitrated by the clergy or a 
foreign power. After World War IL, political systems 
in many developing countries were based on explicit 
or implicit elite political arrangements implemented 
by a foreign power. During the Cold War, the Soviet 
Union was the enforcer and the ultimate arbi- 
trator of political arrangements in Eastern Europe, 
whereas the French government played similar roles in 
Francophone Africa.” In Lebanon following the 1990 
“Taif agreement,” the Syrian government became the 
official arbitrator and enforcer of political order in the 
country. 

The theoretical argument developed here suggests 
that even when the external arbitrator is clearly known 
to be neutral, as long it has some military power, it 
cannot commit against using this power to initiate and 
enforce side agreements that would establish one-party 
rule and expropriate citizens. Because such agreements 
clearly make one party worse off, this party will never 
agree to invite an outside arbitrator. Thus, unless an 
external arbitrator is imposed upon the parties, these 
considerations may lead them to prefer another type 
of arbitrator. I argue that this arbitration role can be 
played by the citizenry, which explains why political 
elites could initiate democratic transitions. 

I show that when the electorate is sufficiently diverse, 
the citizens can commit to being an unbiased arbitra- 
tor. Furthermore, in contrast with the external arbi- 
trator, the citizens can also commit against initiating 
side agreements because they do not have the military 
power to enforce such agreements. Thus, the two key 
features of democracy as an arbitration mechanism that 
I want to stress are (1) the inability of a mass of voters to 
get together after the vote to undermine the result and 
(2) that citizens tend to have more moderate or neutral 
party preferences than a single external arbitrator. 

One aspect of the theory developed here that has 
relevance even in the context of postauthoritarian tran- 
sitions is that long-term foreign involvement in domes- 
tic politics cannot be conducive to democracy. A good 
illustration of this insight is Francophone Africa, a text- 
book example of the pitfalls of a political system built 
on elite contracts implemented or arbitrated by a for- 
eign power. As mentioned earlier, despite conceding 
independence to its former colonies in 1960, France 
retained the right to enforce political order in these 
countries, with troops stationed in Gabon, the Ivory 
Coast, and the Central African Republic. As Foccart 
(1994) has acknowledged, France used these rights to 


39 For the case of French intervention, see Foccart 1994. 
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“protect” the power of the “friends of France” among 
African elites. In exchange for this protection, the 
friends of France helped maintain the relatively strong 
French economic and cultural influence in Africa. This 
collusion between some African elites and the French 
government, as well as the resulting political turmoil 
(numerous coups, civil wars, and government bankrupt- 
cies in Congo, Benin, Cameroon, Gabon, and Chad 
among others), is consistent with the theory developed 
here. In line with my theoretical prediction, the wave 
of democratization started in Francophone Africa only 
after the Baule Summit in 1989, when the Mitterrand 
government:switched from a policy of intervention to 
one of disengagement from African internal politics 
This change in French foreign policy made the elec- 
toral process the only mechanism for allocating power 
among the elites and led to more cooperative behavior 
by African elites. Likewise, democratic change swept 
through Eastern Europe only after the collapse of the 
Soviet Union, the external power in charge of enforcing 
communist rule. The Soviet Union’s breakdown cre- 
ated a new balance of forces between political groups 
that facilitated the emergence of democracy. i 


WAR MAKING AND DEMOCRATIC STATE 
MAKING: TILLY, OLSON, AND LEVI 


The theoretical argument presented in this article is 
not only about the choice of democracy as an arbi- 
tration mechanism by warring factions, but also about 
the construction or reconstruction of state institutions 
from chaos. The link between war making and state 
formation, on the one hand, and democratization and 
state building, on the other, has been studied recently 
by Tilly (1990), Ertman (1997), and Levi (1999), among 
others.“ 

Tilly (1990) shows that the creation of national states 
in Europe by power holders or warlords was the un- 
intended consequence of the interaction among war 
making, extraction, and capital accumulation. Euro- 
pean statesmen and state builders waged wars to secure 
territories within which they could enjoy the economic 
benefits of power, that is, a permanent access to credit 
sources and collection of tax revenues. To facilitate tax 
collections and protect their sources of credit, they mo- 
nopolize the means of violence, thereby generating a 
state apparatus. 

Tilly (1990) claims that there are essentially 
three types of states: (1) the coercion-intensive state 
(Brandenburg and Russia), (2) the capital-intensive 
state (generally city-states, such as Genoa, or federa- 
tions, such as the Dutch Republic), and (3) the cap- 
italized coercion state (the only long-term survivor— 
the earliest examples are France and England) (30). 
Significantly, the formation of national states was not 


© There is a large literature in comparative politics and sociology 
that deals with the difficultes of state formation and state consol- 
idation in various places, particularly Africa. For instance, Herbst 
(2000) presents an intnguing and comprehensrve histoncal account 
of the way in which geography and social structure constrained state 
formation and state development in Africa. 
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complete until the nineteenth century, when “the last 
part of Europe to consolidate into substantial national 
states was the city-state band running from northern 
Italy, around the Alps, and down the Rhine to the 
Low Countries” (47). These areas were finally brought 
into national states with the nineteenth-century cre- 
ation of Germany, Italy, and the federal government of 
Switzerland.“ 

Tilly (1990) divides the formation of European na- 
tional states into four periods. First was the patrimoni- 
alist period, which ran through the fifth century, when 
monarchs extracted capital in the form of tribute, and 
“tribes, feudal levies, urban militias, and similar custom- 
ary forces played the major part in warfare” (20). Then 
there was the brokerage period (1400-1700), when 
rulers relied on mercenary soldiers for war and for- 
mally independent capitalists for loans. It was followed 
by the nationalization period (1700-1850), when states 
created mass armies drawn from their national popula- 
tion and took over the direct operation of the fiscal ap- 
paratus. Finally, we had the specialization period (1850 
to the recent past), in which military and nonmilitary 
activities of the state were increasingly differentiated, 
legislatures came to dominate the military, and the wel- 
fare state developed (29). Thus, according to Tilly each 
step resulted in an increase in the “organizational con- 
tainment of military men” (124). The story of European 
state formation is the gradual subordination of military 
force to outside political control. This process has en- 
tailed bargaining between states and their subjects. In 
return for the greater contributions (in men and wealth) 
that society has been asked to make to the state’s war- 
making efforts, the state has had to provide protective 
institutions and social welfare pro 

Various scholars have studied the link between the 
economic dependence of the ruling class on tax rev- 
enues and the taxpayer demand for political represen- 
tation. It is argued that rulers maximize revenues by 
promoting joint ventures with taxpayers. In particular, 
they grant voting rights to citizens as a way to com- 
mit to this joint venture, and citizens use these rights 
to protect their investments (Levi 1999, 114). In other 
words, citizens give up a share of wealth to the ruler in 
exchange for political rights that enable them to protect 
their property rights. 

The bargaining power of taxpayers is even stronger 
when their assets are mobile (Bates and Lien Da- 
Hsiang 1985). It leads to concessions that take var- 
ious institutional forms. North and Weingast (1989) 
study the evolution of constitutional arrangements in 
England following the Glorious Revolution of 1688. 





41 In line with Tilly’s (1990) analysıs, Lopez-Alves (2000) finds that 
the type of war and type of mobilization have a great effect on state 
institutions and the nature of the regime. For instance, there was a 
high degree of rural mobilization and a high frequency of civil wars 
m postcolonial Uruguay and Colombia. This weakened the Central 
army but led to a more competitive and less repressive and open 
political regime. In a similar study, Centeno (2002) finds that strong 
and centralizing state authority was absent in most Latin American 
countnes. As a result, those states were not effective in protecting 
their subjects or collecting rents and did not have the capacity to fight 
one another. 
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They argue that the resulting institutions enabled the 
Crown to credibly commit to upholding property rights 
and to tightening budget constraint. The new institu- 
tions removed the underlying source of the expedi- 
ency, limited the Crown’s legislative and judicial power, 
and reasserted the taxation power of the parliament, 
especially to allocate funds and monitor expenditures. 
Thus, the Crown’s commitment to honoring the agree- 
ment was established because taxpayers had a voice 
through their representatives in Parliament.” Levi 
(1988), Taylor (1987), and Brewer (1989) provide a 
range of evidence that supports this conclusion. In par- 
ticular, Levi (1988) shows that “early modern parlia- 
ments enabled the English monarch to negotiate taxes 
that were acceptable to key constituents who then 
helped enforce them” (116). 

The argument presented in this article combines 
insights from Tilly’s (1990, 1999) war-making-leads- 
to-state-making thesis and Levi’s (1999) no-taxation- 
without-representation thesis. In contrast to the Eu- 
ropean experience analyzed by Tilly, the state-making 
experience analyzed here is a product of civil wars, not 
interstate wars. The governors-to-be do not have sta- 
ble and secure territory where they enjoy a monopoly 
of protection. Instead they have joint control of the 
territory and are involved in an oligopolistic game of 
protection. As a result, they have to agree on how to 
divide the gains for protection, divide the peace divi- 
dends, and jointly commit to protect citizens’ property 
and not overexpropriate them. The aspiring governors 
achieve this goal by granting voting rights to citizens 
and creating a balanced and inclusive state to enforce 
election outcomes. Thus, contrary to Tilly’s claim, the 
war-making-leads-to-state-making thesis is at least par- 
tially applicable to post-World War II developing coun- 
tries, particularly war-torn ones.” 

Olson’s (1993) analysis of the economic origins of 
dictatorships and democracies is in many ways simi- 
lar to the argument developed in this article. As in 
the present analysis, Olson starts from a situation of 
a Hobbesian state of nature and argues that it is a 
suboptimal political organization because it does not 
provide enough incentive for investments. Following 
the Hobbesian logic, he shows that autocracy would rise 
out of anarchy because the “bandits” that exist under 
anarchy would have an incentive to become stationary 
bandits and make themselves autocrats. They provide 
political order, which, to their benefit, enhances the 
productivity of their subjects. However, autocrats tend 


“ Note that I am adopting the North and Weingast (1989) view that 

d evolved in a form of democracy at the Glorious Revolution. 
® Tilly (1990) claims that, m contrast to the experience of Europe, 
developing countnes are not becoming less dominated by the military 
but, rather, are actually more dominated by them (209, 212). Tilly 
explains this difference by reference to what he calls the greater “ex- 
ternalization” of state formation in the decolonized countries. First, 
developing country military rulers have been able to draw revenue 
from commodity exports or great power military aid, thus allowmng 
them to ing with their subject populations” (208). 
Second, Tilly hypothesizes that the competition between the United 
States and the Soviet Union created incentives for the superpowers 
to back military regimes to bring as many states as possible into their 
rival camps 
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to tax too much, and democratic leaders will replace 
them. Olson shows that a democratic leader will ex- 
tract less in taxes than an autocrat, because he or she 
represents the majority of the population that earns a 
“significant share of the market income of the society 
and has a more encompassing interest in the productiv- 
ity of the society” (570). He also shows that democracy 
will arise when none of the leaders involved in the coup 
or the upheaval to overthrow the dictator are strong 
enough to rule on their own. The argument is supported 
with the historical example of the initial emergence of 
democracy in England following the Glorious Revolu- 
tion of 1689. 

Asin this article, Olson’s (1993) analysis points to the 
protection of property against illegal expropriation as 
one major motivation for democratization. However, 
whereas my goal is to investigate conditions for the 
emergence of democracy from anarchy, Olson follows 
the traditional route to explaining the emergence of 
democracy out of autocracy, analyzing the conditions 
under which democrats would overthrow stationary 
bandits. Thus, instead of assuming that democracy can 
arise only after a stationary bandit generates political 
order, I show that a situation of anarchy can generate 
political order and democracy simultaneously. The cost 
of the conflict leads the warring factions to consider 
three options, maintaining the status quo, inviting an 
external arbitrator, and choosing the people as arbi- 
trator. There is an equilibrium in which warring fac- 
tions can choose to agree on democracy as long as each 
faction assesses high enough chances of winning the 
elections. They make the choice of democracy credible 
by constructing (perhaps with the temporary help of an 
external force) an impartial enforcement mechanism of 
election outcomes. 

The argument represents a significant departure 
from Huntington’s (1968) views on state building in 
changing societies. Huntington quite bluntly rejects the 
possibility of liberal democracy for developing coun- 
tries. He considers the formula of designing govern- 
ments in most developing countries with written con- 
stitutions, separation of powers, bills of rights, checks 
and balances, regular elections, and competitive parties 
to be irrelevant. He contends, 


In many, if not most, modernizing countries elections serve 
only to enhance the power of the disruptive and reac- 
tionary forces and to tear down the structure of public 
authority .... The primary problem is not liberty but the 
creation of a legitimate public order. Men may have or- 
der without liberty, but they cannot have liberty without 
order. Authority has to exist before can be limited, and it 
is authority which is in scarce supply in those modernizing 
countries where government is at the mercy of alienated 
intellectuals, rambunctious colonels and noting students. 
(7-8) 


Needless to say, the theory developed in this arti- 
cle and the historical evidence of many Latin Ameri- 
can and African countries indicate quite emphatically 
that the promise of political order is more credible 
when democratic institutions support it. Communist 
and right-wing authoritarian governments around the 
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world did actually attempt to crush alienated intellec- 
tuals and rioting students in the 1970s and the 1980s. 
But the resistance of those intellectuals and students 
against government oppression made the third wave of 
democratization around the world possible. 


CONCLUDING REMARKS 


Post-civil war democracy is Hobbesian in its origin, 
Lockean and Schumpeterian in its functions. It simul- 
taneously arbitrates conflicts between former warring 
factions and protects citizens against illegal expropria- 
tion. It shows that warring factions can create not only a 
state, but indeed a democratic state. Above all, warlord 
democratization confirms Kant’s ([1795] 1997) predic- 
tion that a good political constitution (assuming that a 
democratic constitution is one of them) does not have 
to result from a progress of morality but instead can re- 
sult from an outcome of strategic interaction between 
“intelligent” warlords. Also, if we were to believe, as 
Kant does, that the irresistible “will of nature” will force 
warring factions to do what is in their best interests, 
then democracy will become the natural outcome of 
civil wars, provided that there is military stalemate and 
the factions!are economically dependent on citizens’ 
productive investments. 

Post-civil war democracy provides a way of bypa- 
ssing or (not returning to) the Prince or Leviathan. In- 
deed, Machiavelli, and many others after him (Hobbes, 
Olson, Huntington) have claimed that popular govern- 
ments could'be established only after an intermediate 
period of one-man rule, whether Moses, Cesare Borgia, 
Napoleon Bonaparte, Chang Kai-Shek, or Mobutu. I 
argue that democratic states can be created directly 
from anarchy. A popular regime can, in fact, skip the 
“princely” stage and go directly from oligarchic conflict 
to a form of popular government. 
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is paper examines the bases of opposition to immigrant minorities in Western Europe, focusing 

on The Netherlands. The specific aim of this study is to test the validity of predictions derived 

from two theones—realistic conflict, which emphasizes considerations of economic well-being, 

and social identity, which emphasizes considerations of identity based on group membership. The larger 
aim of this study is to investigate the interplay of predisposing factors and situational triggers in evoking 
political responses. The analysis is based on a series of three experiments embedded in a public opinion 
survey carried out in The Netherlands (n = 2007) in 1997-98. The experiments, combined with parallel 
individual-level measures, allow measurement of the comparative impact of both dispositionally based 
and situationally triggered threats to economic well-being and to national identity at work. The results 
show, first, that considerations of national identity dominate those of economic advantage in evoking 
exclusionary reactions to immigrant minorities and, second, that the effect of situational triggers is to 
mobilize support for exclusionary policies above and beyond the core constituency already predisposed 


to support them. 


emocratic politics in Western Europe faces a 

new challenge. In Austria, France, Germany, 

and Italy most dramatically, but throughout 
Western Europe more generally, the surge of immigra- 
tion over the last decade has had major political reper- 
cussions. Politically, it has triggered intense debate 
about the nature of citizenship (Favell 1998), the rights 
of women (Okin 1999; Wikkan 2002), the claims—and 
limits—of multiculturalism (Barry 2001; Parekh 2000), 
even restrictions on the scope of free speech in order to 
promote group tolerance (Horton 1993; Modood 1993). 
Electorally, the traditional right has broadly benefited 
in a number of countries (e.g., Hitchcock 2002), and 
in some the extreme right has profited particularly, 
including the National Front in France, the National 
Alliance in Italy, and the Austrian Freedom Party in 
Austria (Mayer 2002; Sniderman et al. 2000; but see 
Kitschelt 1997). 

The aim of this study is twofold. The first is to gauge 
the extent to which concerns over cultural and eco- 
nomic integration now drive reactions to immigrants 
and immigration in Western Europe. The second is 
to show how the dynamics of public responses to po- 
litical issues follow from the interplay of two sets of 
factors, predisposing concerns and situational triggers. 
The specific way that these two factors operate in com- 
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bination is a key to unexpected, large-scale electoral 
mobilization—in a word, to the flash potential of anti- 
immigrant politics. 


THEORY AND HYPOTHESES 


One of the oldest intuitions—it dates from at least 
Aristotle—is that conflict between groups is rooted in 
a clash of interests. Group interests can clash over a 
wide horizon of valued goods, including claims to so- 
cial status and privileges (Blumer 1958, 4; Bobo and 
Hutchings 1996, 955). But on most realistic conflict in- 
terpretations, the core of group conflict is the clash of 
competing economic interests (Hardin 1995). In prin- 
ciple the calculus of economic advantage may operate 
at the individual or collective level, but in the theoreti- 
cally best-developed interpretation of realistic conflict 
theory, people identify with a group because they have 
an individual interest in doing so (Hardin 1995, 48). 
Social identity theory is now the principal theoreti- 
cal alternative to realistic conflict theory (e.g., Brown 
1995; Capozza and Brown 2000; and Huddy 2001).! The 
premise of social identity theory is that an integral ele- 
ment of individuals’ sense of who they are is based on 
what groups they belong to or identify with. Since all— 
or nearly all—people strive for a positive self-concept, 
so all—or nearly all—are motivated to evaluate posi- 
tively groups that are the basis of their social identity 
(Tajfel 1981). To evaluate their own group positively, 
they are often—though not always (see Brewer 2001)— 
motivated to evaluate other groups negatively. 
Realistic conflict and social identity explanations 
need not be mutually exclusive. In all likelihood, 


1 To be sure, there 1s a host of older theoretical approaches—for ex- 
ample, personality-based accounts of prejudice (e.g., Adorno et al. 
1950) and social structural accounts (e.g, Selznick and Steinberg 
1969). We take account of other relevant factors below. For an excel- 
lent overview of a wide range of approaches, see Duckitt 1992. 
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concerns about both economic well-being and national 
identity underlie reactions to immigrant minorities in 
Western Europe, to some discernible extent. But the 
two theories point in different explanatory directions. 
Realistic conflict explanations concentrate on social— 
structural sources of group difference. They take the 
key explanatory mechanism to be economic competi- 
tion. And they presume the driving motive is a desire 
to be materially better off. Social identity explanations 
concentrate on an array of group memberships. They 
take the key explanatory mechanism to be group cat- 
egorization. And they presume the driving motive is a 
need for positive differentiation. Our understanding of 
the heart of the conflict over immigrant minorities, and 
what may be done to ease it, will differ depending how 
far strains over immigration turn on a conflict of mate- 
rial interests rather than a clash of cultural identities. 

The Netherlands i is the site of our study. There, but 
not only there, our hypothesis is that considerations of 
national identity dominate those of economic interest 
given current economic and political conditions. Two 
considerations are key. 

First, the bearing of economic concerns on issues 
of immigrants and immigration is conditional on eco- 
nomic conditjons: The worse the economic conditions, 
the greater the impact of economic concerns; the bet- 
ter the economic conditions, the lesser (Coenders and 
Scheepers 1998; Quillian 1995). Over the last decade, 
The Netherlands has been among the best off of the 
OECD countries, with GDP growing on the one side 
and government deficits declining on the other (Visser 
and Hemerijck 1997). Moreover, most Dutch recognize 
they have become better off and expect to become bet- 
ter off still? Given the positive economic conditions 
and the positive perceptions of them, concerns about 
economic threats immigrants may pose should fall to- 
ward the lower end of their range of potential influence. 

Second, the impact of concerns about national iden- 
tity is conditional on the prominence of differences 
between groups. Specifically, concern over group iden- 
tity varies as a function of perceptual distinctiveness, 
salience, and entatitvity.? In terms of perceptual dis- 
tinctness, many of the immigrant minorities in The 
Netherlands stand out by virtue of darker skin color 
(e.g., Surinamese, Moluccans Moroccans, and Turks), 
by virtue of dress (with Muslim women, especially 
older women, wearing headscarves and long dresses 
and some Muslim men wearing caps), by virtue of lack 
of fluency in Dutch, and by virtue of educational and 
labor market' handicaps (Hagendoorn, Veenman, and 
Vollebergh 2003). With regard to salience, issues re- 
garding immigrant minorities have been a prime topic 
in all the mass media since the early 1990s, with public 
references to the cultural distinctiveness of immigrant 


2 In our sample, for example, 85% believe that they and therr fam- 
tty will be as well or better financially two years in the future and, 
similarly, that the economy will be as good or better two years m the 
future, i 


3 Brown (1995, 63); entatrvity ıs a term introduced by Campbell 


(1958), referring, roughly, to the perceived internal cohesrveness of 
a group. 
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minorities increasing over this period. Finally, with re- 
gard to entativity, immigrant minorities are spatially 
concentrated in The Netherlands and tend to have 
strong family and group loyalties, unifying beliefs, and 
distinctive practices.* Given the perceptual distinctive- 
ness, salience, and entativity of immigrant minorities 
in The Netherlands, the impact of conflicting cultural 
identities should fall toward the higher end of its range 
of influence, 

But how, exactly, do concerns about either eco- 
nomic well-being or cultural identity influence citizen 
responses to immigrant minorities and issues of im- 
migration? Consider concerns about economic well- 
being, for the sake of illustration. They can manifest 
themselves in two different ways. People may have 
developed, for whatever reasons, an ongoing concern 
about their economic prospects that manifests itself in 
a generalized readiness to be concerned about being 
economically worse off. On the other hand, some aspect 
of people’s immediate circumstances can trigger a con- 
cern about economic well-being. The critical question 
is, How do predisposing factors and situational triggers 
in combination shape reactions to ethnic minorities? 

From a political point of view, two alternatives stand 
out.’ A situational trigger may galvanize those already 
concerned about a particular problem. Alternatively, 
it may mobilize citizens whether or not they already 
were disposed to be concerned about the problem. Po- 
litically, there is all the difference between galvanizing 
a core constituency and mobilizing a broader public. 
The former increases the intensity of support for a pol- 
icy; the latter also enlarges the portion of the public in 
support of it. 

It is an open question when each effect occurs, since 
either clearly can occur under some conditions. But 
for the kinds of concerns at the center of issues of 
immigration, we believe that a mobilizing effect is more 
likely than a galvanizing effect. All, or nearly all, care 
about their economic well-being, not on a continuous 
basis, to be sure, but when a risk to their economic 
prospects becomes salient. Possibly fewer, but all the 
same a great many, care about their country’s national 
identity and culture, again not on a continuous basis, 
but when a risk to the national way of life becomes 
salient. A large portion of the public accordingly should 
be ready to respond to circumstances triggering a con- 
cern about their economic well-being and their coun- 
try’s way of life above and beyond those immediately 
concerned about either. 


4 In our sample, for example, immigrants are overwhelmingly per- 
ceived by the Dutch to live therr lives together with much contact 
among one another, to have a solid group structure bound together 
though extended families and strong internal obligations, to share 
essentially the same values, and even to be well organized to pro- 
tect their common interest. The percentage agreeing with each of 
these judgments about mmorities, in our sample, is 91, 96, 80, and 77, 


respectively. 

5 The literatures on predisposing factors and situational triggers cor- 
respond, broadly, to political and sociological studies through the 
medium of public opmion surveys and social psychological studies 
through the medium of laboratory experiments, respectively. For a 
review of both, see Duckitt, 1992 
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TABLE 1. individual and Collective Threats 
Threat 
Individual Individual Collective Collective Collective 
Threat Safety Economic Safety Economic Cultural 
Individual safety 41 .48 38 .39 
(1007) (1011) (1004) (999) 
Individual economic 19 43 59 52 
(972) (1008) (1002) (996) 
Collective safety 38 11 56 57 
(983) (972) (1005) (1000) 
Collective economic 20 41 .20 64 
7) (963) (971) (995) 
Collective cultural .19 .29 .40 
__ 6) _ (950) (981) (950) 
Note: Call entries are Spearman's rank correlations. Above-diagonal correlations are for the coupled condition; below-dlagonal, for the 
decoupled condition. Unwelghted number of cases In parentheses. 





Gauging the joint impact of predisposing factors and 
situational triggers requires a combination of random- 
ized experiments and representative public opinion 
surveys. In this study we report results from a series 
of three experiments as part of a national survey of 
the adult population of The Netherlands, carried out 
from October 1997 through June 1998 by the CATI— 
Department of the Faculty of Social Sciences, Utrecht 
University (n = 2007). The first, the Decoupling exper- 
iment, focuses on predispositions to perceive threats to 
economic well-being and cultural identity.® The second, 
the Fitting-In experiment, builds on the first by examin- 
ing the situational triggering of threats to identity and 
economic well-being. The third, the Identity Priming 
experiment, builds on the second by examining the sit- 
uational priming of national identification. 


THE “DECOUPLING” EXPERIMENT 


Our first objective is to gauge the comparative impor- 
tance of threats to economic well-being and cultural 
identity in driving negative evaluations of immigrants. 
A threat to a group’s identity and way of life inherently 
is a collective threat. In contrast, threats to economic 
interests may be perceived by individuals as threats 
to their own economic well-being or, alternatively, as 
threats to the economic well being of their group as a 
whole. At a more popular level, immigrants have be- 
come linked to problems of crime. Threats to safety 
may be perceived by individuals as threats to their own 
safety or, alternatively, as threats to the safety of indi- 
viduals in the society as a whole. Accordingly, we assess 
three types of threats—to cultural identity, economic 
well-being, and safety—and assess the last two at two 
different levels—individual and collective. 

But how should perception of a threat be assessed? 
Here is a representative indicator from the Bobo and 
Hutchings (1996) Index of Perceived Group Competi- 
tion: “More good jobs for (group mentioned) means 
fewer good jobs for members of (other group men- 
tioned).” Notice that the test indicator asks whether an 


gee ate ves uch in deht to qhipieneerima Kipari Seneme at 
Marcus et al (1995), though our terms differ from theirs. 





object of value, good jobs in this instance, is at risk and 
whether a particular group is responsible for it being 
at risk. The test indicator is thus “double-barreled.” It 
simultaneously assesses how respondents feel about a 
possible threat and how they feel about a particular 
group identified as the source of the threat. But just so 
far as threat perception indicators tap both a judgment 
that feelings toward a group and a concern that a ob- 
ject of value is at risk, there is an obvious danger of 
tautology in explanations of group intolerance. People 
may see a particular group as threatening and therefore 
dislike it, But they may also say that it is threatening be- 
cause they dislike it. And to the extent that the second 
alternative is right, it means that standard threat per- 
ception measures are measuring the very thing, dislike 
of a group, that they are meant to explain. 

Hence the Decoupling experiment. This experiment 
has two conditions. Consistent with standard practice, 
the object at risk and the group allegedly putting it at 
risk are coupled in the first. So one half of the sam- 
ple, randomly selected, is asked whether they agree 
strongly, agree somewhat, disagree somewhat, or dis- 
agree strongly with a series of five statements. 


“I am afraid of increasing violence and vandalism in my 
neighborhood by ethnic minorities.” 

“I am afraid that my economic prospects will get worse 
because of ethnic minorities.” 

“I am afraid of increasing violence and vandalism in Dutch 
society by ethnic minorities.” 

“These days, I am afraid that the Dutch culture is threat- 
ened by ethnic minorities.” 

“I am afraid that the economic prospects of Dutch society 
will get worse because of minorities.” 


In the second experimental condition, in contrast, the 
wording of the question is different in one respect: The 
reference to ethnic minorities is omitted. So the first 
question, for example, reads, “I am afraid of increasing 
violence and vandalism in my neighborhood.” 

Just so far as respondents are reacting on the basis 
of how they feel about the group mentioned (rather 
than the threat mentioned), threat perception indica- 
tors referring to quite different threats should be highly 
correlated whatever is threatened. Table 1 presents the 
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intercorrelations of threat perception indicators in the 
coupled condition, above the diagonal, and in the de- 
coupled condition, below the diagonal. 

As inspection of Table 1 shows, threat judgments are 
markedly more correlated with one another—whatever 
is threatened—in the coupled than in the decoupled 
condition. Thus the mean correlation between threats 
in the coupled condition is .49; in the decoupled con- 
dition, .29. When threat judgments are coupled with 
a reference to ethnic minorities, people who perceive 
themselves to be threatened in one way are markedly 
more likely to perceive themselves to be threatened in 
other ways—whatever those other ways are. In short, 
the double-barreled character of the standard threat 
perception measurement format blurs the distinctness 
of different types of threats. 

Is there also a danger of tautological explanations 
of intergroup hostility? We measured hostility toward 
four ethnic minorities. Respondents are first asked 
about either Turks or Moroccans, then about either 
Surinamese or refugees and asylum seekers. Their feel- 
ings about each group are assessed using a measure of 
eight descriptive adjectives, six negative and two posi- 
tive, validated in a series of previous studies of group 
hostility.’ To the extent that the standard form of threat 
perception measures feelings toward the group identi- 
fied as a source of the threat as well as perceptions of 
the threat itself, measures of one threat should predict 
hostility to ethnic minorities approximately as well as 
any other threat. 

Table 2 summarizes the impact of different types of 
threat on hostility toward the four target groups when 
threat and group source of threat are coupled (first col- 
umn for each group) and decoupled (second column). 
For each target group, we report a measure of mul- 
ticollinearity associated with independent variables, 
their variance inflation factor (VIF)—in italics —and 
unstandardized OLS coefficients and their standard 
errors—in regular font. Consider first multicollinear- 
ity. While coefficient estimates are unaffected, mul- 
ticollinearity inflates their standard errors, making it 
harder to obtain precise—and statistically significant— 
estimates. One way to assess the degree of multi- 
collinearity associated with an independent variable 
is to calculate its variance inflation factor (VIF). The 
VIF for independent variable j is 1/(1— Rẹ), where 
R is the squared multiple correlation from a regres- 
sion of variable j on all other independent variables 
in the model. In other words, the VIF, is proportional 
to the variance of variable j explained by the other 
independent variables in the model. The larger the VIF, 
the harder it is to distinguish the effect of variable j. 
Table 2 includes estimates of the VIF. Multicollinearity 
is clearly a more serious problem in the “coupled” than 
the “decoupled” condition. To take an example, the 


7 The adjectives are honest, selfish, law-abiding, mtrusrve, slackers, 
violent, complainers, and inferior by nature. (See Appendix for com- 
plete wording.) For an account of the development and validation of 
this measure, see Sniderman et al 2000, chap. 2. 

8 Here as throughout both dependent and independent variables are 
scored from 0 to 1 and the regression coefficients are unstandardized. 
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estimated effect of collective cultural threat on group 
hostility toward Moroccans has a VIF of 2.27 in the 
coupled condition (Table 2), which corresponds to a 
50% inflation of the standard error (./VIF = 1.5). In 
the decoupled condition, the VIF for the same coef- 
ficient is only 1.31, which indicates an inflation of the 
standard error by only 14% due to multicollinearity. 
Note that almost all independent variables are highly 
significant in the coupled condition despite severely 
inflated standard errors (which make it more difficult 
to obtain significance). This is another indication that 
dependent and independent variables are dangerously 
close to measuring the same thing in the coupled condi- 
tion. In short, when threat perception indicators share a 
reference to the same group, ethnic minorities, it is very 
difficult to differentiate the impact of different types of 
threats on hostility to ethnic minorities, 

In contrast, when reference to what is threatened is 
decoupled from reference to ethnic minorities as the 
source of threat, the differential impact of different 
kinds of threats stands out. Most obviously, perceiving 
a threat to Dutch culture has by far the largest impact 
in provoking hostility toward minority groups. That is 
true for every group—Turks, Moroccans, Surinamese, 
and refugees and asylum seekers. In contrast, popu- 
lar discussion of problems of crime notwithstanding, 
threats to safety are the least important in account- 
ing for hostility to ethnic minorities, and this is true at 
both the individual and the national collective level. 
Specifically, perceived threats of violence and vandal- 
ism in the society as a whole are significantly related 
only to hostility to refugees. Moreover, perceived per- 
sonal threats of violence and vandalism are related 
only to hostility to Moroccans and refugees, and not 
only are the coefficients small in size but the sign of 
the relation is the opposite of what one would expect. 
Considerations of economic interest fall in between, of 
more consequence than threats to safety, of less than 
threats to cultural identity.!° Looking at reactions sum- 
marized across groups, shown in the far right-hand set 
of columns, the coefficients for perceived threats to 
individual and to collective economic well-being are 
approximately equal in size. 

Some oscillation in the size of coefficients across dif- 
ferent target groups should be noted. Both egocentric 
and sociocentric safety threats are significant predictors 
for hostility to refugees, but the former is puzzlingly 
negative in sign, as it also is for Moroccans. Also, socio- 
centric economic threats are significant for Surinamese 
and for refugees and asylum seekers but not for Turks 
or Moroccans. For that matter, Table 2 also shows that 
economic threats matter less, or not at all, as a source of 


9 It is worth underscoring that these data were gathered before 9/11. 
10 A presumption that concerns over economic well-being, particu- 
larly over personal economic well-being, are the driving motrvational 
factor in political choice seems to some axiomatc. It is accordingly 
worth observing that, when asked whether their chances of getting 
what they want for things like housing, job promotions, and the like 
are any better or worse because of ethnic minorities m the country, 
only 14% responded worse, and more than half did not believe that 
their chances would be very much worse or that it would be unfair if 
they were. 
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TABLE 3. ` Regresa ing Soclal Distance on Threats, by Group _ 
























== p< .001, “p< .01,*p<.05. 





Turks Moroccans Surlnamess Refugees 
constant 362" 369" 351" 350" 
(.028) (.030) (.027) (.029) 
Individual safety threat —.023 —.032 .018 —.016 
(.021) (.022) (.020) (021) 
Individual economic threat 066" .085* .052* .068™ 
(.023) (.025) (.023) (.024) 
Collective safety threat .013 .030 —.014 .059 
(.034) (. re (.033) (.035) 
Collective cultural threat 194 .1g2-" 144 172 
(.023) ( 024) (.022) (.023) 
Collective economic threat 061* .050 082" 052" 
(.024) (.026) (.024) (.025) 
Adj. R?: 15 14 12 13 
N 844 842 843 836 








Note: Cell entries are understandardized OLS coefficients, with standard errors in parentheses. Number of cases le unweigh- 





hostility to Turks and Moroccans but are a significant 
source of hostility to Surinamese and to refugees and 
asylum seekers. What might explain these differences 
between groups? 

One possibility follows from a confound of order and 
religion effects. Respondents were first asked about ei- 
ther Turks or Moroccans, then about either Surinamese 
or refugees and asylum seekers. Respondents thus al- 
ways were first asked about a Muslim group, and only 
then about a non-Muslim one. And because the first 
group is always Muslim, the (possible) impact of reli- 
gion is entangled with the (possible) impact of order. 
Another possibility follows from the price in confidence 
levels exacted by assignment of subjects to multiple ex- 
perimental conditions. The experimental variation of 
threat perception indicators into two forms, coupled 
and decoupled, cuts the sample size in half. In turn, the 
randomized assignment to one of a pair of minority 
groups cuts it in half again. Thus the effect of indi- 
vidual and collective economic threats on hostility to 
minorities, which is in the expected direction, may not 
reach conventional standards of statistical significance 
because of the limited number of observations. How 
can this pair of measurement problems—the slicing of 
the sample through assignment to multiple treatment 
conditions and the potential confounding of religion 
and question order—be dealt with? 

By design jour study included a second measure of 
hostility toward a group, namely, social distance.” This 
second measure of prejudice differs from the first in two 
respects. First, every respondent was asked the social 
distance questions about every group (rather than a 
randomly selected two of the four). So we have half of 
the full sample to work with in the decoupled condi- 
tion, not just one quarter.” In addition, the order of 


eee a aS 

11 The social distance index consists of two questons. “To have 
[group name] as a neighbor seems to me very attractive, somewhat 
attractive, somewhat unattractive, very unattractive” and “To have a 
[group name] as a life partner seems to me very attractive, somewhat 
attractive, somewhat unattractive, very unattractive.” 

12 We have only one half of the sample, even though every res- 
pondent was asked about every group, because one of every two 
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the groups was randomly varied. So both methodologi- 
cal concerns —the limited number of observations and 
possible confound of order and religion effects—can 
be spiked. 

Table 3 replicates the analysis in Table 2, substituting 
the social distance measure of prejudice for the stereo- 
type measure of prejudice. The results are clear and 
consistent. Perceived threats to safety, whether to the 
individual or to the groups as a whole, are not significant 
for any minority group. Perceived threats to economic 
well-being at both the personal and the national level 
are significant predictors of hostility for every minor- 
ity group. And perceived threat to the Dutch culture 
is far and away the strongest predictor of hostility to 
minorities whether or not the minority is Muslim. 

To this point, an explanatory account of intolerance 
toward immigrants centered on concerns about na- 
tional economic well-being has been tested indepen- 
dently of other explanations of prejudice. To what ex- 
tent do either of these concerns matter, it is necessary 
to ask, if the standard factors invoked in analyses of 
intolerance are taken into account? 

In Table 4 the analysis incorporates many of the 
larger family of sociodemographic and psychologi- 
cal factors established by previous research to be 
sources of intolerance—authoritarian values, educa- 
tion, self-esteem, occupational status,!* and level of 
employment—in addition to perceived cultural and 
economic threats. The analysis is done twice, to test 
the robustness of the findings, once for the stereotype 
measure of prejudice and once for the social distance 
measure. Two aspects of the results should be under- 
lined. Measures of authoritarianism (whatever it is that 
they actually measure) have been powerful predictors 
in previous studies of prejudice; and in this study the 
measure of authoritarian values has a substantially 








respondents was asked the threat questions in the “contaminated” 
coupled condition. 

13 See Appendm for question wording. 

14 The occupational status takes into account labor force status and 
nature of job—dependent worker, self-employed, and managerial. 
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TABLE 4. Explaining Prejudice 
Stereo 


161 (.019) 





Social Distance 
.144™* (.020) 





Individual economic .041* (.020) .043* (.021) 
threat 
Collective economic .035 (.021) .037 (.021) 
threat 
Authoritarian values 4.158" (.025) .141™ (.026) 
Education —.063* (.028) —.018 (.029) 
Self-esteem —.100™ (.033) —.083* (.033) 
Occupational status .007 (.021) —.035 (.022) 
Not fully employed —. (.024) —.012 (.024) 
Constant 246 (035)  .429°" (.036) 
Adj. A? 27 21 
N 848 871 





Note. Cell entnes are understandardized OLS coefficients, 
with standard errors In parentheses. Number of cases ls 
unweighted. Cases in the coupled condition are excluded. 
= p< 001; *p<.01; *p<.05. 


larger impact than that of any of the other “usual sus- 
pects,” including personality and education. Yet the im- 
pact of perceived cultural threat is every bit as large, for 
both measures of prejudice. The results thus indicate 
that perceived threat to cultural identity adds a substan- 
tial new component to the explanation of intolerance 
above and beyond the factors featured in traditional 
accounts of prejudice. In contrast, the impact of per- 
ceived individual economic threat on both measures of 
prejudice just manages to reach conventional levels of 
statistical significance, while that of perceived collective 
economic threat does not manage even that on either. 

For all analyses, then, threats to cultural identity cut 
deeper than considerations of economic interest (once 
the problem of confounding is dealt with). But perhaps 
threats to economic interest matter especially for those 
who are not well off economically. To test this conjec- 
ture, both measures of prejudice, stereotype and so- 
cial distance, were regressed on perceived threats, with 
perceived economic threat interacted successively with 
income, occupational status, and perceived economic 
well-being. Two sets of analyses were conducted. In one 
the observed values for the income and occupational 
status were used, ignoring missing data; in the other, 
the imputed values, to correct for missing analyses. The 
interaction effects were statistically insignificant in all 
cases. 

There is another and more subtle problem to con- 
sider, however. Just because people say that they per- 
ceive another group to threaten their cultural identity, it 
does not follow that they actually fear for their cultural 
identity. They may say they do to mask their true mo- 
tives. So they may say that their concern is to protect 
the well-being of the national culture and way of life 
when their concern, in truth, is to preserve their own 
economic well-being. 

It is accordingly worth examining the larger matrix of 
considerations in which threats to culture and economic 
well-being are embedded. Just so far as the measure 
of perceived threat to cultural identity measures what 


it is intended to measure, than it should be tied more 
closely to other measures of the importance of national 
identity than to judgments of economic circumstances. 
Similarly, just so far as perceived threats to economic 
well-being measure what they are intended to measure, 
then they should be tied more closely to judgments of 
economic circumstances and prospects rather to eval- 
uations of the importance of national identity. 

We have seen that the differential impact of different 
types of threat on hostility to ethnic minorities comes 
more clearly into view in the decoupled than in the 
coupled condition. A picture of the correlates of threat 
perception should also come more clearly into focus 
in the decoupled than in the coupled condition, if the 
“double-barreled” character of standard threat percep- 
tion measures is indeed a problem. Table 5 accordingly 
reports ordered probit regressions of perceived threats 
to cultural identity and to economic well-being, first 
in the “coupled” condition, then in the “decoupled” 
condition. 

In the coupled condition, indicators of threat to eco- 
nomic interests, whether personal and national, are 
entangled with considerations of identity. Conversely, 
threats to cultural identity are entangled with consid- 
erations of economic well-being. In the decoupled con- 
dition, in contrast, extraneous considerations tend to 
be trimmed. For both threats to cultural identity and 
to personal economic well being, then, there is a sharp- 
ening of discriminant validity when threat perception 
indicators are not double-barreled. We therefore con- 
centrate on the correlates of threat perception in the 
“decoupled” condition. 

Three principal results stand out. The first concerns a 
common basis for perceptions of threats. In a pioneer- 
ing study, Marcus et al. (1995) hypothesized that one 
component of a readiness to perceive a specific threat is 
a generalized readiness to perceive threats. A general- 
ized readiness to perceive threats and a lack of self- 
confidence and self-esteem go together (Sniderman 
1975). Accordingly, we use an index of self-esteem de- 
rived from the California Psychological Inventory, 
to index a generalized readiness to perceive threats. 
And as Table 4 shows, consistent with the hypothesis of 
Marcus and his colleagues, the lower individuals’ self- 
esteem, the more likely they are to perceive threats 
whatever the particular object possibly at risk, cultural 
or economic, individual or collective. 

The second finding concerns the distinctness of dif- 
ferent types of threats, notwithstanding this point of 
psychological commonality. To assess importance of 
national identity to individuals’ personal identity, we 
use an index developed by Luhtanen and Crocker 
(1992). To canvass people’s judgments of economic 
circumstances, we deploy questions about their per- 
sonal and the national financial situation, both retro- 


spectively and prospectively. (For question wording 


15 For a description of procedures for item selection and validation 
of this self-esteem index, see Sniderman et al. 2000. A battery of 
studies has shown that the lower people’s self-confidence and sense 
of self-worth, the more le they are to feeling a diffuse sense 
of apprehension and threat (Sniderman 1975). 
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TABLE 5. Predicting Different Threats 




















Collective Cultural Threat Individual Economic Threat Collective Economic Threat 
Coupted Decoupled Coupled Decoupled Coupled Decoupled 
Personal financial situation 
Two years ago —.15 .03 .09 Bm .12 —.02 
; (.16) (.16) (.18) (.17) (.16) (.15) 
Two years from now 57 10 51% 125" Aim 46" 
(.18) (.15) (.19) (.17) (.16) (.14) 
National financial situation 
Two years from now 5a" .36* .29 .56™ Aon 1.02 
, (.15) (.15) (.15) (.17) (.15) (.14) 
Two years ago .25* 34° 31" .04 5a bi 
; (.13) (.13) (.15) (.15) (.13) (.13) 
Identification with Dutch 1.34 1.53 8" 50" 1.20" son" 
Kdenttty | (.18) (.18) (.19) (.19) (.19) (.18) 
Self-esteem —.63" —.93" —1.08" —.98—" —1.09"" —.56* 
(28) (27) (.25) (.25) ( 25) (.26) 
1st cut polnt 57 —.45 50 48 42 .51 
2nd cut polnt 1.00 .11 108 1.19 .98 1.31 
3rd cutpolnt 179 88 1.69 1.90 1.88 2.21 
polkelneed —1122.6 —1081.2 —928.7 —972.5 —1032.7 —1080.7 
926 885 930 900 928 898 














He Ca eines ae Gerad prob coticara, wit ennard arte m E NUD TCDS “ o< 001, 
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of all items, see Appendix.) Once a common ref- 
erence to ethnic minorities is omitted, a readiness 
to perceive a threat to the national culture is pri- 
marily tied to the Luhtanen—Crocker measure of 
importance of national identity. Similarly, a readi- 
ness to perceive a threat to personal economic well- 
being is primarily tied to judgments of economic con- 
siderations, particularly to judgments that personal 
economic prospects look bleak over the next two 
years and that personal financial circumstances have 
deteriorated! 

The third finding has to do with the intermediate 
position of perceptions of a threat to the national econ- 
omy. On the one side, they are strongly tied to judg- 
ments of economic considerations. But on the other, 
they are substantially related to the importance of a 
national identity to a person’s sense of personal iden- 
tity. Perceived threats to the national economy thus 
have a strong symbolic component. We did not predict 
this result, but we believe that it makes intuitive sense. 
Threats to al country’s way of life and to a country’s 
economy share a common feature. Each focuses on the 
society as a whole, on a sense that “we” are vulnerable. 
And just so far as a perception of threats to the national 
economy has a whole involves an identification of an 
individual wih the larger society, it is not surprising that 
the more important a sense of national identification 
is to individuals’ sense of personal identity, the more 
likely they are to perceive a threat to the economic 
well-being of the nation. In contrast, the distinctness 
of threats toi self-interest and cultural identity under- 
cut the suggéstion that individuals are claiming to pert- 
ceive a threat to the national way of life to mask their 
concern about their individual (or national) economic 
well-being. 
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THE “FITTING-IN” EXPERIMENT 


The results to this point speak to differences between 
individuals in the threats they perceive to economic 
well-being and cultural identity. A second and quite 
different issue also needs to be examined. To what ex- 
tent does the same individual react differently when 
threats to cultural identity or economic well-being be- 
come salient? 

The Dutch study was expressly designed to assess, in 
parallel, situational and dispositional threats to eco- 
nomic interest and national identity. The Fitting-In 
experiment accordingly manipulates the salience of 
threats to economic interests and to national iden- 
tity by manipulating the characteristics of immigrants. 
Respondents are randomly assigned to one of four con- 
ditions. In the first and second, “a group of new immi- 
grants that may come here” is characterized either as 
“highly educated and well suited for well-paying jobs” 
or as “not highly educated or well trained and only 
suited for unskilled jobs.” In the third and fourth con- 
ditions, immigrants are characterized either as people 
who “speak Dutch fluently and have a very good chance 
to fit in smoothly with the Dutch culture” or as people 
who “don’t speak Dutch fluently and don’t have a good 
chance to fit in smoothly with the Dutch culture.” All 
are asked the same test item: “Do you think it is a good 
idea or bad idea for these immigrants to be allowed 
to come here?” Four response options are presented: 
very good idea, somewhat good idea, somewhat bad 
idea, and extremely bad idea. 

Which evokes the stronger reaction—the issue of 
economic integration or that of cultural integration? 
Figure 1 reports the distribution of responses as a 
function of immigrant characteristics. A poor fit either 
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FIGURE 1. The Fitting-In Experiment 
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Note: The question began with the same Introduction for all respondents: “Let me tell you about a group of new Immigrants that may 
come here.” Next, each condition described the immigrants differently. 

Fit in wed econormeally (N = 492). "They are highly educated and well sulted for well-paying jobs.” 

Do not fit in well economically (N = 516): “They are not highly educated or well trained and only surted for unskilled jobs.” 

Fit in wed! culturally (N = 526): “They speak Dutch fluently and have a very good chance to fit In smoothly with the Dutch culture.” 

Do not fit in well culturally (N = 473): “They don't speak Dutch fluently and don't have a good chance to fit In smoothly with Dutch 
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economically or culturally increases opposition to im- 
migration. But whether or not immigrants are likely 
to fit in culturally matters far more than whether or 
not they will fit in economically. As Figure 1 shows, 
four of every five Dutch want to stiffen immigration 
requirements when immigrants do not speak Dutch flu- 
ently and do not have a good chance to fit in smoothly 
with Dutch culture, compared to two of five when they 
do. In contrast, two of every three Dutch do so when 
immigrants are not highly educated and well suited for 
well-paying jobs, compared to one of two when they are. 
Moreover, not fitting in culturally evokes significantly 
more opposition to immigration than not fitting in eco- 
nomically, while fitting in culturally promotes signifi- 
cantly more support for it than fitting in economically.’® 
The pivotal question politically, however, is the im- 
pact of predisposing factors and situational triggers in 
combination. Suppose that public attention is drawn 
to the question of cultural conflict between established 
norms and those of new immigrants, say, by stories in 
the mass media or in the campaign of a national politi- 
cal figure. The salience of the issue of cultural identity 
can operate as a situational trigger. If the effect of this 
trigger is only (or primarily) to activate those already 


predisposed to respond negatively to immigrants, it will 


16 The difference of distributions between the two “not fit- 
ting” conditions 1s significant at p = .002 (x?[3] =14 4), the difference 
between the two “fitting” conditions at p < .0001 (x*[3] =26.6). 


not alter the fundamental political landscape. But just 
so far as the effect of this situational trigger is to in- 
crease opposition from those not already predisposed 
to respond negatively to immigrants, the result is to en- 
large the constituency throughout the society backing 
exclusionary reactions to immigrants. 

Table 6 assesses the specific way in which opposition 
to immigration is conditional both on a predisposition 
to perceive threats to cultural identity and economic 
interest and on the salience of threats to economic in- 
terests or cultural identity. Consider first the interplay 
of predisposing factors and situational triggers with re- 
spect to cultural identity. Column (1) in Table 6 presents 
an ordered probit regression of opposition to immi- 
gration on perceived cultural threat, the experimental 
treatment triggering the issue of cultural integration, 
and the interaction of the two. If triggering a concern 
about cultural integration has a galvanizing effect, then 
respondents more concerned about a threat to the na- 
tional culture should be disproportionately more op- 
posed to immigration of those who will have difficulty 
fitting in culturally, generating a significant interaction 
between cultural threat and experimental condition. In 
contrast, if triggering a concern about a cultural inte- 
gration has a mobilizing effect, it should evoke propor- 
tionately as strong a reaction across-the-board. Table 6 
shows that perceived threat increases opposition to im- 
migration; so, too, does experimental triggering of the 
problem of cultural identity; but there is no interaction 
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TABLE 6.: Responsiveness to Difficulty of Fitting as a Function of Predispositional Threat 





Opposition to Immigration 














(1) (2 (3) 
Collective cultural threat .92— (.26) L 
Experimental condition: Do not fit In culturally .64** (21) 
Experimental condition x collective cultural threat .12 (.33) 
Individual economic threat 57* (.27) 
Experimental condition: Do not fit In economically .40* (.18) 
Experimental condition x Individual economic threat .25 (.36) 
Collective economic threat 1.06° (.26) 
Experimental condition: Do not fit In economically .68** (.21) 
Experimental condition x collective economic threat —.33 (.37) 
tst cut polnt —.36 —1.12 —.84 
2nd cut polnt 72 18 .49 
3rd cutipolnt 1.65 1.01 1.33 
log-likelihood —549.5 —607.7 -593.1 
N . 444 496 492 











Note: Cell entries are ordered probit coefficients, with standard errors in parentheses Number of cases 18 unweighted. = p < .001, 


| Tp<.01,*p< 05. 


between them. In short, the results point to a mobilizing 
rather than a galvanizing effect. 

The robustness of this result can be tested. Equiv- 
alent analyses of the joint effect of a disposition to 
perceive economic threats and a situational trigger of 
economic threats are provided for both individual eco- 
nomic well-being (column [2] in Table 6) and national 
economic well-being (column [3] in Table 6). The re- 
sults parallel those for cultural identity. Triggering ei- 
ther type of economic concern evokes proportionately 
as strong a reaction from those who ordinarily are not at 
all concerned about them as from those who are most 
concerned about them. In short, the impact of situa- 
tional triggers is neither confined to nor concentrated 
among the ranks of those expressly concerned about a 
threat to cultural identity. On the contrary, there con- 
sistently is an across-the-board mobilizing reaction. 


THE “IDENTITY PRIMING” EXPERIMENT 


The Fitting-In experiment turns on direct reference to 
characteristics of immigrants, desirable in one condi- 
tion, frankly undesirable in the other. But of course 
there are constraints against direct public references 
to undesirable characteristics of immigrant minorities. 
Political aspirants who call attention to the problems 
of cultural or economic integration risk being labeled 
racist. They may do so anyway; indeed, have done so 
anyway. All 'the same, it is important to ask, Can op- 
position to immigration be mobilized in a way that is 
unambiguously legitimate? 

In the Identity Priming experiment, respondents are 
randomly assigned to one of two experimental condi- 
tions. In one: their identity as Dutch citizens is primed; 
in the other, their identity as individuals. In the na- 
tional identification condition, the question begins with 
the introduction: “People belong to different types of 
groups. One of the most important and essential of 
these groups is the nation which you belong to. In your 
case, you belong to the Dutch nationality. Each nation is 
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different.” In the personal identification condition, the 
question begins with the introduction: “People differ in 
many ways and each human being is unique. One per- 
son likes music, another likes to go for a walk, still an- 
other likes to go out. Everyone is different.” After the 
priming introduction, all respondents are asked, “Do 
you [as a Dutch person/personally] think that allowing 
new immigrants to enter The Netherlands should be 
made more difficult than it is now?” Responses take the 
form of a modified Likert scale, ranging from strongly 
disagree to strongly agree. 

The crucial question for the politics of immigration 
is whether bringing people’s sense of national (rather 
than personal) identity to the fore enlarges the con- 
stituency opposed to immigration above and beyond 
those already predisposed to oppose it out of a con- 
cern that the national way of life is threatened. Figure 2 
therefore reports attitudes toward immigration condi- 
tional both on the type of prime (national or personal) 
and on the extent of perceived threat to the Dutch way 
of life.!” 

It is natural to suppose that identity politics matters 
for those for whom considerations of national identity 
matter—that is, matter consciously. And they do for a 
large number. In our study, on the order of one in every 
four strongly agree that the Dutch way of life is un- 
der threat; and support for making immigration more 
difficult is very nearly universal among them in both ex- 
perimental conditions, as Figure 2 shows. There is thus a 
ceiling effect on the experimental treatment at one end 
of the continuum of concern about cultural threats. The 
striking aspect of the identity priming experiment re- 
sults accordingly lies at the other end of the continuum. 
Here are respondents who are ordinarily not concerned 
at all about a threat to the Dutch way of life. But when 
they are primed to think in terms of their national rather 
than their personal identity, opposition to immigration 


17 Given the findings on discriminant validity, the results reported 
are for the decoupled condition. 
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FIGURE 2. The Identity Priming Experiment 
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increases significantly—significantly substantively, not 
just statistically. Among the third of the sample that 
reports the lowest cultural threat, 45% agree that im- 
migration should be made more difficult in the per- 
sonal prime condition, compared to 69% in the national 
prime condition. The findings of the Identity Priming 
experiment thus fit those of the Fitting-In experiment. 
Both show that situational triggers mobilize opposition 
broadly through the general public rather than just 
activating a core constituency already predisposed to 


oppose immigration. 


CONCLUSIONS 


What is driving contemporary reactions against immi- 
gration and immigrant minorities in Western European 


democracies? Concerns over economic and over cul- 
tural integration are the two most frequently given 
answers, although concerns over crime and safety are 
also cited. All three are plausible explanations and they 
are not mutually exclusive. But depending on whether 
one or the other is at the center of public concern, we 
shall have a different understanding of what is impelling 
opposition to immigrants and of what can be done to 
relieve it. 
Realistic conflict is the most rigorously devel- 
oped explanation of intergroup conflict. Concerns 
about economic well-being are indeed a source of 
opposition to immigrants and immigration, our re- 
sults show. Moreover, the clash of economic inter- 
ests matters at two distinct levels. Perceived threats 
to individual and to national economic interests 
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evoke hostility to immigrants, in approximately equal 
measure. 

This may seem a self-evident finding. Claims of an 
economic basis of political choice have achieved a near 
axiomatic status. But evidence in their support, for the 
public at large, is sparse (Sears and Funk 1990). This 
scarcity of supportive evidence, Sigelman, Greenberg, 
and Wald (n.d) have suggested, may be a measurement 
artifact. Standard measures of self-interest impute 
gains and losses on the basis of indirect indicators such 
as social class. A risk of slippage is obvious. So Sigelman 
and his colleagues show that self-interest, individually 
assessed, plays a larger role than usually observed in 
public opinion studies. We find the same. But the par- 
allelism of findings based on economic interests self- 
assessed and economic costs experimentally manipu- 
lated provides a new basis for the hypothesis that cal- 
culations of economic advantage are a basis of reactions 
to immigrants and immigration. 

A caveat is in order. Self-assessed economic threats 
are tied to factors having nothing to do with economic 
well-being; self-assessed threats to culture, to factors 
that have nothing to do with culture. Thus both are 
connected te low self-esteem and each is connected 
with the other. Even so, perceptions of economic threat 
are tied to economic considerations in the largest mea- 
sure, and perceptions of threat to the national culture 
are tied to considerations of a national identity in the 
largest measure. And just so far as the two kinds of 
threat are distinct, it makes sense to ask to what extent 
the current anger and resentment against immigrant 
minorities in Western Europe spring from considera- 
tions of economic interest or national identity. 

The answer to this question has some relevance to 
the current policy debate over multiculturalism. To the 
extent that the driving motive is concern over being 
economically worse off, opposition to immigrants is 
rooted ultimately in the economic logic of people’s 
situations. And so far as that is true, it is possible to 
see how an extended period of economic progress can 
open up a path of conciliation. In contrast, to the ex- 
tent that the driving motive is culture, the root prob- 
lem is conflicting identities and values. And so far as 
that is true, there is no reason to believe that improve- 
ments in people’s material circumstances will have an 
ameliorating effect. It accordingly is worth underlin- 
ing that our results, both experimental and nonexper- 
imental, show that concerns over national identity are 
more of a driving force than concerns over economic 
interest. 

On a more general plane, it is self-evident that po- 
litical responses are a product of both predisposing 
factors and situational characteristics. What is not self- 
evident—indeed, what was not possible to investigate 
until the introduction of randomized experimentation 
into public opinion surveys—is how the two sets of fac- 
tors work in combination. Two alternatives stand out. A 
situational trigger may primarily galvanize those who 
already are broadly disposed to back a policy. Or it may 
mobilize support across-the-board in the public. Both 
the Fitting-In and the Identity Priming experiments 
throw light on the form of the relationship between 
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intersecting characteristics of individuals and features 
of their situation. The results of both experiments indi- 
cate that situational triggers mobilize support beyond 
the core constituency already predisposed to oppose 
immigration. 

This study is just a first step. It is necessary to begin 
specifying the conditions under which galvanizing or 
mobilizing effects are more likely to occur. Moreover, 
the design of any specific experiment is imperfect, and 
not only because all measurement is imperfect. There 
is always the issue of external validity. 

Consider the Fitting-In experiment. In real life, 
individuals—and groups—have many attributes. In the 
Fitting-In experiment, immigrants are characterized 
only in terms of their economic skills or linguistic com- 
petence. It is natural to ask, then, whether variation in 
either economic skills or linguistic competence would 
have comparable effects in real life. Here is our best 
estimate, acknowledging a fringe of uncertainty. The 
point about relations between majority and minority 
groups is precisely that the image each holds of the 
other tends not to be individuated. In a word, the 
stripped-down characterization of immigrant minori- 
ties in the Fitting-In experiment corresponds to their 
stripped-down and stereotyped depiction in mass me- 
dia and public discourse. 

A deeper question of external validity centers on 
the implications for politics in the Identity Priming 
and the Fitting-In experiments. Both experiments show 
that the effect of situational triggers is additive, enlarg- 
ing the circle of opposition to immigration above and 
beyond the core constituency already predisposed to 
oppose it. And both experiments, by showing that situ- 
ational triggers make a contribution above and beyond 
that of predisposing factors, thus point to a mechanism 
for “flash” politics. How well does this fit the actual 
politics of The Netherlands? 

For fully five decades, the conservative party, the 
VVD, was in the minority. In the early 1990s Frits 
Bolkenstein, a VVD figure, broke the elite consensus 
against public discussion of problems of the cultural 
integration of minorities. Warning “publicly against 
giving in too much to the cultural iarities of im- 
migrants and argu[ing] that they should be expected 
to integrate much more into the Dutch way of life” 
(Thranhardt 2000, 172), he rode a wave of public pop- 
ularity to the leadership of the VVD, and then to the 
shared leadership of the “Purple Coalition” with the 
Social Democrats, from 1994 to 2002. The meteoric 
rise of Pim Fortuyn is a still more dramatic example of 
the flash potential of anti-immigrant politics. In 2001, 
he burst into public prominence with attacks on Islam 
as “a backward culture” coupled with outright calls for 
reduction in immigration for both immigrants and asy- 
lum seekers. Polls predicted that he would lead one of 
the largest parties in government, possibly the largest: 
He was, partly for this reason, assassinated nine days 
before the national elections in 2002.18 Neither Bolken- 
stein nor Fortuyn could have amassed so much support, 


18 news bbe.co uk/I/hi/world/europe/1971462.stm. 
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in so little time, in the usual quarters—among the more 
maladjusted, less educated, and more marginal. Com- 
bining randomized experimentation and representa- 
tive sampling opens up an opportunity to achieve an 
understanding of sources and dynamics of the “flash” 
potential of anti-immigrant politics. 

Caution is called for, naturally. Our findings—and 
political examples—are drawn from the experience 
of one country only. How far they apply elsewhere is 
necessarily an open question. A distinction between 
generalizability and replicability may be useful here, 
however. Comparative research has shown that 
hostility to immigrants and opposition to immigration 
are tied to the same network of explanatory constructs 
across countries in Western Europe, including 
Germany, France, Belgium, and The Netherlands 
(Hagendoorn and Nekue 1999). We have been able 
to go beyond what has so far been established by 
taking advantage of randomized experiments. But for 
all the advantages of new experimental procedures, 
the disadvantage is precisely that they are new. So 
we would like to underline that the strongest point of 
our study ironically is the one most in need of further 
support. 

A final implication of our results has to do with the 
possibly distinctive character of current strains over 
immigration in Western Europe. Part of the problem, 
quite possibly the largest, has to do with prejudice pure 
and simple. So far as this is so, the story is an old one, 
and the main characters in it—ignorance, parochial- 
ism, economic self-interest, dogmatically moralistic 
and judgmental outlooks, diffuse hostility—play de- 
pressingly familiar roles. But, increasingly, the strains 
over immigration in Western Europe are being cast in 
terms of a division between European majorities and 
Muslim minorities. 

Our findings lend support to a hypothesis of culture 
conflict. A perception that Dutch culture is threatened 
is the dominant factor in generating a negative reaction 
to immigrant minorities. And the issue of cultural in- 
tegration, when it becomes salient, evokes proportion- 
ately just as strong a reaction from those who are least 
concerned about a threat to Dutch culture as from those 
who are most concerned about one. This second finding 
goes substantially beyond the first—for it indicates that 
a readiness to respond on the issue of culture is not 
confined to those actively and consciously concerned 
about the issue. It instead extends throughout Dutch 
society. 

It remains to be determined precisely what lies 
behind a concern with protection of a culture and way of 
life. But these findings raise a worrying possibility. Cul- 
ture is a condensation of shared convictions as to what 
is right and should be valued and what is wrong and 
should be prohibited. Oversimplified and erroneous 
images of the “other” play a part in conflict between 
national majorities and Muslim minorities groups. But 
the divisions between them cannot be attributed solely 
to misperceptions and misunderstandings. There are 
points of genuine and deeply felt differences in values, 
for example, over the right of women to enter Western 
society as equals, without loss of honor to their families 
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or risk of violence to themselves.” Our findings of the 
centrality of the issue of cultural integration accord- 
ingly point to the possibility that the strains over immi- 
gration in Western European democracies are rooted 
in a genuine conflict of values, to an extent yet to be 
determined. 


APPENDIX: QUESTION WORDING 


Threat tems 


Individual Safety Threat: “Now I’m going to read you some 
other statements and this time I want to know whether you 
agree or disagree with each. 

Do you agree strongly, agree somewhat, disagree somewhat, 
or disagree strongly?” ‘I am afraid of increasing violence and 
vandalism in my neighborhood [by ethnic minorities].” 


Individual Economic Threat: “I am afraid that my economic 
prospects will get worse [because of ethnic minorities].” 


Collective Safety Threat: “I am afraid of increasing violence 
and vandalism in Dutch society [by ethnic minorities].” 
Collective Cultural Threat: “These days, I am afraid that the 
Dutch culture is threatened [by ethnic minorities].” 


Collective Economic Threat: “J am afraid that the economic 
prospects of Dutch society will get worse [because of minori- 
ties].” 


Group Hostility (Elght-Item Additive Index) 


“Now we will talk about some of the different groups present 
in our country. For each of the characteristics that I mention, 
can you tell me whether or not it applies to the majority of 
persons belonging to that group. 

Let’s begin by talking about [minority group]. 

Do you agree or disagree that most of them are trustworthy? 
That they behave properly and act honestly? 

Do you agree strongly, agree somewhat, disagree somewhat, 
or disagree strongly with this description?” 


“Do you agree or disagree that most of [minority group] are 
selfish? They think only about themselves, without concern- 
ing themselves very much about others.” (reversed) 


“Do you agree or disagree that most of [minority group] are 
law-abiding? They behave like good citizens, observing the 
regulation and laws of the state.” 


“Do you agree or disagree that most of [minority group] are 
intrusive? They press themselves on you in an annoying and 
insistent way.” (reversed) 


“Do you agree or disagree that most of [minority group] are 
slackers? That they try to avoid working or in any case they 
avoid to do tiring heavy work.” (reversed) 


“Do you agree or disagree that most of [minority group] are 
violent? They often use physical force or threaten to use it, 
in order to impose their will in their relations with others.” 
(reversed) 


19 A literature examining specific cases of value conflict between 
European majorities and Muslim ummigrants is accumulating (e.g., 
Levy 2000 and Wikkan 2002). 
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“Do you agree or disagree that most of [minority group] 
are complainers? That they try to make others feel sorry for 
them?” (reversed) 


“Do you agree or disagree that most of [minority group] are 
by nature inferlor to the Dutch people?” (reversed) 


Social Distance (Elght-ltem Additive Index: 
Two Items Repeated for Each of Four 
Groups) 


“To have [group name] as neighbor seems to me very at- 
tractive, somewhat attractive, somewhat unattractive, very 
unattractive.” 


“To have a [group name] as a life partner seems to me very 


attractive, somewhat attractive, somewhat unattractive, very 


unattractive.” 
| 


Economic Perceptions 


“In general, do you think that you and your family are better 
off, worse off, or about the same financially compared with 
two years ago?” 

“Looking ahead, do you think that two years from now you 
will be better off financially, worse off, or just about the same 
as now?” 


“Looking ahead, do you expect that the economy will get 
better, get worse, or stay about the same in the next two 
years?” 

“Now let’s talk about the country as a whole. Would you say 
that most families in The Netherlands are better off, worse 
off, or about the same financially compared with two years 
ago?” 


Identification with Dutch Identity (Four-lttem 
Additive Index) 


“Now I’m going to ask you about how you fell being Dutch. 
As I read each statement, please tell me if you agree strongly, 


agree somewhat, disagree somewhat, or disagree strongly 
with ıt? 
‘I often think of myself as Dutch.” 


“I consider myself a typical Dutchman.” 
“Tm proud that Pm Dutch.’ 


“If someone said something bad about Dutch people I feel 
almost as if they said something bad about me.” 


Self-Esteem (Five-tem Additive Index) 


“Tm going to make a few statements about people’s mentality 
in general and yourself, Please tell me whether your think they 
are true or false. 

‘When in a group of people, I usually do what others want, 
rather than make suggestions.” 


“I would have been more successful if people had given me 
a fair chance.”. 


“I certainly feel useless at times.” 
“Teachers often expect too much work from their students.” 


“I commonly wonder what hidden reason another person 
may have for doing something nice for me.” 
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Authoritarian Values (Three-Item Additive 
Index) 


“Whenever a private or public employer finds it necessary 
to reduce the number of employees, the first to be let go 
should be women who have a husband who is working. Do 


you agree strongly, agree somewhat, disagree somewhat, or 
disagree strongly with this statement?” 


“Here is the next statement: ‘Only the elderly, children and 
handicapped should receive public assistance.” 


“And how about: ‘It is better to live in an orderly society in 
which the laws are vigorously enforced than to give people 
too much freedom.” 
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Election Time: Normative Implications of Temporal Properties of the 


Electoral Process in the United States 
DENNIS F THOMPSON Harvard University 


lections take place at intervals, most citizens vote on the same day, and the electoral outcome is 
irrevocable until the next election. Each of these temporal properties—periodicity, sirmultaneity, 
and finality—is grounded in fundamental democratic values Analyzing the properties shows how 
and why several anomalies in electoral theory and practice in the United States should be eliminated. 
Together the properties mark off electoral politics as a sphere that requires different and often more 
stringent standards than the rest of political life outside of government. Periodicity requires that the 
control of redistricting be independent of legislatures. Simultaneity opposes publishing the results of 
exit polls, expanding the use of early voting, and granting legislatures the authority to select presidential 
electors on their own schedule. Finality justifies more stringent regulation of campaign practices, such as 


electioneering ads. 


n election marks a moment of politics—a dis- 

continuous phase in a continuous process. The 

electoral moment can be specified by three 
temporal properties: periodicity (the intervals at which 
citizens vote), simultaneity (the range of time in which 
citizens vote), and finality (the extent to which the re- 
sult of their votes is conclusive until the next election). 
The temporal properties are so familiar that they are 
usually taken for granted, but the way they structure the 
electoral process has significant theoretical and practi- 
cal implications that have not been sufficiently appre- 
ciated. 

The temporal properties are grounded in basic values 
shared by most conceptions of democracy. All three 
support popular sovereignty—the capacity of majori- 
ties to control government—in different but related 
ways. Because elections take place periodically, current 
majorities can overcome the dead hand of past majori- 
ties. To the extent that voting takes place simultane- 
ously, elections express the will of a determinate ma- 
jority rather than the preferences of a series of different 
majorities. Because elections produce final results, they 
legitimate the authority of a current majority until the 
next election. As will become clear, other democratic 
values, such as fairness and civic engagement, are also 
strengthened to the extent that the electoral process 
realizes these temporal properties. 

The temporal properties come in different combina- 
tions and take on different values in different systems. 
In the United States, citizens vote at regular intervals 
(more so than in parliamentary systems), mostly all on 
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the same day for any given election, and with conclusive 
results that are not normally reversible until the next 
election. This particular combination of temporal prop- 
erties creates the distinctive rhythm of election time in 
the United States. But because electoral practice in the 
United States does not consistently follow this rhythm, 
it produces anomalies that undermine the democratic 
values that the temporal properties support. By ex- 
amining three of those anomalies (each corresponding 
to one of the temporal properties), we can see more 
clearly the significance of election time and the impli- 
cations of taking its properties seriously. 

The first anomaly concerns the process for drawing 
electoral districts. Elections are supposed to choose 
representatives, yet representatives or their colleagues 
control the drawing of the electoral districts and 
thereby decide who the representatives will represent. 
The second anomaly refers to the way that voting is 
scheduled. Elections are held on one day so that citizens 
can vote more or less at the same time with access to 
the same information, yet TV networks call elections 
before the polls have closed, citizens in some states vote 
many days earlier, and state legislatures have the power 
to choose presidential electors after they see how other 
states have voted. A third anomaly involves the division 
between electoral and other periods of political activity. 
Electoral politics are more strictly regulated than other 
kinds, yet the demand for freedom from regulation in 
campaigns is no less great than in ordinary politics. 

These anomalies have not escaped critical notice, but 
the conventional criticisms do not recognize that the 
anomalies arise from the same source and, therefore, 
do not identify the fundamental error that produces 
them. What the anomalies have in common is that they 
all result from failing to appreciate the special character 
of election time. If we take the temporal properties 
seriously, we can see more clearly how the anomalies 
should be resolved. For each anomaly, we should accept 
its first element (which respects the temporal property) 
and reject its second element (which undermines the 
property). An analysis based on temporal properties, 
it will be argued here, provides a more cogent basis for 
eliminating the anomalies than do conventional app- 
roaches. 
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PERIODICITY: WHY REPRESENTATIVES 
SHOULD NOT CONTROL THE UNITS OF 
REPRESENTATION 


In most states in the United States, the legislature still 
has the dominant role in the contentious process of 
redrawing electoral districts to reflect shifts in popula- 
tion since the previous census. Even in most of those 
states that have redistricting commissions, political par- 
ties choose at least some of the members.! Typically, dis- 
tricts are drawn to protect the party in power and the 
seats of incumbents. The party fortunate enough to con- 
trol the state legislature thus controls the redistricting. 
Less directly but usually no less effectively, parties in 
each state govern the process that draws the Congres- 
sional districts. 

On its face, this practice seems objectionably un- 
democratic. It undermines popular control by giving 
the representatives who are to be controlled significant 
influence over the means by which they are controlled. 
The representatives or their party colleagues create the 
districts and, therefore, define the electorate. They de- 
termine the set of citizens who are to be represented 
and who are to judge the performance of the represen- 
tatives. The process appears to be not one “in which the 
people select their representatives,” but one in which 
“the representatives have selected the people.”* 

The problem could be avoided or at least mitigated 
by adopting a different electoral system. For exam- 
ple, district boundaries would be much less impor- 
tant in a system of multimember districts, which could 
be much larger than current districts. A multimem- 
ber system would also permit the use of cumulative 
voting. Because in this system each citizen may cast 
as many votes as there are seats to be filled, the dis- 
trict would not be as likely to be dominated by one 
party. More radical reforms would abolish districts al- 
together and elect representatives by statewide or na- 
tional forms of proportional representation. In many of 
these systems, voters in effect form their own districts, 
and each representative has a unanimous constituency. 
The various systems have comparative advantages and 
disadvantages. Yet none has been adopted on the 


1 For current information on the practices in the states, see the 
National Conference of State Legislatures http//www.ncsl.org/pro- 
grams/legman/elect/statevote2000.htm Also see the state govern- 
ment Web sites of the individual states. An earlier survey can be found 
in National Conference of State Legislatures 1999, Appendices E, F. 
2 The quotation comes from the opinion of Judge Edith H. Jones ın a 
district court decution, Vera v. Richards, 861 F. Supp. 1304, 1334 (S.D. 
Tex. 1994), which struck down racial districting in three Congres- 
sional districts in Texas. She wrote that “the Leguslature obligingly 
carved out districts of apparent supporters of mcumbents, as sug- 
gested by the incumbents, and then added appendages to connect 
therr residences to those districts. ..The final result seems not one 
in which the people select their representatives, but in which the 
representatives have selected the people.” Some scholars, such as 
Guinier and Torres (2002, 168-83), object not only to politiclans’ 
control over redistricting, but also to geographical districting itself 
as well as the winner-take-all rule. It is important to distinguish the 
objections to this. control from the objections to other aspects of 
the electoral process because cach requires different arguments. The 
argument from periodicity focuses on redistricting control 

3 For a helpful survey and further citations, see Issacharoff, Karian, 
and Pildes 2001a, 1089-1151. 
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national level in the United States, and in most jurisdic- 
tions in the United States the winner-take-all district 
systems are likely to continue to prevail. As long as 
districts exist in their current form, they will need to be 
redrawn periodically. 

The current practice of granting redistricting author- 
ity to legislators has frequently been criticized, but none 
of the usual arguments against it is entirely satisfactory. 
First, it'is claimed that redistricting should not be sub- 
ject to a political process at all because the standards 
of a fair electoral process are objective. They express 
a form of perfect procedural justice, rather than pure 
procedural justice. The districts should match the pop- 
ulation within certain limits, allowing for some vari- 
ation to respect existing political jurisdictions. As far 
as possible, districts should respect the constitutional 
standard of “one person, one vote.” But this standard 
is not very helpful in making the hard choices involved 
in redistricting. Strictly interpreted as requiring every 
vote to count equally, it would prohibit patterns that 
give any category of voter more weight than any other; 
only a random distribution would be acceptable. Even 
in random redistricting, votes would not be weighted 
equally for most categories because people do not 
distribute themselves randomly among districts. Inter- 
preted more permissively, as it has been in practice, 
the standard allows officials to take into account some 
of the factors that have traditionally determined the 
boundaries of districts—such as geographical features, 
historical communities, partisan advantage, and even 
incumbency protection. Armed with powerful com- 
puter technology scarcely imagined at the time the stan- 
dard was proposed, politicians can now create equal 
districts that give more weight to any of a wide range 
of different categories, 

Redistricting therefore involves making choices 
among a complex set of values. There is no objective 
or neutral standard that can determine these choices. 
Choosing is an unavoidably political process. The most 
recent criteria to be adopted by a state [Arizona con- 
stitution, art. IV, pt. 2, sec. 1(14)] are typical of those 
found in many other states: equal population, com- 
pactness, contiguity, community of interest, geograph- 
ical saliency. Plainly, the criteria can yield conflicting 
results, and choosing among them requires deciding 
which should weigh more or less heavily. Interpreting 
and applying the criteria, as well as making the other 
choices between the various goals we have canvassed, 
calls for political judgment (Butler and Cain 1992). As 
long as minority rights are not violated, the political 
process—specifically the legislature—would seem to be 
the appropriate place to decide these inherently polit- 
ical questions. The argument from objectivity or neu- 
trality thus does not support the case against legislative 
control of redistricting. 

A second argument appeals to the importance of 
competition. Letting legislatures control redistricting, 
it is said, is likely to reduce the competition necessary 
for a healthy democratic system.‘ Competition requires 





4 The most recent advocate of this view among constitutional schol- 
ars 18 Issacharoff (2002a, 2002b). Although many of his cnticisms of 
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that voters have real alternatives. Partisan balance may 
shift from election to election, and in some states and 
the nation as a whole the division between the parties 
may be close. The 2002 midterm Congressional election 
was in that respect highly competitive. Even in safe 
districts competition in primaries can be keen. But the 
choice that most voters face is in the race in the general 
election in their own districts. That is the competition 
that matters most for their voting decision. 

In most states citizens can vote only for the can- 
didates in their own party primary. In most districts, 
incumbents almost always win, and by large margins. 
One of the most striking patterns in contemporary leg- 
islative politics in the United States is the high rate of 
reelection of incumbents not only in Congress but also 
in state legislatures (Ansolabehere and Snyder 2002a; 
Center for Voting and Democracy 2002; Cox and Katz 
1996; Ferejohn 1977; Gelman and King 1990). The high 
rate of reelection may indicate that most voters are sat- 
isfied with their incumbents, or that incumbents have 
an unfair advantage from the resources of office, or 
simply that incumbents and strong challengers do not 
run when they expect to lose.° Even if legislative control 
of redistricting is not the principal cause of the lack of 
competition, continuing to give the most power over 
the redistricting process to incumbents or to the cur- 
rently dominant party would not seem to be the most 
effective way to encourage more of it. 

Although more competition may be desirable in 
most district elections in the United States, the case 
against legislative control over redistricting cannot rest 
mainly on this goal. In the first place, electoral competi- 
tion is not an end itself, but a means of allowing change 
that voters may want. It may also be a necessary means 
of discovering whether voters want change, or helping 
them recognize that they want it. But even as an instru- 
mental goal, electoral competition can be misleading if 
it is conceived, as it usually is, on the model of economic 
competition.® It neglects forms of contestation that do 
not readily fit within the categories assumed by mar- 
ket analogies. It gives more attention to the interplay 
of interests and preferences than to the interaction of 
principles and ideas. Furthermore, the preoccupation 
with making electoral contests more competitive dis- 
tracts attention from other equally worthy goals, such 


partisan control of redistricting are consistent with the argument pre- 
sented here, his theoretical rationale tends to grve “competition as an 
independent value” an excessively privileged position in democratic 
theory. More generally, see the critique by Persily (2002). 

5 Cox and Katz (2002) show that strategic decisions to enter or exit 
a race account for much of what is usually attributed to mcum- 
bency advantage. Incumbents and strong challengers are “gettmg 
better at avoiding one another,” m part as a result of the predictable 
regularity of redistricting, rather than party control (p. 198). But 
Ansolabehere and Snyder (2002b) argue that “strategic retirement 
and entry .. cannot explam either the magnitude of the incumbency 
advantage m state elections today or the growth of the mcumbency 
advantage over the past years.” 

© One of the most sophisticated applications of the market analogy 
to electoral politics is Issacharoff and Pildes 1998. In other writings, 
one of the authors recognizes the significance of other forms of com- 
petition. See, e.g., Pildes 2001 For further criticisms, see Thompson 
2002, 7-8, 96, 158-59, 175-77, 185. 


as recruiting and retaining better qualified candidates 
and controlling the growth of the cost of campaigning. 
More generally, concentrating too much on promot- 
ing competition may distract citizens and legislators 
from attending to the business of government. The goal 
should be to find the optimal level of competition, not 
to maximize it. 

In the second place, although there is still some dis- 
pute about the sources of the high rates of incumbent 
reelection, there is little evidence that partisan control 
of redistricting is the main cause (Persily 2002). States 
that have redistricted do not consistently have higher 
rates of reelection than states that have not done so. 
Incumbents also tend to be reelected in elections that 
have no districts (such as gubernatorial and senatorial 
races). Furthermore, politicians usually produce plans 
with districts that are more competitive than those that 
existed prior to redistricting. Redistricting tends to in- 
crease responsiveness (the degree to which the partisan 
composition of the legislature responds to changes in 
voter partisan preferences) and reduce partisan bias 
(the degree to which an electoral system favors one 
political party in the conversion of the total vote into 
the partisan division in the legislature).’ Two factors 
seem to account for this tendency: Parties have multiple 
and conflicting goals (protecting the party may require 
sacrificing incumbents), and the electoral effects of any 
redistricting plan are more uncertain than is often as- 
sumed. 

The third argument against legislative control of re- 
districting comes closer to capturing its fundamental 
problem. Giving politicians or their partisan colleagues 
control of their own means of reelection, it is claimed, 
undermines accountability, one of the main purposes of 
elections.® Politicians are accountable if they must face 
an electorate that can decide on the basis of their past 
performance whether or not to remove them from of- 
fice. When politicians can adjust the boundaries of their 
district and manipulate other electoral rules to preserve 
or increase their relative advantage, they weaken the 
ability of voters to decide their fate. Some of these ef- 
fects continue throughout their term until the next re- 
districting. When representatives believe that they are 
less vulnerable to defeat, they may while in office pay 
less attention to all constituents, and even less attention 
to constituents who are not in the majority that elected 
them. Also as a result of redistricting, voters who wish 
to decide on the basis of the past record more than 





7 The leading analysıs is Gelman and King 1994; but see Cox and Katz 
(1999), who argue that partisan control may affect responsiveness and 
bias. 


8 Competition 1s an important means of maintaming accountability, 
since it gives voters a positive alternative to the reelection of an in- 
cumbent, but it 1s a distinct concept. A race between two candidates 
who have never served in office and therefore are not yet accountable 
can be competitive. A race can be noncompetitive but still an instance 
of accountability, as when an incumbent decides not to run because he 
anticipates defeat. Competition can also work against accountability: 
If competition results m very high rates of turnover ın legislatures, 
voters may be less able to find legislators to hold accountable for leg- 
islation. For other interpretations of accountability, see Przeworski, 
Stokes, and Manin 1999. A useful critique of the conventional view 
of accountability is Philp 2002. 
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future performance may find that the representative 
whose record they would judge is running in another 
district. Further, they may find themselves placed in a 
district where they are no longer in the majority and are 
now unable to discipline the party whose record they 
wish to judge. 

Although control over redistricting does give politi- 
cians a tool to avoid being called to account, the argu- 
ment against control cannot rest entirely on the value of 
accountability. First, even representatives in safe seats 
generally act as if their reelection is in doubt and there- 
fore tend to be responsive to their constituents (Mann 
1977). Second, some voters vote prospectively, rather 
than retrospectively (Fearon 1999; Powell 2000). The 
fact that your district has been altered, that you find 
yourself in a different district, does not matter so much 
if you are choosing on the basis of a prediction about 
how candidates may act in the future. 

Third, the argument from accountability is incom- 
plete because it does not recognize the significance of 
changes in the identity of the legislators to be held 
accountable, and the voters who are to hold them ac- 
countable. With respect to the identity of legislators, 
it does not cover many cases where politicians do not 
run for reelection. Some cannot be held accountable 
even if the districts remain the same. With respect to 
the identity of the voters, the argument does not take 
into consideration the shifts in the composition of the 
district. The majority who elected the representatives 
at the last election cannot be the same majority that 
is calling them to account at this election. New voters 
arrive, some voters move, and others die. The greater 
the interval, the more the change in the identity of 
the electorate, but some significant change is likely to 
occur between most elections. Consequently, an argu- 
ment based on accountability that assumes a link over 
time between specific representatives and a determi- 
nate set of constituents cannot constitute a complete 
case against letting politicians control the process. 

The more fundamental objection is that legislative 
control of redistricting does not adequately respect 
periodicity.” Elections are not one-time events. An elec- 
tion is not like the ratification of a constitution or its 
amendments, a popular initiative or referendum, or 
even a typical plebiscite. Each election, though a dis- 
crete event, stands in an indefinite series. Because one 
purpose of periodicity is to enable voters to hold rep- 
resentatives accountable for their actions, the account- 
ability objection is to that extent correct. But that objec- 
tion depends on assuming that there is more continuity 
in the electorate and the elected from one election to 
the next than there usually is. Even if there were no con- 
tinuity, periodicity would be important. It provides the 
means by which present majorities can escape the dead 
hand of past majorities. Elections must be held periodi- 
cally so that, when new majorities replace old majorities 
in the electorate, they can then decide to what extent to 
replace an old'with a new legislature. Periodicity keeps 


open the possibility of change—specifically, change in 


9 Manin (1997) provides a helpful discussion of a concept (“repeated 
character”) that is similar to the idea of periodicity. 
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the membership of the legislature and thereby in the 
nature of the people’s representation. In this respect, 
periodicity is an expression of the democratic value of 
popular sovereignty. 1? 

To complete the objection to legislative control of 
redistricting, periodicity must be combined with a prin- 
ciple of authority, which specifies who should decide 
the rules that determine whether and to what extent 
change occurs in the legislature. The principle holds 
that decisions about such rules should not be made 
by people who have a preponderant interest against 
(or for) change in the membership of the institution 
in question (Thompson 2002, 133-34, 166-68, 179-80). 
The point is not that change is always desirable. The 
aim is to try to ensure that choices between change and 
continuity—in this case, decisions about redistricting— 
should be made on their merits, giving due weight to the 
many relevant factors that affect the democratic pro- 
cess. The assumption is that these decisions are more 
likely to be made on the merits if legislators and their 
parties do not make them. When an old legislature 
elected by the old majority sets the rules that define the 
new majorities that will elect the new legislature, this 
possibility of renewal and change in membership is less 
likely. Legislators, like members of most institutions, 
generally approve of the mechanisms by which they 
were selected. 

The principle extends an idea originally expressed 
by James Madison during the debates on the Constitu- 
tion. He sharply distinguished ordinary legislation from 
electoral regulation. We can trust the normal process 
of representation, he said, provided that the issue un- 
der consideration is one in which representatives share 
a common interest with their constituents, not one in 
which they “have a personal interest distinct from that 
of their constituents” (Madison [1787] 1966). But when 
representatives decide questions that affect their own 
status or that of their party, they will tend to give 
more weight to preserving their privileges and more 
generally perpetuating the practices of the institution. 
They will give less weight to promoting other values 
of the democratic process. On such issues, we should 
be “jealous” of assigning the representative body final 
authority. 

We do not have to assume that the only problem is 
the “personal interest” of individual legislators in re- 
election, or even that this interest is always opposed to 
the interest of their constituents or the public interest 
on questions concerning the electoral process. Legis- 
lators may conscientiously (sometimes even correctly) 
believe that the best interest of the institution and the 
democratic process lies in preserving the current mem- 
bership or at least the current rules for choosing mem- 
bers. But with regard to the rules that determine who is 
to be elected, citizens have good reason to doubt that 








10 More specifically, the value may be interpreted as an expresmon of 
the princaple of majority rule (suitably constrained by the values of 
liberty and equality). Majority rule implies, mer alia, that a current 
majonty should have the power to override past majonties. For a 
defense of the moral basis of majority rule, see Waldron 1999. For 
other analyses of the justification of majority rule, see Bertz 1989, 
Dahl 1989, and Gutmann and Thompson 1996. 
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legislators and their parties are impartial judges. If leg- 
islators do not pursue their individual or partisan self- 
preservation, they are likely to engage in institutional 
self-perpetuation. Whatever differences the parties in 
the legislature may have, they share a common interest 
in minimizing challenges from outside. They are likely 
to join in opposing term limits and campaign finance 
regulation, which reduce the influence of incumbency; 
and they are likely to work together to discourage fu- 
sion candidacies, which help third-party challengers.” 
Redistricting and other electoral measures—and the 
choice between change and continuity—are more likely 
to receive consideration on their merits if those who 
are deciding whether to adopt them are not the same 
as those whose electoral future they would deter- 
mine. 

The Madisonian principle refers to the circumstances 
of judgment rather than any particular conclusion that 
politicians might reach. It is concerned with the rea- 
sons that redistricting officials are likely to consider 
(and the institutional incentives that favor some rea- 
sons over others), rather than the results officials reach 
(and the institutional legitimacy of any particular re- 
sult). The principle therefore does not necessarily re- 
quire more (or less) competition in elections or more 
(or less) turnover in legislatures. Nor is it satisfied by 
showing that the competition in the system as a whole 
is greater than in most districts or that representatives 
in safe districts are responsive and accountable to their 
constituents. 

Moreover, even if the high rate of reelection of in- 
cumbents is the result of strategic entry or exit decisions 
rather than any advantages incumbents may enjoy, the 
principle would still cast doubt on giving the dominant 
party control over redistricting. That party would still 
be deciding how to redistrict in circumstances in which 
it would be less likely to give due weight to all the rele- 
vant values at stake. For example, a party attempting to 
maintain control of the legislature could decide it had 
to pit two experienced and popular incumbents against 
each other, even if the decision caused one to withdraw 
from the race. Or a party seeking to create safe districts 
may need to produce more ideologically homogeneous 
districts, which could have the effect of increasing par- 
tisan polarization. A party could prefer a Democrat 
who is more liberal or a Republican who is more conser- 
vative than might be optimal for the political health of 
the process. A districting decision that has the effect of 
favoring less moderate and less centrist candidates may 
be justifiable in some circumstances. But as with other 
decisions that have systematic effects on the democratic 
process, the Madisonian principle implies that the par- 
ties are not the best judges of whether the results of such 
a choice serve constituents, let alone the public interest. 

The value of periodicity combined with the 
Madisonian principle implies that the authority for gov- 
erning elections in general and redistricting in partic- 
ular should be located outside the ordinary legislative 
11 The leadmg court case on fusion is Timmons v. Twin Cities Area 


New Party, 520 U.S. 351 (1997), which upheld the Minnesota legisla- 
ture’s ban on fusion candidacies. 


process. The need for independent judgment may sug- 
gest that courts should have this authority. Certainly, 
courts have an important role to play in regulating the 
electoral process. They are well suited to protecting in- 
dividual rights, notably the constitutional requirements 
of equal protection and free speech, and are appropri- 
ately involved in redistricting cases that involve racial 
justice. But questions about change in the composi- 
tion of majorities—to what extent districts should per- 
mit or encourage new majorities to form—raise issues 
that are less about individual voters than about insti- 
tutional structures and the nature of representation 
itself, The conflicting demands of democratic majori- 
ties neither state legal claims that individuals usually 
have standing to bring nor permit remedies that courts 
usually have competence to apply. They involve the 
form of government—in constitutional terms, the guar- 
anty clause—which courts are not well placed and are 
usually reluctant to adjudicate. Furthermore, courts 
are expected to respect precedents, and therefore are 
bound, more than other institutions, to the will of past 
majorities and to the decisions of supermajorities in 
constitutional ratification. 

A more promising way to exercise control over redis- 
tricting (and potentially other aspects of the electoral 
process as well) is through an independent commission. 
Australia (since 1902) and Canada (since 1964) have 
relied with evident success almost completely on non- 
partisan commissions for this purpose.” In the United 
States, six states now authorize commissions to redraw 
their own legislative districts, and eight more states use 
commissions to revise congressional as well as legis- 
lative districts. The most recent state to join this 


12 While active in assessing racial redistrictng, courts have gener- 
ally avoided ruling on partisan gerrymandering. Yet partisan gerry- 
mandering can make it more difficult to accomplish constitutionally 
acceptable racial gerrymandering. When legislators are trying to pro- 
tect incumbents, they do not have as many possibilities for creating 
majority-minonty districts and are more likely to draw unusual and 
irregular boundaries. The “bizarre” shape of the notorious majority- 
black District 12 in North Carolina, cited by the Supreme Court m 
finding the redistricting unconstitutional, could have been avoided if 
the state had not given such a high priority to protecting incumbents. 
See Justice White’s dissent in Shaw v. Reno, 509 US. 630, 674 n. 
10 (1993). For an argument that racial and partisan gerrymandenng 
should be treated similarly in some respects, see Frient 1998. For an 
argument that partisan does not violate any equality 
principle (at least none that should be adjudicated by the courts), see 
Schuck 1987. Also see Guinier and Torres 2002, 176-83, 194-202. 

13 The Court has rarely invoked this clause (art. IV, sec. 4), gener- 
ally treating claims that might arise under it as “political questions,” 
not appropriate for judicial resolution. The classic case rs Luther v. 
Borden, 48 U.S. (7 Howard) 1 (1849). For an argument that more 
cases involving political rights should be decided under the guaranty 
clause, see McConnell 2000. 

14 See Australian Electoral Commission Information Center 2003 
and Courtney 2001. The Boundary Commission for England is 
chaired by the Speaker of the House of Commons but it has func- 
tioned in a relatively nonpartisan way. See McLean and Butler 1996 
(which also mcludes some discussion of the Australian and New 
‘Zealand systems) and Rallings and Thrasher 1994. 

15 The states in the first group are Alaska, Arkansas, Colorado, 
Missoun, Ohio, and Pennsylvania; in the second group are Anzona, 
Hawaii, Idaho, Maine, Montana, New Jersey, Rhode Island, and 
Washington. Vermont’s reapportionment board drafts the initial leg- 
talatrve redistrictmg plan, but the legislature has final authority. Texas 
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movement is' Arizona, whose voters in the 2000 elec- 
tion approved an initiative “creating an independent 
commission of balanced appointments to oversee the 
mapping of fair and competitive congressional and 
legislative districts.”!6 

A well-composed commission is better placed to give 
due weight to.all the relevant factors that should be con- 
sidered in drawing districts—equal population, com- 
pactness, contiguity, community of interest, geograph- 
ical saliency, ‘and even partisan politics. Members of 
commissions çan view their role not as registering pref- 
erences of constituents, or even acting on their own 
conscience and convictions, but as working together 
to identify and express the will of the appropriate 
majority. 

Most states acknowledge the unavoidably political 
nature of redistricting decisions by permitting former 
politicians to serve on their commissions (though most 
bar current and recent office holders). Commissions are 
properly political creatures, but they can cultivate a po- 
litics different from, or at least more wide-ranging than, 
the ordinary kind. They can be designed so that they do 
not serve entrenched majorities of any time or place. 

But politics probably intrudes too much into the ap- 
pointment process of most commissions now. Legisla- 
tive leaders typically appoint the majority of the com- 
mission members, sometimes from a restricted pool of 
candidates (for example, nominated by a judicial com- 
mittee). If the selection process were more indepen- 
dent of the legislature, commissions could better take 
into account all the political factors that legislatures 
consider but give greater priority to providing oppor- 
tunities for changes in membership than to preserving 
the privileges of current members. 

Commissions are less democratic than legislatures 
in the sense that they are less directly accountable. 
But the democratic authority of a commission could 
be maintained in various ways: by ensuring that voters 
have an opportunity to approve its constitutional role 
(its scope of authority and general procedures) and the 
nature of its membership (the qualifications of mem- 
bers and their method of selection). The loss of some 
direct democratic control in the present is surely worth 
the gain of greater democratic control in the future. 
Commissions have that negative virtue so critical to the 
democratic process: They are not self-perpetuating. In 
this way, they help fulfill a central democratic purpose 
of electoral periodicity. 


SIMULTANEITY: WHY CITIZENS SHOULD 
VOTE AT ABOUT THE SAME TIME 


On election night 2000, before all the polls had closed 
in Florida, the networks called the state for Al Gore. 





has a backup commission, which acts 1f the leguslature fails to adopt 
a plan. This information 1s based on the state government Web sites 
of the mdrvidual states (as of August 2002) and National Conference 
of State Legislatures 1999, Appendices E, F. On commissions in the 
United States more generally, see Kubin 1997 and Thompson 2002, 
168-79. 

16 State of Arizona, Proposition 106: <http:wwwsosaz.com/election/ 
2000/info/pubpamphlet/english/prop106.htm> (June 19, 2003). 
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(By then, Florida’s electoral votes were seen as crit- 
ical to victory.) Some four hours later, the networks 
changed their minds, and declared George Bush the 
winner. About two hours later, they retracted that call, 
but only after Gore had phoned Bush to concede. The 
reports may have caused some people not to vote.!” 
Whatever the effects on turnout, the projections contin- 
ued to shape the opinion and events in the month-long 
controversy that followed. In the House hearings de- 
voted to election night coverage, Democratic represen- 
tative Henry Waxman argued that Gore’s “concession” 
prompted by the networks’ projection of a Bush victory 
“set in motion a chain of events that were devastat- 
ing to Al Gore’s chances. And it immeasurably helped 
George Bush maintain the idea in people’s minds that 
he was the man who won the election” (Political Staff 
2001; U.S. House Committee 2001). Republicans also 
complained that the earlier call for Gore caused some 
of their supporters in the western states to stay home 
(Correspondents 2001; Political Staff 2001). 

Inthe aftermath of this election, much of the criticism 
centered on the errors the networks made rather than 
the effects their projections had on turnout and legiti- 
macy. In the House hearings, several network witnesses 
conceded that they had made errors and needed to 
change their methods but said they doubted that projec- 
tions had any harmful effects (U.S. House Committee 
2001). The president of the Associated Press heatedly 
declared that the mere act of holding these hearings 
was itself a threat to freedom of the press. Some of the 
networks seemed inclined to make modest reforms, but 
none wanted Congress to legislate in this area. An out- 
side group recommended that the networks cease using 
exit polls to call elections, stop relying on one source to 
collect and collate data, and undertake organizational 
changes to ensure that accuracy in reporting takes pri- 
ority over speed (Konner et al. 2001). Clearly these 
and similar reforms, whether legislated or not, would 
be desirable. If the media are to make projections and 
report polls, it is important that they do so accurately. 

But the preoccupation with accuracy neglects the 
more general problem—the potential damage to the 
democratic process that results from reporting projec- 
tions even when they are accurate. It is not simply 
inaccurate projections that we may wish to limit, but 
any projection that gives some people but not others 
information relevant to deciding how or whether to 
vote while the election is in progress. (In this respect, 
the greater the perceived accuracy, the greater the po- 
tential harm.) We can begin to develop the justification 
for limiting such projections by examining two common 
arguments against doing so. 





17 Jackson (1983) presents one of the few analyses based on informa- 
tion about mdividuals (using the American National Election Study, 
University of Michigan). The studies that do not find this effect on 
turnout mostly use aggregate data, which ignore the large regional 
differences in the campaigns, relative salience of issues, and variations 
in the weather. Jackson acknowledges that early reporting of projec- 
tions may affect turnout only in elections m which the projections 
differ from prior expectations, as when people expect a close race but 
the projections indicate a clear victory (632). See also the analysis by 
Hansen (20022). 
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The first appeals to freedom of the press. Although 
merely holding hearings on proposals to regulate pro- 
jections does not of course violate freedom of the press, 
prohibiting the press from publicizing projections could 
do so. An outright ban on reporting exit polls would 
rightly be considered a violation of the First Amend- 
ment. Yet freedom of the press is not absolute in the 
electoral sphere. The courts have permitted greater reg- 
ulation of the media in election-related speech than in 
many other areas.'® Electoral politics, as will be shown 
more directly in the next section, should be regarded 
as a special sphere, potentially subject to greater regu- 
lation than ordinary politics. 

But even without directly regulating the media, 
the government could discourage electoral projections. 
There is no First Amendment objection to mandating 
uniform poll closings or embargoing the release of offi- 
cial returns (Ortiz 2002). Uniform closings would make 
the projections less significant. Embargoes on returns 
would make exit polls less reliable or more expensive.) 
Furthermore, the networks could voluntarily agree to 
forgo making projections until all the polls are closed 
in all states. Finally, and most importantly, the main 
purpose of freedom of the press is to protect not the 
press but citizens and the democratic process. 

If we can overcome the argument from press free- 
dom, we still face the argument from voter freedom. 
Restricting election night projections, it is said, would 
deprive voters of the chance to vote on the basis of 
information some think they need to make a decision, 
and therefore would impair their free choice. A “true 
freedom to vote as one chooses” includes “the right to 
not vote at all or even to vote as everyone else does” 
(Barlow 1990). The assumption evidently is that free 
choice requires providing the option of deciding not 
to vote because you believe other voters have already 
determined the outcome, and the option of basing your 
vote on how others are voting. “Neither the networks, 
nor the lawmakers should be permitted to decide which 
information is useful to voters and which is not. .. . The 
choice must belong to the individual” (Barlow 1990). 

This argument misconceives the nature of free 
choice. In a process in which projections are publi- 
cized, citizens may choose what information they wish 
to use, but they are denied some other kinds of choices. 
They cannot choose how information about their own 
choices and those of others is used. They vote in secret 
but they do not have the option of keeping the results 
of their votes secret until all of their fellow citizens 
have voted. They would have that option in a different 


18 Ses, ag, Red Lion Broadcasting Co v. FCC, 395 U.S. 367 (1969); 
Turner Broad. Sys. v. FCC, 520 U.S. 180, 227 (1997) (Breyer, J, 
concurnng in part) (ating Red Lion as supporting policy that 
“seeks to facilitate the public discussion and informed deliberation, 
which. . democratic government presupposes and the First Amend- 
ment seeks to achieve”); and Denver Area Educ. Telecomm. Consor- 
tum, Inc. v. FCC, 518 US. 727, 741 (1996). Generally, see Schauer 
and Pildes 1999. 

19 In a close election the typical poll’s margin of error 1s likely to be 
greater than the vote difference between the candidates, and a poll- 
ster needs the precinct and county returns in order to call the election 
with any confidence. Without these returns, a pollster must increase 
the sample size and, thereby, the cost of the survey (Hansen 2002a). 


system, but the choice of the system does not “belong 
to the individual.” Free choice is not simply a matter of 
individual decision, but also of institutional structure, 
which only citizens acting together can choose. 

Although restrictions on publicizing projections 
would limit free choice in some respects, they could 
enhance it in others. Citizens may reasonably decide 
that the individual choices they make in an election are 
more valuable if no one has information about how they 
voted until the polls have closed. When citizens decide 
to limit the range of choice for the sake of improving 
the value of choice, they are choosing between different 
aspects of liberty, not between liberty and other values. 
Their decision may of course also take into account 
other values, such as the quality of the democratic pro- 
cess, but it is not correctly characterized as favoring 
collective over individual choice. Other countries that 
respect individual choice regulate not only publication 
of election results but also public opinion surveys in the 
period immediately before the election (Feasby 1997, 
242). 

The argument so far shows only that there is no 
presumption in favor of publicizing projections—that 
freedom of the press and voters would not be violated 
by restricting projections—but it does not provide any 
positive reason for such restrictions. Why should citi- 
zens choose a system that reports the results all at once? 
We can formulate a positive justification for restricting 
projections if we take notice of the second temporal 
property of elections—simultaneity. 

In Congress and many other legislative bodies, mem- 
bers vote one at a time (as in a roll call vote) and then 
at the conclusion have the chance to change their votes 
after they see the provisional totals. How they vote 
is affected by how others vote. In an ordinary elec- 
tion, each voter acts as if he or she is voting at the 
same time as every other voter. No voter knows how 
others have voted until everyone has voted. No one can 
change a vote in response to how others have voted. 
And everyone votes with at least potential access to 
the same information. The constitutional requirement 
that all states hold elections on the same day (art. II, 
sec. 1) may be read as an expression of the idea that 
as far as possible the electoral experience should be 
the same for everyone. No more than 24 hours elapse 
from the casting of the first vote (usually just after mid- 
night Eastern Standard Time in Hart’s Location and 
Dixville Notch, New Hampshire) to the closing of the 
polls at 6 p.m. Hawaiian-Aleutian Time. Election-day 
customs similarly reinforce the idea. Candidates stop 
campaigning when the voting starts, and supporters are 
not permitted to demonstrate near the polling places. 
While voting continues, politics ceases. 

The simultaneous character of an election is not an 
arbitrary or merely conventional procedural require- 
ment. It has a normative rationale.We should distin- 
guish two aspects of simultaneity, because each rests 
on somewhat different values. The first is voting at the 
same time; the second, voting without regard to how 
others vote. 

The practice of requiring citizens to cast their bal- 
lots at more or less the same time, usually on the same 
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day, rests partly on the value of popular sovereignty. 
The more that voting is concentrated in time, the more 
that the election expresses the will of a determinate 
majority (or plurality). The outcome can then be seen 
as the result:of a single collective decision rather than 
the product of a series of decisions made by different 
majorities (or pluralities). Simultaneous voting in this 
way creates a more coherent popular sovereign. 

Simultaneity rests even more importantly on the 
democratic value of fairness. If citizens vote at the same 
time (or have only information they would have had if 
they were voting at the same time), the value of each 
citizen’s choice is no greater than that of any other citi- 
zen. All make their choices with equal access to relevant 
information. Election projections deny some voters in- 
formation that other voters have. Also, to the extent 
that election projections discourage efforts by parties 
and candidates to mobilize voters in the western states, 
some citizens who might have voted do not make a 
choice at all. To the extent that information is unevenly 
distributed, an election is less fair. 

The unfairness can be seen most clearly by consider- 
ing cases in which a violation of simultaneity is more ex- 
treme. In the aftermath of the 2000 presidential election 
some people proposed to resolve the controversy by 
rerunning the election in several counties in Florida. 
Even if a rerun had been practicable, it would have 
clearly violated the simultaneity norm. The Palm Beach 
county voters would have been voting with the specific 
knowledge that the election was very close. Many 
would have probably voted differently (shifting their 
votes from Nader to Gore, for example). 

Such a violation of simultaneity should be distin- 
guished from a different kind of strategic voting: the de- 
cision in a single election to vote for your second-choice 
candidate (e.g., Gore) instead of your first-choice can- 
didate (e.g., Nader) in order to decrease the chances 
of your third-choice candidate (e.g., Bush). This kind 
of strategic yoting is neither a violation nor a con- 
sequence of simultaneity. It is consistent with simul- 
taneity because it takes place in a single election. It 
is a consequence of the winner-take-all rule that op- 
erates in most current systems in the United States, 
It would not be necessary, for example, in a system 
that uses forms of proportional representation or even 
one that employs the instant runoff method. The latter 
method provides a way of combining what would other- 
wise be sequential elections into a single simultaneous 
election,” 

Fairness is not the only value at stake in simultaneous 
voting. When citizens go to the polls on the same day, 
publicly participating in a common experience of civic 
engagement, they demonstrate their willingness to con- 
tribute to the democratic process on equal terms. Going 
to the polling station and standing in line with one’s 





2 Occasionally, new elections have been ordered, but only m spe- 
cific districts or localities, never in whole states or m a presidential 
election See Issgcharoff, Karian, and Pildes 2001b, 108-16. 

21 For an application of the mstant runoff method, see Vermont Com- 
mission 1999, For a description of the system in other countries, as 
well as a survey of alternative methods, see Farrell 2001. 
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neighbors may not rank among the most exciting 
moments in life, but election-day voting serves an im- 
portant expressive purpose in a democracy. It enables 
citizens to express their commitment to a common 
project in which they participate on equal terms— 
the process of choosing the leaders who will represent 
them. When the media publicize projections, they un- 
dermine this sense of participating in an important civic 
activity. Voters in the western states lose the chance to 
take part with their fellow citizens in an event that is still 
in progress rather than one that is already on its way to 
the history books. The same media practice that causes 
the electoral process to treat some citizens unfairly also 
causes it to convey to all citizens the message that civic 
engagement is not all that important. 

The expressive value of simultaneity has further 
practical implications. It strengthens the case for mak- 
ingelection day a national holiday.” Because most peo- 
ple who are registered already go to the polls on elec- 
tion day, making election day a national holiday is not 
likely to increase turnout. It could even reduce turnout 
if people decide to travel instead of vote. However, the 
reform would enhance civic engagement in other ways. 
Holding the election on a national holiday not only 
strengthens the sense of participation in a shared civic 
activity, but also more concretely increases the number 
of election workers and polling locations. The improved 
facilities promote civic engagement on equal terms by 
making voting more accessible. 

The norm of simultaneity also casts doubt on the 
increasingly widespread use of early voting—absentee 
ballots and voting by mail. Voting before election 
day has nearly doubled since 1980.7 In Oregon all 
statewide elections have used a mail-in ballot since 
1995. In the 2000 presidential election, a quarter of the 
California vote and two-fifths of the Texas vote were by 
absentee ballot. Early voting weakens the value of the 
experience of participating in a civic activity. The public 
affirmation of voting together openly on the same day 
is quite different in meaning from the private trans- 
action of filling out and mailing in an absentee ballot. 
In addition to this expressive value, such activities can 
contribute in modest ways to the development of the ca- 
pacities of equal citizenship. Participating openly with 
others is more likely to reinforce civic attitudes than is 
voting alone at home. Voting alone may be worse than 
bowling alone. 

To be sure, the value of this kind of civic engage- 
ment is not so important that it should take precedence 
over fairness. Absentee voting is certainly justified for 
citizens who would otherwise have difficulty in going to 
the polls on election day—especially some older voters, 
disabled persons, and members of the armed forces. 
Absentee or other forms of early voting could also be 
justified if they substantially increased turnout espe- 
cially among groups that are now underrepresented 





Z The National Commision recommended that in even-numbered 
years the Veterans Day holiday be held on the Tuesday after the 
first Monday m November and serve also as election day (National 
Commission on Federal Election Reform 2002). ` 

B For these data and further analysis, see Hansen 2002b. 
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at the polls. But so far absentee voting has increased 
turnout only modestly if at all. Unrestricted absen- 
tee voting and the further extension of other kinds of 
early voting may even reduce voter participation by 
discouraging efforts to mobilize voter turnout. Further- 
more, the effects of any increase in turnout have not 
been equally distributed. Whites are twice as likely as 
blacks to vote by absentee ballot. Thus, while measures 
that erode simultaneity may be warranted if they make 
voting more fair, they are harder to justify if they are 
implemented only for the purpose of making it more 
convenient.” 

The second aspect of simultaneity is independence 
of voting—a norm that holds that citizens should not 
adjust their votes according to how others have voted. 
This partly rests on fairness. An election that enables 
citizens to adjust their votes in this way privileges strate- 
gic voting. Strategy of course has a place in the cam- 
paign, but the value of a vote should not depend on the 
strategic savvy of the voter. 

The norm against strategic voting does not rest en- 
tirely on the value of fairness, however. We can imag- 
ine an internet voting system that enabled each voter 
to vote at more or less the same time (with access to 
the same information and with the opportunity for the 
same civic engagement), but with the option of chang- 
ing one’s vote after seeing the provisional totals. There 
would of course be significant practical difficulties with 
any such system. A limit on the number of times voters 
could change their votes would need to be established, 
and a deadline by which all voting would cease would 
have to be set. Protections against the use of “sniper” 
software would be required so that users could not 
exploit the advantage of switching at the last possi- 
ble moment, as occurs in some online auctions. But 
assume that we could adequately protect against the 
many forms of potential fraud and manipulation, and 
also ensure that the technology is highly reliable and 
the devices readily accessible to all citizens. Is there any 
objection in principle against this kind of interdepen- 
dent voting? What is wrong with adjusting your vote in 
light of how others are voting? It might seem that de- 
liberative democrats especially ought to welcome this 
kind of interaction. 

Although voters may certainly take into account the 
views of their fellow citizens, the kind of vote switch- 
ing that would occur in a typical online system would 
be more likely to reflect conformist attitudes than de- 
liberative interactions. This vote switching would un- 
dermine not only the norm that voters should make 
up their minds on the basis of what they hear in the 
campaign, but also the norm that voters should exer- 
cise independent judgment. Of course, many voters do 
not exercise much independent judgment in the cur- 
rent system, but online nonsimultaneous voting would 
create additional and specific incentives for voters to 


4 See Hansen 2002b, Oliver 1996, and Stein 1998 

3 Significantly, all three of the major studies prompted by the 2000 
election recommended against increasing absentee voting Caltech/ 
MIT Voting Technology Project 2001, Constitution Project 2001, and 
National Commussion on Federal Election Reform 2002. 
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decide—suddenly and often irreversibly—to follow the 
crowd. It would invite politicians and their campaign 
managers to encourage just that kind of decision mak- 
ing. Whether or not voters or politicians generally acted 
in these ways, such a system would express in an es- 
pecially pure form a denigration of independent judg- 
ment. 

Simultaneity applies not only to individual voters but 
also to institutions. In the midst of the disputes after the 
2000 presidential election, the Republican-led Florida 
House of Representatives resolved on December 12 to 
appoint its own slate of presidential electors, all reliably 
committed to George W. Bush (State of Florida 2000). 
The House majority feared that the courts or Congress 
would challenge the state’s electors to the electoral 
college. To ensure that Florida’s electoral votes would 
go to Bush, the legislators invoked this rarely used 
constitutional authority. Had the state Senate followed 
suit, the House would have decided the outcome of 
the presidential election for the nation. (As it turned 
out, the resolution proved unnecessary because the U.S. 
Supreme Court stopped the recounts, and the Florida 
Senate adjourned without taking action.) 

The problem is not that legislators were acting as 
partisans. Indeed, it could be argued that this is exactly 
how they should have acted because they were elected 
on the basis of their party affiliation. The problem is 
rather that legislators were deciding the election in 
light of how other voters cast their ballots (as well as 
information about the vote totals in other states and 
the totals and margins of the candidates). Deciding the 
outcome under such conditions undermines simultane- 
ity (Thompson 2002, 151-61). 

The norm of simultaneity applies directly only to sin- 
gle elections and, therefore, does not necessarily rule 
out sequential elections such as the presidential pri- 
maries that the national parties hold in many states. 
Some of the same considerations (such as strengthen- 
ing the sense of civic engagement) that support same- 
day voting would also favor a national primary or at 
least regional primaries. But sequential primaries do 
not raise fairness concerns to the same extent as does 
the strategic legislative selection of presidential elec- 
tors. Although the outcome of the New Hampshire pri- 
mary often has substantial influence on the subsequent 
primaries, the decision is not final in the way that the 
Florida legislature’s would have been. In the absence 
of a compelling argument that the lack of simultaneity 
in successive elections is unfair, other considerations 
carry more weight. Preserving some version of the cur- 
rent mixed system of state primaries and caucuses may 
then be justified by pointing to the greater opportu- 
nities it provides for challenging candidates in many 
different settings, and the lesser scope for influence it 
gives money and the media in the nominating process 
(Polsby and Wildavsky 2000, 232-39). 

Simultaneity in its various forms is thus important for 
reasons of fairness and civic engagement. Not merely 
voting, but voting at more or less the same time, plays an 
essential role in the democratic process, Democracies 
should therefore try to avoid the anomaly created by 
practices that permit citizens increasingly to choose the 
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time they vote and should preserve as far as possible 
the practices that encourage voting together on a sin- 
gle election day. They should disfavor publishing the 
results of exit polls, expanding the use of absentee bal- 
loting, and granting legislatures the authority to select 
presidential electors on their own schedule. 


FINALITY: WHY CAMPAIGNS SHOULD BE 
REGULATED MORE STRICTLY THAN 
ORDINARY POLITICS 


In declaring the expenditure limitations of the 1974 
campaign finance law unconstitutional, the Supreme 
Court, in Buckley v. Valeo, held that “only expenditures 
for communications that in express terms advocate the 
election or defeat of a clearly identified candidate” may 
be regulated (424 U.S. 1 [1976] at 44). The distinction is 
between financial support for advocating the election 
of candidates (which may be regulated) and support 
for advocating a particular policy or general program 
(which must be left unregulated). Candidate advocacy 
is part of a campaign, and is only partially protected 
under the First Amendment. Issue advocacy is part of 
ordinary political debate, and is fully protected. 

But the line between the two kinds of advocacy was 
anything but clear in the Court’s opinion, and subse- 
quent attempts to clarify it have proved difficult. The 
problem goes beyond the disputes about the differ- 
ences between contributions and expenditures, the dis- 
tinctions between money and other kinds of influences, 
and the controversies about private and public financ- 
ing (Issacharoff, Karlan, and Pildes 2001a, 533-45). The 
problem is not completely resolved by any reform pro- 
posed so far. It would remain even if Buckley were 
overturned. Although the problem of issue advocacy 
has received less attention than that of soft money, its 
practical effects have been no less pernicious, and its 
theoretical implications may be even more significant.” 

For many'years after Buckley, the criterion for ex- 
press advocacy was whether a communication used 
what became known as the magic words: “vote for,” 
“elect,” “support,” “cast you ballot for,” “Smith for 
Congress,” “vote against,” “defeat,” and similar terms. 
Subsequent court decisions broadened the test some- 
what to include statements such as those that “pro- 
vide in effect an explicit directive” to vote for named 
candidates?” Communications that merely discuss pub- 
lic issues, evén if their purpose is to influence elections, 
could not be regulated at all. 

The ambiguities of the distinction between express 
and issue advocacy began to be exploited in the 1996 
election in a major way. By the time of the 2000 elec- 
tion the proliferation of sham issue advocacy ads (to- 
gether with soft money) had for all practical purposes 
completely subverted the purpose of the regulations. 
Here is a now notorious example of a sham issue ad 


25 For an application of democratic principles to campaign finance 
more generally, see Thompson 2002, 105-18. 

TI See, for example, Federal Election Commission v. Massachusetts 
Cuizens for Life, Inc, 479 U.S. 238 (1986) at 248-50 
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(broadcast in the final weeks of 1996 state election in 
Montana): 


Who is Bill Yellowtail? He preaches family values, but he 
took a swing at his wife. Yellowtail’s explanation? He “only 
slapped her,” but her nose was broken. He talks law and 
order, but 1s himself a convicted criminal. And though he 
talks about protecting children, Yellowtail failed to make 
his own child support payments, then voted against child 
support enforcement. Call Bill Yellowtail and tell him we 
don’t approve of his wrongful behavior. Call 406 443 3620. 
(Quoted by Briffault 1999). 


The campaign reform act signed into law in March 
2002 seeks to deal with these sham issue ads by adopting 
a broader concept than express advocacy. It defines an 
“electioneering communication” (which is subject to 
regulation) as a broadcast ad that clearly identifies a 
federal candidate within 30 days of a primary or 60 
days of a general election and is targeted to his or her 
constituency (U.S. Congress 2002). Evidently uneasy 
about the constitutionality of this definition (or eager 
to invite a challenge), the legislators added this provi- 
sion: If the primary definition is held “constitutionally 
insufficient,” then an “electioneering communication” 
should be taken to mean any ad that supports or op- 
poses a candidate and can be plausibly interpreted only 
as an “exhortation” to vote for or against a specific can- 
didate. This fall-back definition is not quite the express 
advocacy test again (that test is expressly repudiated 
in the legislation), but it is so close to current practice 
that it would probably not deal with the problem of 
sham issue ads.” The primary definition (identifying a 
candidate in the specified period) is much better than 
the current practice, but it still calls for interpretation 
and is likely to continue to generate disputes. 

The practical problem has been to formulate a crite- 
rion that is strict enough to rule out sham issue ads but 
loose enough to permit robust debate about issues. The 
theoretical challenge, which has received less attention, 
is even greater. It can be brought out by asking: Why 
should we regulate any kind of advocacy during a cam- 
paign? After all, no one would propose trying to control 
candidate advocacy all the time (even though it cer- 
tainly takes place months before the official campaign 
period). The problem is not how to distinguish differ- 
ent kinds of advocacy, but how to distinguish electoral 
from nonelectoral politics. Why are elections special? 
This challenge goes beyond the advocacy problem. We 
have to justify a distinction between electoral and ordi- 
nary politics to defend even minimal campaign reforms, 
such as financial disclosure. (Presumably we would not 
want to require everyone who takes part in ordinary 


28 Without waiting for any constitutional challenge, the Federal Elec- 
tion Commission seems to have adopted the language of the fallback 
position in its definition of a “public communication” (which it de- 
fines as a communication that “promotes or supports, or attacks or 
opposes any candidate”) (Code of Federal Regulations, 100 14, [b] 3, 
as amended by the Commission on June 22, 2002) “Public commu- 
nication” includes print, not only broadcast ads, but 1s not regulated 
as stringently as electioneering communications. 

2 For some suggestions about how to draw the line, see Brennan 
Center 2000. 
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political activity to reveal all of his or her sources of 
financial support.) 

The usual answers to the challenge are not adequate. 
First, some try to deny that the regulations assume any 
fundamental distinction between campaigns and other 
political periods. The aim of most campaign finance reg- 
ulation is to control political fundraising, which takes 
place continually. Ads are regulated during a campaign 
simply because that is when the money is spent. But this 
reply does not sufficiently recognize that the purpose of 
the fundraising, whenever it occurs, is for the campaign 
and, therefore, that the regulation still assumes that 
there is something special about the campaign period. 
Funds raised for other purposes (even funds raised dur- 
ing the campaign) are not subject to regulation. 

Second, some argue that the regulations in question 
do not create a significant difference between cam- 
paigns and other political periods because they do not 
prohibit any citizen from engaging in candidate advo- 
cacy. “No speech is ‘banned’ under the new Act. ... The 
only restrictions apply to the sources of funding, not to 
the speech itself” (Potter 2002). It is true that under the 
law individuals can still advertise for or against a candi- 
date as much as they wish—or can afford (U.S. Congress 
2002). They have to disclose only the funding above 
certain thresholds. Corporations and labor unions may 
not use their treasury to sponsor electioneering ads, 
but even they can continue to run ads in the middle of 
the campaign, provided they use only funds from their 
PACs (which must be funded voluntarily by individual 
employees, shareholders, or union members). National 
political parties can sponsor electioneering ads if they 
use “hard” money (which is raised within strict limits 
and is subject to disclosure). 

Although these considerations may adequately de- 
fend the new law against the charge that it bans im- 
portant forms of free speech, they still do not explain 
why the regulation of the sources of funds target only 
campaign-related activities. Even if no one’s liberty 
is seriously impaired and the integrity of the demo- 
cratic process is significantly enhanced, the fact remains 
that the campaign finance law still constrains election- 
related activity more than other kinds of political ac- 
tivity. It imposes limits on campaigns that are not 
tolerated in other parts of political life. We are still 
left with the question: Why single out election-related 
activity? 

The third set of arguments emphasizes the critical 
importance of the discussion that takes place in the 
campaign. Elections are periods of “heightened value 
debate and political opinion formation” and, likewise, 
periods of heightened opportunity for citizens to trans- 
mit those values and opinions to their representatives 
(Baker 1998). Citizens pay more attention to political 
communications during this period, and the amount 
and variety of political debate are greater than at 
most other times. Although many citizens may make 
up their minds before the campaign, undecided voters 
are more susceptible to political advertising during the 
campaign. Finally, the stakes are higher because voters 
are choosing their leaders, not simply trying to influence 
particular policies or decisions. 


Vol. 98, No. 1 


The trouble with these arguments is that they are 
double-edged. The very importance of campaign dis- 
cussion provides a reason for restraining it less. The 
more attentive and the more receptive citizens are, the 
more vigorous and freewheeling the debate should be. 
In the discussion of some issues (such as decisions to 
take military action or to cut taxes), the stakes are 
no less high between elections. Furthermore, privi- 
leging noncampaign over campaign debate could give 
political advocates “a perverse incentive to conceal 
their electoral preferences in order to avoid burden- 
some regulations”’—amounting to “the closeting of 
electoral speech” (Sullivan 1998). Encouraging citizens 
and politicians to disguise their electoral positions as 
general policy positions is “inconsistent with the notion 
of robust, uninhibited, and wide-open debate about the 
policy bases for choosing among our representatives.” 

Despite the difficulty of separating electoral from 
nonelectoral politics, the need for some distinction re- 
mains. Without it, we are faced with the dilemma of 
either abandoning any attempt to control the exces- 
sive influence of money in elections or imposing overly 
broad prohibitions that risk stifling debate in the wider 
political process. 

Beyond campaign finance reform, many other sen- 
sible rules depend on the assumption of the distinc- 
tiveness of the electoral process; we reasonably accept 
many regulations in the electoral realm that would be 
intolerable in the general domain of public discourse. 
There are limits on what voters are permitted to express 
at the ballot box, as well as requirements to disclose 
the identity of political speakers. There are content- 
based regulations of electoral speech, ranging from 
constraints on electioneering near polling places to se- 
lective bans on contributions from some speakers such 
as corporations (Schauer and Pildes 1999). 

The suggestion that the stakes are higher at election 
time, while not correct in the form that it is usually 
presented, points in the right direction. What raises the 
stakes is not the relative importance of the questions 
that elections decide, but the relative finality of the re- 
sult that elections produce. Finality, the third temporal 
property of elections, provides a more fundamental ba- 
sis for treating electoral politics differently from other 
kinds of politics.” Finality is the flip side of periodicity. 
Elections occur at intervals but once an election takes 
place its results are final for the intervening period. 

Electoral finality has two aspects. First, unlike or- 
dinary political activity, elections (and the campaigns 
that precede them) come to a definite conclusion at a 
foreseeable time. An election takes place at a particular 
moment, which until the next election marks an end to 


X The basis for the distinction developed here is closest to that 
presented by Briffault 1999. Other mportant contributions by legal 
scholars who are developing the idea of an electoral domain distinct 
from ordinary politics are Baker 1998 and Schauer and Pildes 1999. 
The electoral verdict is also final ın a sense similar to that of the 
concept of finality as a formal condition in a theory of justice: “When 
the course of practical reasonmg.. has reached its conclusions, the 
question is settled....We cannot at the end count [existing social 
arrangements and... self-mterest] a second time because we do not 
like the result” (Rawis 1999). 
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the process of deciding who will hold office. The results 
of elections ‘are not reversible until the next election. 
Electoral accountability is not continuous so that those 
elected can govern effectively for a period without the 
risk of immediate removal from office.! 

Second, unlike ordinary political activity outside 
government, elections result in decisions that are bind- 
ing on all citizens, Although noncampaign activity also 
often affects’ the outcome of elections, its purposes are 
broader. It is not directed exclusively at influencing a 
decision that, without any further political discussion or 
representational mediation, immediately obligates all 
citizens. The winner of an election gains the authority 
to act for all citizens. 

These two aspects of finality together justify subject- 
ing electoral politics to more stringent standards than 
those that govern the rest of politics outside govern- 
ment. Because of their finality, elections and the cam- 
paigns leading up to them should be considered more 
a part of government than a part of politics that influ- 
ences government. Consequently, all citizens have not 
only an interest in the integrity of the process, but also a 
claim to participate in setting the standards that govern 
it. The standards that control the conduct of elections 
are therefore more appropriately determined by col- 
lective decision than by individual discretion. Citizens 
acting together should set the standards for the process 
that determines who will make and execute the laws by 
which they all are to be bound. 

Finality thus provides in principle a normative basis 
for adopting stricter regulations for campaigns than 
for other political activity. But it does not supply a 
criterion for distinguishing between the two kinds in 
practice»? Like many other normative distinctions, 
such as that between private and public, the princi- 
pled difference between electoral and other kinds of 
politics can be more readily established in theory than 
in practice. That is why any practical criteria—both 
those that specify the length of the campaign and those 
that identify campaign-related content—are arbitrary 
in the sense that there is no compelling reason to draw 
the line at the point they designate than at any of an 
indefinite number of other points Why 60 days be- 
fore an election rather than 90? Why not shorten the 
campaign period itself, as many European countries 
have done? Why is “referring” to a candidate neces- 
sary or sufficient test? To answer these and similar 
questions that arise in devising practical criteria, we 
must take into account a variety of considerations 
such as administrative convenience, enforceability and 
transparency. Yet the criteria need not be objectionably 


31 Recall ıs unusual, and its desirability us nghtly questioned by many 
democratic theorists (Cronin 1999). Its reputation has not been en- 
hanced by its most recent usxe—Califormia’s gubernatorial recall in 
2003. In most parliamentary systems, the government is subject at any 
time to a vote of no confidences and a consequent general election, but 
it still usually enjoys at least as much security as do representatives 
in other systems. Bnoging down a government 1s more difficult be- 
cause party discipline is greater, and also more perilous and therefore 
less likely because the representatives of all parties have to face the 
voters. 

2 See the objection raised by Sulhvan 1998 
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arbitrary if they represent a good faith effort to capture, 
within the limits of these practical considerations, the 
principled difference between electoral and ordinary 
politics. 


CONCLUSION 


Electoral politics runs on a different rhythm than other 
kinds of politics. Its character is expressed by three tem- 
poral properties—periodicity, simultaneity, and finality. 
Elections take place at intervals, citizens vote on the 
same day, and the electoral outcome is irrevocable un- 
til the next election. These properties are grounded in 
important values that any adequate democratic process 
should promote. Different systems realize these prop- 
erties in different ways and to varying degrees. The spe- 
cific ways and particular degrees found in the current 
U.S. system create a rather peculiar electoral rhythm 
and give rise to the three anomalies criticized here. 

Understanding how the electoral rhythm generates 
these anomalies and why they should be eliminated illu- 
minates the important theoretical values at stake in the 
electoral process, and the relationships among them. 
Periodicity serves popular sovereignty by enabling new 
majorities to escape the control of old majorities and 
to effect change when they believe it is needed. Peri- 
odicity highlights the importance of a key form of that 
control—the power to draw electoral districts. Simul- 
taneity promotes fairness by increasing the chances that 
each citizen will have access to the same information 
and the opportunity to participate on equal terms in an 
important democratic rite. It can also enhance civic en- 
gagement and protect independent judgment. In gen- 
eral, the more temporally concentrated an election, the 
more adequately it expresses on equal terms the will of 
all voters. Also, the more clearly it manifests the will of a 
determinate majority, the more fully it realizes popular 
sovereignty. 

Finality also supports popular sovereignty by giving 
continuing legitimacy to the will of an electoral major- 
ity until the next election. Finality works together with 
periodicity but emphasizes a different and sometimes 
conflicting aim. While periodicity stresses the need to 
permit change in representation from time to time, fi- 
nality recognizes the need to support stability in gov- 
ernment over time. It gives leaders the opportunity to 
govern by temporarily freeing them from the threat of 
removal. 

Respecting the temporal properties also has signif- 
icant practical implications. If we appreciate the dis- 
tinctive character of election time, we can see more 
clearly why and how we should try to resolve some of 
the anomalies that characterize the current U.S. elec- 
toral process. We will have stronger justifications than 
those that are conventionally given for establishing in- 
stitutions that give citizens more control over how their 
representatives are chosen, enhancing opportunities to 
participate in elections on equal terms, and granting 
greater authority to regulate campaigns. 

Together the temporal properties mark off electoral 
politics as a sphere that requires special attention and 
demands different and often more stringent standards 
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than those that govern political life that lies outside gov- 
ernment. Periodicity requires that critical decisions that 
affect the membership of representative institutions— 
such as redistricting—are not biased in favor of ei- 
ther continuity or change and, therefore, are not based 
mainly on considerations of incumbent protection 
or partisan advantage. Coupled with the Madisonian 
principle that cautions against letting representatives 
control their own future, periodicity strengthens the 
case for giving independent commissions the responsi- 
bility for redistricting. Simultaneity suggests that we 
should resist publishing the results of exit polls, ex- 
panding the use of absentee balloting, and granting 
legislatures the authority to select presidential electors 
on their own schedule. Finality points to the funda- 
mental reasons that electoral procedures and campaign 
practices require more regulation than ordinary pol- 
itics. Finality justifies, for example, requiring disclo- 
sure of funding sources for electioneering ads and im- 
posing stricter limits on organizations that raise funds 
for such ads. That the results of an election are tem- 
porarily irrevocable and binding on each citizen means 
that all citizens have a legitimate interest in oversee- 
ing the standards that govern the whole electoral pro- 
cess. 

The three properties considered together yield a 
more general implication. They create the character- 
istic rhythm of a healthy democracy—periods of or- 
dinary politics interrupted by campaigns, punctuated 
by simultaneous voting, concluded by a final decision. 
This rhythm is disrupted by the increasing length of 
campaigns that has occurred in recent years in the 
United States). When campaigns go on indefinitely, 
the difference between electoral and ordinary politics 
begins to blur. When representatives are continually 
running or preparing to run, campaigning and govern- 
ing begin to merge. The hybrid political creature that 
this trend creates—the permanent campaign—does not 
serve well the goals of either campaigning or govern- 
ing. Respecting the distinctive temporal character of 
the electoral process calls for resisting this trend and 
sharpening the difference between running for office 
and acting in office. 

The sphere of electoral politics that these three prop- 
erties identify is not a sharply defined area with clear 
and unambiguous boundaries. A line dividing the elec- 
toral process from the rest of political life is not to be 
discovered by undertaking only empirical explorations 
in the political world, as if we were setting out to look 
for some natural boundary like a river. It has to be 
constructed, often one case at a time, as we seek to 
create and sustain an electoral process that promotes 
democratic values. The general principles of demo- 
cratic theory, including the values brought together by 
the temporal properties described here, can guide those 
political constructions. Delineating and regulating 
the content of the electoral process will proceed more 
democratically and with a better chance of achieving 
democratic results if we attend to its temporal charac- 
ter. To preserve that character and the values of the 
democratic process it promotes, we need to make sure 
that our political life runs on election savings time. 
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nos. We argue that the logic suggesting that majority-minority districts suppress turnout ts 


| | Je inquire whether residence in majority-minority districts raises or lowers turnout among Lati- 


flawed and hypothesize that the net effect is empowering. Further, we suggest that residing in 
multiple overlapping majority-minority districts—for state assemblies, senates, and the U.S. House— 
further enhances turnout. We test our hypotheses using individual-level turnout data for voters in five 
Southern California counties. Examining three general elections from 1 996 to 2000, we demonstrate that 
residing in a majority-Latino district ultimately has a positive effect on the propensity of Latino voters 
to turn out, an effect that increases with the number of Latino districts in which the voter resides and is 
consistent across the individual offices in which a voter might be descriptively represented. In contrast, 
the probability that non-Hispanic voters turn out decreases as they are subject to increasing layers of 


majority-Latino districting. 


Section 2 of the Voting Rights Act (VRA), in 

conjunction with the preclearance provisions 
of Section 5, the creation of majority—minority districts 
has become the standard method for securing minor- 
ity representation in legislative institutions.’ This ap- 
proach has been remarkably successful, at least descrip- 
tively, generally resulting in the election of minorities 
to legislative office. 

Less certain, however, is the impact of these districts 
on the political behavior of citizens residing in them. 
Creating majority—minority districts provides minority 
voters with a new-found opportunity to elect a candi- 


. s a consequence of the 1982 amendments to 


Matt A. Barreto is Doctoral Candidate, Department of Political 
Science, University of California, Irvine, and Tomás Rivera Policy 
Institute at the Unrversity of Southern California, 3151 Socal Science 
Plaza, Irvine, CA 92697 (MBarreto@uci.edu). 

Gary M Segura is Associate Professor, Department of Political 
Science, University of Iowa, 341 Schaeffer Hall, lowa City, LA 52242 
(gary-segura@uiowa.edu). 

Nathan D. Woods is a consultant with Welch Consulting and Re- 
search Scholar with The Center for The Study of Los Angeles at 
Loyola Marymount University. Address: c/o Welch Consulting, 1640 
Fifth Street, Santa Monica, CA 90401 (NWoods@welchcon.com). 

Authors’ names are presented alphabetically. Communications 
and questions should be directed to Gary Segura. We would like to 
thank Harry Pachon of the Tomás Rivera Policy Institute for use 
of the County Registrar data sets and Steve Voss of the University 
of Kentucky and Ricardo Ramurez of the University of Southern 
California for their helpful feedback. We also wish to thank partici- 
pants in the political scence research colloquia at the University of 
Jowa and the University of Wisconsin for their extensive comments. 
Responsibility for the analysis, mterpretation, and remaining errors 
is solely that of the authors 
1 Though thus practice increased dramatically after the 1982 changes, 
there were majonty—munonty districts in existence prior to the 1991 
redistricting process. The term Mayorityminority smply refers to 
electoral districts drawn with a sufficient minority population so that 
the minonty population can elect a candidate of choice, usually can- 
didates of like race or ethnicity. What constitutes “suffiaent” popu- 
lation suze is a source of some debate but typically ranges between 
55% and 65% See a recent exchange between Cameron, Epstein, 
and O'Halloran (1996) and Lublin (1999) for a thorough review of 
this discussion. 


date of choice, empowering this previously excluded 
group and increasing their incentives to turn out and 
vote. On the other hand, low levels of competition in 
these districts, coupled with disappointment associated 
with the lack of perceived policy effects from increased 
descriptive representation, may serve as dual disincen- 
tives to participation and dissipate any gains in turnout. 

What, then, is the net effect? The central question 
of this paper is whether majority—minority districts are 
empowering to minority voters or, in contrast, whether 
they actually drive down turnout. Focusing on actual 
turnout at the individual level in five counties of south- 
ern California, we estimate the effects that residing in 
majority—minority districts has among Latinos and non- 
Latinos alike. 

In light of the impending debate over renewal of the 
VRA,’ understanding the behavioral implications of 
minority districting is important. If Latino turnout is 
enhanced by these structures, then majority-minority 
districts unambiguously enhance Latino political influ- 
ence. If, however, Latino representation is gained at 
the expense of Latino turnout, these districts may be 
detrimentally affecting the overall influence of Latinos 
by reducing their impact on up-ballot races where out- 
comes are less certain. 


MAJORITY—-MINORITY DISTRICTS AND 
MINORITY VOTER TURNOUT 


Literature that directly, and empirically, examines the 
notion of minority voter turnout within majority— 
minority districts is still comparatively rare. This is 
somewhat surprising, especially given that scholars 





2 There 1s some chsagreement in the literature over whether the crit- 
ical threshold of minority concentration is actually 50% (Grofman, 
Handley, and Lublin 2001). Nevertheless, we begin here with the 
recognition that securing minority representation has often required 
some degree of electoral engineering, whose behavioral! effects we 
explore here. 

3 The Voting Rights Act comes up for reauthorization in 2007. 
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frequently document the importance of institutional 
context to political attitudes and participation (Jack- 
man 1987; Pantoja and Segura 2003). More recently, 
work examining the sociopolitical context of participa- 
tion (Huckfeldt and Sprague 1995) has undermined the 
very individualistic assumption made about the process 
whereby citizens choose to engage in political activity. 
The role of racial or ethnic context (Leighley 2001) 
and organizations (Pantoja and Woods 1999; Ramirez 
2002), for example, has been identified as central in 
determining who is likely to be mobilized through elite 
efforts and group solidarity and who is likely left be- 
hind. 

In addition to the relative paucity of research on the 
behavior effects of minority districting, empirical ob- 
servation has been limited to anecdote, case study, and 
aggregate-level analyses yielding mixed results. Sev- 
eral early researchers found that minority participation 
is actually enhanced with regard to local, particularly 
mayoral elections. Bobo and Gilliam (1990) find that 
the participation of African Americans increases in 
cities with black mayors, while Lublin and Tate (1992) 
find a similar effect given the presence of African- 
American mayoral candidates. Roll-off also seems di- 
minished where African-American candidates are on 
the ballot (Vanderleeuw and Utter 1993). 

In contrast, other research is less sanguine about the 
potential for increased voter participation as a result 
of the presence of a minority candidate or previously 
elected official. Gaddie and Bullock (1994) find that no 
increased voting occurred among African Americans 
at the school board level, even as a result of newly cre- 
ated black-majority districts. Brace et al.(1995) concur. 
Using aggregate turnout figures at the precinct level for 
congressional, State Senate, and State House districts 
in Florida, they find that turnout is not necessarily in- 
creased in safe districts, even as “candidates of choice” 
were elected. Haeberle (1997) similarly finds that white 
majority—minority districts (in this case in Birmingham, 
Alabama) did not increase turnout. 

Most recently, Gay (2001) examined precinct-level 
turnout data from eight states with African Ameri- 
cans elected to Congress. Using ecological inference 
analysis (King 1997), Gay rejects both the optimistic 
and the overly pessimistic scenarios regarding minor- 
ity turnout and majority—minority districts. While she 
finds very little evidence that the election of African 
Americans to Congress produces meaningful increases 
in black turnout, she does see significant declines in 
white turnout in those same majority-black districts. 
This decline in white participation, she suggests, may 
well be the cause of low overall turnout rates in major- 
ity black districts and serve as evidence that majority— 
minority districts do not have a demobilizing effect on 
the minority voters in question.‘ 

In each of these analyses, turnout rates are mea- 
sured at the level of precinct, district, or jurisdiction. 
The results differ primarily with regard to the focus 


4 This, of course, ıs why simply comparing turnout across majority- 
black and majority-white districts presents an maccurate portrait of 
the differences. 
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on minority turnout in mayoral elections, versus the 
elections of minority candidates from districts. Where 
district contests are the focus, what little evidence is 
provided implies that voter turnout among minorities 
is not stimulated under safe-districting formats. These 
analyses are limited in a number of important ways, not 
the least of which is their focus on aggregate turnout 
rates and the participation of African Americans. Brace 
et al. (1995) take up the question of Latino partici- 
pation, and only in Florida, where the heavily Cuban 
population is not at all representative of the Latino 
population in other states. Barreto et al. (2003) have 
recently examined this question, but with respect to 
mobilization rates rather than actual turnout. And the 
propensity of minority voters to turn out when they 
reside in more than one majority-minority district has 
been, until now, unexamined. 


THEORY AND ARGUMENT 


Our theory is built on two specific points. First, we 
suggest that the argument that majority—minority dis- 
tricts suppress turnout is logically flawed. Second, we 
suggest that properly estimating the effect of majority— 
minority districts on turnout requires us to understand 
better the electoral context in which minority voters 
choose to go—or not to go—to the polls. Since indi- 
vidual electoral contests are conducted simultaneously 
with others, we need to account for whether the voters 
have incentives to turn out for a variety of contests on 
the ballot, rather than merely one. We examine each in 
turn. 


The “Empowerment” View 


Repeated trips to the polling place with nothing to show 
for your effort would get old, fast. In places where 
minority vote dilution has undermined the voices of 
African-American and Latino voters, no level of vigi- 
lance or turnout would be sufficient to secure victory 
against a determined majority.* Losing every election, 
in short, is discouraging, undermines the perceived le- 
gitimacy of the system, and should drive down partici- 
pation. 

Once descriptive representation is secured through 
the creation of majority—minority districts, minority 
voters no longer face the discouragement associated 
with repeated votes and zero victories, While incum- 
bency may briefly impede success, these districts have 
generally led to successful attempts on the part of the 
ae in question to elect a candidate of their choos- 

ing, presuming, that is, that a candidate of the same 
racial or ethnic background (Lublin 1997) is their pref- 
erence. Newly successful minority voters experience an 
enhanced sense of empowerment. Since their votes now 


5 The level of white “determinaton” necessary depends on the pop- 
ulation distributions in the jurusdiction. The level of white unity nec- 
essary for minority exclusion 1s pomtively related to the mmonity’s 
share of the electorate See Grofman, Handley, and Lublm 2001 for 
a fuller discussion. 
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may be perceived to matter, turning out to vote is a 
more rewarding—and hence more likely—experience. 

If this is the case, the value of majority—minority dis- 
tricts to the overall cause of minority representation is 
unquestionably positive. These districts serve the dual 
purposes of both increasing descriptive representation 
and increasing the overall minority share of the elec- 
torate. As minority turnout increases, those voters will 
exercise a larger impact on the outcomes of up-ballot 
elections where they do not constitute a majority of 
the electorate, as well as secure their representation by 
coethnics in the down-ballot districts where the group 
is, in fact, a majority. 


The Skeptical View 


But do their votes now matter? As long as the district’s 
demographics do not change for the worse, these vot- 
ers will pile success on top of success. Guinier (1994, 
58-60) and others have suggested that, while turnout 
might increase in the first chance to elect a minority 
officeholder, this success might also be problematic. 
In a safe, majority-minority district, the outcome is as 
invariant as the outcomes in the majority-white, minor- 
ity vote-diluted context. Yes, the outcomes are better 
for the minority group—they win—but the incentive to 
participate repeatedly is undermined by the irrelevance 
of turnout to the outcome. In addition, the assumption 
that incumbents are then unbeatable, and the reluc- 
tance of primary challengers to emerge from within 
the group, reduces competition in these electoral en- 
vironments. Further, disappointed that the election of 
a minority legislator fails to produce tangible results, 
voters might find themselves even more disillusioned 
than before. If the result is a disillusioned electorate 
with little motivation for participation, the argument 
goes, turnout may well decline. 


OUR VIEW 


Our task here is to sort out these competing logics. We 
hypothesize that minority turnout is enhanced through 
the creation of majorityminority districts. While there 
may be a disincentive for turning out in repeated 
electoral success, we believe that its magnitude pales 
when compared with the disincentive implied in re- 
peated electoral failure. More importantly, pessimistic 
arguments that majority—minority districts discourage 
turnout proceed from two fundamentally flawed as- 
sumptions. First, these analysts assume that there is 
no competition to attract turnout in majority—minority 
districts, that is, that they are effectively “rotten bor- 
oughs.” Second, this assumption is juxtaposed with 
another—that majority-white jurisdictions have com- 
petitive races, i.e., are not “rotten.” 

Neither assumption is justified. While cautions re- 
garding electoral invincibility, incumbency advantage, 
and machine politics are well taken, we think that they 
are nonunique. Scholars of congressional elections reg- 
ularly document how few House seats are meaningfully 
competitive. In any given election year, most observers 








TABLE 1. Latino Population Percentages In 
Majority-Latino Districts, 1991 Districting 
Plan 








District Percentage Latino 1990 
Congressional 
20 55.4 
26 52.7 
30 61.5 
31 58.5 
33 83.7 
34 62.3 
Senate 
16 50.8 
22 68.7 
24 59.3 
30 75.4 
Assembly 
31 52.2 
39 62.2 
45 63.2 
48 70.3 
48 52.0 
49 55.1 
50 88.6 
57 63.5 
58 62.3 
69 64.6 
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agree that only about 45 or 50 of the 435 House seats 
have genuine races in which the outcome is uncertain. 
In this instance, of course, noncompetitive refers to 
the partisan outcome but it may as well also refer to 
the racial outcome, as there are currently no African- 
American or Latino members of the House currently 
representing a district that is not majority—minority or 
was not at the time of their first election.® The politics of 
redistricting is driven by incumbent protection, a pat- 
tern replicated at the state level. As a practical matter, 
then, the vast majority of non-Hispanic white voters in 
America lives in jurisdictions where both the racial and 
the partisan outcomes of legislative elections are rarely, 
if ever, in doubt. And while this noncompetitiveness no 
doubt does drive down political participation, it is not 
at all clear that it affects one group more than another. 

In contrast, a quick examination of Table 1 shows 
just how narrow minority voters’ grip is on the little 
political power they do have. Table 1 reports the Latino 
percentage of the total population in each district coded 
“majority—minority.” Many of these majority—minority 
or majority-Latino districts are considerably less safe 
than the majority white districts against which we are 
comparing them. The population pluralities are often 
modest, and important differentials in citizenship, vot- 
ing age, registration, and turnout are significant. In- 
deed, the very reason why some researchers fetishized 
the 65% threshold in the early studies of majority— 
minority districts was precisely the uncertainty of vic- 
tory associated with a smaller plurality. 

Those expecting lower turnout among minority vot- 
ers in majority—minority districts assert that, whereas 








6 The last to date was L C. Watts (R-OK), who stepped down in 2002, 
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in a vote-diluted environment, no amount of turnout 
could secure a win, in a majority-minority district, no 
amount of abstention can risk a loss. But this, of course, 
is an overstatement. There is a fundamental asymmetry 
between majority and minority status, and an apathetic 
majority can lose the occasional election, while even the 
most zealous minority cannot, alone, change the fate 
of their candidate. While a determined majority could 
always exclude a minority of Latino or black voters 
from electing a candidate of their choice, a majority 
of minority voters could win, but only with sufficient 
turnout to make their demographic majority translate 
into an electoral one. 

We logically conclude that while noncompetitive 
electoral environments are a disincentive to everyone’s 
turning out, the incentives for participation among mi- 
nority voters are systematically stronger in majority- 
minority districts than in majority white districts. 


HI: Voter turnout will be significantly greater 
among Latinos living in majority—minority dis- 
tricts than among other Latinos. 


Accounting for Ballot Context 


Previous work on the behavioral effects of majority- 
minority districts has invariably focused on a single 
office, specifically the school board, mayoral elections, 
or Congress. But decisions regarding turnout in elec- 
tions for different offices should not be viewed as inde- 
pendent trials when the elections occur simultaneously. 
Though a voter need not cast a ballot in every race, and 
roll-off is a well-documented phenomenon, most schol- 
ars would agree that the biggest self-selection process 
is between those who show up and those who do not. 
And in this sense, the incentive to vote is a product of 
more than simply the characteristics of only one of the 
races on the ballot. 

If there are State Senate, Assembly, and U.S. House 
elections on the same day, the level of turnout for one 
race is invariably endogenous to circumstances of the 
others. Latino voters living in a majorityminority As- 
sembly district may or may not be more empowered 
by this fact, but their decision about whether to vote 
is driven in part by their level of interest in the State 
Senate and House races, as well. Some of these Latinos 
living in a majority—minority Assembly district reside in 
more evenly divided Senate and House districts, while 
others may live in majority—minority districts for all 
three bodies. The behavioral effects of living in minority 
districts, then, should be stronger among those living in 
multiple, overlapping majority—minority districts. 

H2: As theinumber of majority-minority jurisdic- 

tions in which a Latino citizen resides increases, so 
does his/her probabulity of turning out to vote. 


DATA AND DESIGN 


To test these hypotheses, we turn to Registrar of Voters’ 
records for all registered voters from five counties in 
southern California: Los Angeles, Orange, Riverside, 
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San Bernardino, and Ventura.’ Specifically, we exam- 
ine data on general elections from 1996 to 2000. Our 
dependent variables are constructed using the actual 
record of whether or not individual registered voters 
turned out for a particular election. 

Our unit of analysis is the individual, a departure 
from previous work. As Gay’s (2001) innovative work 
pointed out, the use of aggregate turnout numbers can 
often mask what is really happening at the level of in- 
dividual choice. This approach provides us with two 
advantages and arguably one potential disadvantage. 
First, unlike analyses that use polling data, we do not 
need to make inferences from samples to populations. 
Second, since Registrar of Voters’ actual records do not 
rely on self-reporting, overreporting due to a social- 
acceptability bias is not a problem. 

The potential disadvantage has to do with the self- 
selected nature of our universe of analysis. Our mea- 
sure of turnout is, of necessity, only among registered 
voters. It is very likely the case that the empowering 
or demobilizing effects of living in a majority-minority 
electoral district would be reflected at the voter regis- 
tration stage, as well as the actual election-day decision 
to vote. In that sense, some of the variance that can 
be explained by majority—minority districts has already 
been lost. On the one hand, this may well cause us to 
underestimate any effect. On the other, our confidence 
in a significant finding is raised since the self-selected 
nature of the registered voter pool makes such a finding 
more difficult. 

We estimate the effect of living in a majority- 
minority district on the likelihood that a Latino citizen 
turns out to vote, controlling for other well-recognized 
determinants of behavior. We examine individual voter 
turnout in each general election, as well as an index 
of turnout across three elections for voters registered 
for the entire time period. For each individual election, 
the dependent variable is Voted and is coded one if 
the registrant signed into the polls on election day and 
zero otherwise. For voters registered for the entire time 
period, a second variable, Voted 96-00, ranges from zero 
to three. For voters registered for the last two elections 
(including those registered earlier), we code Voted 98- 
00, ranging from zero to two. In both instances, the 
values are simply the frequency at which the registrant 
signed in at the polls or voted absentee for the three 
(or two) general elections in question. 

In estimating the principal effect, we take advantage 
of the natural quasi-experiment built into the multiple 
layers of representation each voter receives at the state 
and federal level to see if the frequency of voting in 
a majority—minority context is associated with the hy- 
pothesized effect. At a general election, each registered 
voter in our universe of analysis has the opportunity to 
vote for a State Assembly member, and U.S. House 
member and in alternating elections a State Senator. 





7 With the exception of registered voters in Assembly districts 67, 
70, and 73 in the 1996 and 1998 election and district 73 ın the 2000 
election. Due to errors by Riverside County in collecting and record- 
ing vote history data, these data are not available Fortunately, these 
areas are not within Latino-mayomty jurisdictions. 
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The voter, then, could conceivably be living and vot- 
ing in as many as three majority—minority districts or 
as few as none.® Majority-Minority, then, captures the 
overall experience of the voter with living and voting in 
majority-minority districts for state and federal repre- 
sentation. It varies from zero to three and is the num- 
ber of majority-Latino districts’ in which the registered 
voter resides for State Assembly, State Senate, and U.S. 
House elections.” 

The vast majority of both Latino (64.4%) and non- 
Hispanic (89.4%) citizens live outside of majority- 
Latino districts. Owing to the “nested” nature of legisla- 
tive districts in California, the next most common value 
is three for both non-Hispanics (5.4%) and Latinos 
(24%). Relatively smaller percentages of both live in 
only one majority-Latino district, with the value two 
being the least common. Fortunately, our universe is 
large enough that even the smallest cell has approxi- 
mately 60,000 citizens. 

Since not all representative offices may be equally 
salient to the voter, our design allows us to test whether 
voting in majority-minority districts for particular of- 
fices has different marginal effects. To estimate these 
effects separately, we use individual variables to cap- 
ture whether the registrant lives in a majority-minority 
district for each office. MM-Assernbly, MM-Senate, and 
MM-Congress are each coded one if the registrant’s 
district for that office is majority-Latino and zero oth- 
erwise. 

Identification of Latino voters is accomplished 
through the use of the Census Bureau’s Spanish sur- 
name list, which flags those regi ts with commonly 
occurring Hispanic surnames."! We identify Latino reg- 
istered voters with the variable Latino, which is coded 
one if the voter’s surname indicates Hispanic origin. 
Given long-standing findings on the lower rates of 
turnout among all minority voters, ceteris paribus, we 
expect the coefficient on this variable to be negative, 
though some more recent findings about Latino mo- 


8 Yn fact, a atzen may also live in a majorty—minonity city or city 
council district, school board district, and county Board of Supervi- 
sors district. These elections, by and large, are not conducted simul- 
taneously with statewide and general elections, so while they might 
have a contextual impact on the catizen’s overall level of empower- 
ment, they are not specifically relevant to turnout ın general elections 
9 For our purposes, we count a district as majority Latino when more 
than 50% of the voting-age population in the jurisdiction 1s identified 
as Latmo Replication of all of the following results at any level above 
50% does not appreciably change the findings. 

10 As a practical matter, there are no truly majority-black districts ın 
the five-county area studied. There are, of course, districts where 
Afncan Americans are a plurality, but since we are studying the 
voting behavior of Latincs—in part because we cannot determine 
conclusively whether a registrant is black—these districts would not 
be “empowering” to the Latino electorate and are hence coded zero 
on these measures. 

11 The Spanish Surname list is based on the 1990 census and is con- 
structed by tabulating the responses to the Hispanic origin question. 
Each surname is categorized by the percentage of individuals that 
identified themselves as “Hispanic.” Though the use of this instru- 
ment results ın a modest underestimate, given the presence of Latinos 
with non-Hispanic surnames, the Census Bureau estimates that this 
captures 93.6% of all Hispanics, and less than 5% of those identified 
are false. For a full explanation on the methodology of the list see 
Word and Perkins 1996 


bilization in California give us caution with regard to 
these expectations (Barreto and Woods 2000). 

In order to differentiate the effects of living in 
majority-Latino districts on Latinos and non-Latinos, 
we interact these two variables. MM-Latinos identifies 
the effect of overlapping majority—minority districts 
only on Latinos, as opposed to a companion effect 
among non-Hispanics. Gay’s (2001) findings on whites 
suggest that we might expect the coefficient on the main 
effect to be negative. If our argument is correct, how- 
ever, the coefficient on the interaction of that effect 
with the Latino identifier will be positive. This coeffi- 
cient, then, represents the degree to which the effect 
among Latinos is a departure from the effect among 
non-Hispanic voters. The overall net effect of majority- 
Latino districts on Latino turnout is obtained by adding 
the two coefficients together.” 

The large concentration of Asian Americans in 
California necessitates a demographic control for this 
group as well. We again use a Census Bureau database 
to identify Asian surnames. Asian-American is a di- 
chotomous variable coded on the basis of surname. 
Asian-American turnout also consistently lags behind 
that of whites, so we would expect the coefficient to be 
negative. 

African-American voters have similarly demon- 
strated a lower propensity to turn out. Unfortunately, it 
is not possible to identify which registrants are African 
Americans. To control for this effect, we code Prob- 
ability Black as the likelihood that a given voter is 
African American, inferred from the proportion of 
non-Hispanic and non-Asian residents in that Zip Code 
that is black, ranging from zero to one. This vari- 
able is set to zero when the registrant is coded as ei- 
ther Asian or Latino (since their probability of being 
African American is known). While this estimate is 
of limited use for inferential purposes, it is helpful in 
separating out the potentially different effects of living 
in majority-Latino districts on African Americans and 
non-Hispanic whites. We would expect the coefficient 
to be consistently negative. 

Party ID captures partisan registration. Republicans 
are coded as one, with all other voters coded as zero. 
Since Republicans tend to vote more frequently, we 
would expect this coefficient to be consistently posi- 
tive. Female is a dichotomous variable. Determination 
of gender is directly from Registrar of Voters’ records. 
Age is year minus date of birth and is also coded from 
records. Since younger citizens have been consistently 
found to vote less often, we would expect a positive 
coefficient. 

In addition to these individual-level effects (or prox- 
ies, as in the case of Probability Black), we include a 
battery of contextual effects to control for other well- 
recognized factors influencing turnout that are not part 
of the registrars’ database and, hence, not available for 
each individual registrant. Each variable is coded using 


12 For estimations of the separate effects of each office, the interactive 
terms are named MMA-Latinos, MMS-Latinos, and MiC-Latinos 
and will be the product of the Latino dummy and the dummy vanable 
for whether the distnct for the particular office is majonty Latino. 
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the Zip Code as the unit of analysis, and the data are 
drawn from the 1990 national census, with the excep- 
tion of Probability Black, which is drawn from the 2000 
census. 

Income and education are the obvious necessary con- 
trols. Median Income is median household income in 
the Zip Code, and Percent College captures the per- 
centage of residents with a college education or bet- 
ter. Both should be strongly and positively associated 
with turnout. Since we are primarily interested in the 
behavior of Latinos, we need to control for the ef- 
fects of nativity. Percent Foreign Born serves as a 
proxy for individual nativity. Since naturalized citizens 
generally turn out less often, we would expect lower 
rumour in areas where the foreign-born population is 
larger.!* 

Finally, we also control for two district-level political 
effects. Since'Senate terms in California are four years, 
voters in biennial Assembly and House races will not 
always have a Senate race in which to vote. Senate Elec- 
tion is a dummy coded one if there is a Senate race in the 
voter’s district in that election. Since we would expect 
turnout to be greater when there are more choices to 
be made, the coefficient should be positive. 

We also control for whether one or more of the 
seats are secure for Democrats. Safe Democratic counts 
the number of specific races (year and office) repre- 
sented in the particular dependent variable that could 
be considered safely Democratic. Should the first 
and second hypotheses be sustained, one potential 
alternative explanation for this heightened turnout 
would simply be partisanship. That is, since majority— 
minority districts are heavily Democratic, the positive 
effect on turnout could be driven by the enthusiasm of 
Democrats electing Democratic officeholders, without 
regard to ethnicity. By including this variable, we are 
able to isolate the specific effects of TIA) OH -CUnCHTY 
districting controlling for that of partisanship alone.! 


D In a sumular vein, language use is often a measure of political in- 
corporation, and we generally expect Spanish speakers to be less 
attentive to political and social matters in the United States and 
potentially less informed. To that end, we created Percent Spanish 
Speaking, a variable that accounts for the percentage of households 
in the Zip Code that speak pmmarily Spanish ın the house Unfor- 
tunately, this variable correlates highly with Percent Foreign Born 
(r = 795), above the usual threshold for multicollineanty, and we 
are unable to include both in the estmations 

14 One caution ıs necessary In a recent piece, turnout among recently 
naturalized immigrants was examined and found to be significantly 
greater than among longer-term immigrants (Pantoja, Ramirez, and 
Segura 2001) This finding, contrary to earlier work, they argued, was 
primanly drrven by the highly pohticized environment in California 
over issues of ethnicity resulting from a series of ballot inttiatives in 
the mid-1990s perceived to be Latino-targeted 

7 This variable ranges from zero to three for models of single election 
years or single offices, zero to mx for the model pooling all three 
jurisdictions for two election cycles, and zero to nine for the model 
pooling all three jurisdictions across three election cycles 

6 One potentially important variable mng from this analysis is 
the level of competition in the races m which the individual can vote. 
We calculated Average Competitrveness as the mean level of com- 
petition (100 minus the margin of victory) ın the districts facing an 
election in which the resident votes Since competition has long been 
thought to increase turnout, we would anticipate positive effects. The 
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TESTING AND RESULTS 


Electlon-by-Election Analysis 


To estimate the effect of majority—minority districts on 
the propensity of individuals to turn out to vote in 
each election year, we modeled participation in each 
election using Probit. The results, presented in Table 2, 
vary only modestly across the three election years pre- 
sented. Overall, the models are very effective, correctly 
predicting between 63.5% and 70.8% of the cases and 
yielding a proportional reduction of error (Goodman— 
Kruskal’s t-c) ranging from .259 to .351, a significant 
improvement over a null hypothesis of random propor- 
tional assignment of outcomes. 

The effect of living in a majority—minority district is 
consistently negative and significant, while the coeffi- 
cient on the interactive term for Latinos is consistently 
positive and significant. These findings are consistent 
both with our hypotheses and with Gay’s (2001) earlier 
findings on whites. Non-Hispanics living in majority— 
minority districts are less likely to vote, and this decline 
in likelihood appears to be directly related to the num- 
ber of overlapping majority-Latino constituencies in 
which they reside. In contrast, Latinos differ from this 
pattern. The positive coefficient indicates that Latino 
turnout is either less depressed, or actually increased, 
by residing in majority-Latino constituencies. 

The net effect, calculated by summing the coeffi- 
cients, varies across the elections. For 1996, the sum 
remains negative (—.015), suggesting that majority- 
minority districts depress turnout among Latinos as 
well, though to a much smaller degree than among 
non-Hispanics. We are reluctant to place much stock 
in this single finding for two reasons—the sum is per- 
ilously close to zero, indicating a very small net effect, 
and the result is not repeated for 1998 and 2000. For 
those two models, the sum of these two coefficients is 
positive—.04 in 1998 and .059 in 2000. Latinos living in 
overlapping majority—minority jurisdictions are more 
likely to turn out and vote. Even considering the some- 
what more modest finding for 1996, it appears clear that 
Latinos are mobilized by living in environments where 
they have a chance of winning with a candidate of their 
choice. 

The direct effect of being Latino appears to be neg- 
ative, as expected. In general, Latinos have historically 
voted less frequently than non-Hispanics and this ap- 
pears to be replicated here. Results for each of the other 
variables are nearly all as expected." Other minority 
groups are less likely to turn out; Republicans are more 


resulting variable, however, correlates mgnificantly (and negatively) 
with the safe seat control, making the stmultaneous mclusion of both 
problematic, In results not presented here (but available from the 
authors), we replace the Safe Democratic control with the Average 
Competitrveness contro] This change has no meaningful effect on 
the results presented 

17 Differences in election technology, registration patterns, and his- 
tory of political mcorporation suggest that each of the five counties 
mught have histoncally different rates of turnout, urespective of the 
other exogenous factors captured. This presents possible error-term 
problems For this reason, we coded for the five counties, one of 
which (Ventura) is omitted as a predictor in each estrmation as the 
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TABLE 2. Problt Estimates for Election-Spectflc Turnout of Registered Voters, 1996-2000 
































Voted 1996 Voted 1998 Voted 2000 
Coefficient Coefficient Coefficient 
Varlable (SE) Min — Max (SE) Min > Max (SE) _ Min > Max 
MM-Latinos 036°" .038 .059™" .069 074" .076 
(.002) (.001) (001) 
Majortty—-Minortty —.051* —.056  -—.019" —.023 —-—.015"" —.017 
(.001) (.001) (.001) 
Latino —.0107 —.004 —.010™ —.004 -—.067—" —.024 
(.003) (.002) (.002) 
Probability Black —.214™" —.077 —.140°° —.053  —.297™ —.107 
(.005) (.004) (.003) 
Party ID 113°" 041 089% .035 145 052 
(.002) (.001) (.001) 
Age 012" 381 .019"* .608 .015* .437 
(4.62 x 10-5) (3.87 x 10%) (3.13 x 10%) 
Senate Election — —.016 024°" .010 12 004 
(.002) (.002) (.001) 
Asian-American —.184°* —.069 —.153"" —.061 —.170"" —.063 
(.004) (.003) (.002) 
Female 032" .012 .0O07™* .003 057" .020 
(.002) (.001) (.001) 
Percent College 147 .044 2424 .078 184°" 054 
(.011) (.009) (.007) 
Median Income 3.26 x 10" 125 2.29 x 10° .097 4.23 x 10-6 .159 
(8.54 x 10-*) (7.15 x 10) (6.04 x 10?) 
Percent —.3825™™ —.087  —.249™ —.073 -. —.084 
Forelgn-Born (.008) (.007) (.006) 
Safe Dem .040—" 043 —.012" —.015 —.011™ —.012 
(.001) (.001) (.001) 
Los Angeles —.og0"" —.033  —.015— —.006 .003 .001 
(.003) (.003) (.002) 
Orange —.434—" —.165  —.281™ —.112 .089™* .032 
( 003) (.003) (.002) 
San Bernardino —1.508°" —.540 -. —.009 —.025"" —.009 
(.004) (.003) (.003) 
Riverside 501 —.194 126" .050 .106™" .038 
(.004) (.003) (.003) 
Constant —.059"" — —.738" — — — 
(.006) ( 005) (.004) 
x? 408,722.70°" 317,837.75" 389,842.14" 
N 3,391,123 4,317,827 6,660,566 
PPC 70.8 63.5 67.6 
PRE (t-ġ .351 .259 .273 








== p< .001, two-talled tests 








likely to turn out. Coefficients on aggregate measures 
of income and education are both positive, indicating 
that each is positively associated with turnout. Simi- 
larly age is positively associated with the probability 
of showing up at the polls, while the estimate on the 
percentage born outside the United States is negative 
and significant. Finally, women appear more likely to 
vote than men. 

Two anomalies do appear, however. First, the effect 
of a district being safe for Democrats is inconsistent in 
its effect, positive in 1996 and negative in the other two 
elections. Our priors on the net effect of this variable 


unexpressed category. We have no strong priors on what the nature of 
these relationshrpe will be, nor do we believe them to be theoretically 


mnportant. 


were not strong, yet we find the inconsistent results 
curious. Perhaps the unusually good Democratic year 
(and unusually weak GOP nominee) enhanced Demo- 
cratic turnout vis-a-vis Republicans in 1996. Second, 
the coefficient on having a Senate election that year is 
negative for 1996. This is very odd in that it suggests that 
the presence of more voting opportunities on a single 
ballot is associated with a lower propensity to turn out. 
In contrast, results on this measure in 1998 and 2000 
are positive and significant, as predicted. 

Two methodological notes are worth mentioning 
here. First, we used 50% +1 as the definition of 
majority—minority, while the literature has often sug- 
gested that the electorally n threshold is much 
higher—as high as 65%. In results not reported here, re- 
peating the analysis by recalculating all majority-Latino 
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Changes from the Previous Value 


TABLE 3. : Predicted Probablitties of Voting’ by Group Across Values of Majority—Mlnortty, with 














MalMin = 1 Ma}Min = 2 Ma}Min=3 Total Change, 
Year Group Ma}-Min = 0 (Change, O0— 1) _ (Change, 1 2) (Change, 2— 3) Min > Max 
1996 Latinos .6669 .6617 .6564 .6512 —.0157 
(—.0052) (—.0052) (— 0053) 
Non-Latinos -6703 .6520 .6334 .6144 —.0559 
(—.0183) (—.0186) (—.019) 
1998 Latinos 5592 5751 .5908 .6065 .0473 
(.0159) (.0158) (.0156) 
Non-Latinos -5830 5556 5481 5406 —.0224 
(—.0074) (—.0075) (—.0075) 
2000 Latinos .6504 .6717 6925 7126 .0623 
(.0213) (.0208) ( 0202) 
Non-Latinos .6747 .6691 6635 .6578 — 0169 
(—.0058) (— 0056) —.005 









thet mean value 
2 Among registered voters. 





variables using a 65% cutoff does not appreciably 
change any of the results (and, in fact, makes the net 
effect on Latinos in 1996 positive, thereby eliminating 
the somewhat anomalous finding).'8 

Second, this analysis is conducted using a universe, 
not a sample. So when we speak of statistical signifi- 
cance, we are really only referring to the amount of er- 
ror surrounding the estimated relationship. We are not, 
therefore, engaged in statistical inference from samples 
to populations and there are no confidence intervals— 
the relationship reported is the one in the population 
of registered voters, without question. Using statistical 
significance, then, as some indicator of the importance 
of a relationship—always an error—is even less useful 
here. l 

What, then, is the substantive significance of the iden- 
tified effects? We calculated the changes in predicted 
probabilities, also reported in Table 2, in columns di- 
rectly adjacent to the relevant parameter estimates. !9 
The negative effect of majority-Latino districts on the 
predicted probability of turnout for non-Hispanic vot- 
ers ranges from about 1.7% to 5.6%. In contrast, the 
net effect for Latinos (again, summing across the di- 
rect and interactive effects, this time using changes in 
the predicted probabilities) varies from just below zero 
(—1.8% for 1996, again) to an increase of 5.9%. That 
is, non-Hispanics living in three majority-Latino con- 
stituencies turn out at a rate as much as 5.6% less than 
that of non-Hispanics living in no majority-Latino dis- 
tricts, while for Latinos, living in three majority-Latino 
districts increases turnout by as much as 5.9% com- 
pared to that of Latinos living in non-Hispanic districts, 

In addition, the changes estimated by the models in 
Table 2 make it clear that this is genuinely an additive 
process—iiving in three majority-Latino districts does 
have a much. larger effect than living in just one. In 


18 All results referenced but not reported are available from the 
authors. 


1? Changes in probabilities are calculated umng Long’s (1997) 
Change program for Stata. 
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Note: Estimates for Latinos reflect the combined effect of Majortty—Minortty and the Interactive term. All other variables held constant at 





Table 3, we report both the predicted probability of a 
registered voter turning out at each value of Majority- 
Minority (0 to 3), holding all other variables constant 
at their means, and the change from the previous value 
(Long 1997). For non-Latinos, the effect is estimated by 
varying the value of Majority—Minority, while for Lati- 
nos, itis the summed effect of this variable and the inter- 
action term. It is readily apparent that a unit change in 
the variables capturing residence in majority—minority 
districts has a much smaller effect than a change from 
zero to three, and the relative size of the effect is con- 
stant across the range of values. Moreover, in all three 
instances, Latinos go from being disadvantaged to ad- 
vantaged, vis-à-vis non-Latinos, in propensity to turn 
out. We conclude that the effect of living in majority— 
minority districts, generally positive for Latinos and 
negative for non-Latinos, is greater among both as the 
frequency of that occurrence increases. Hypothesis 2 is 
sustained. 

The results from individual elections are convincing. 
Residing in majority-Latino districts serves as a dis- 
incentive to turn out among non-Latinos but appears 
to have a generally more positive effect on Latinos. 
The effect grows when the citizen resides in multiple 
majority-Latino districts, While a 5.91% increase may 
not seem very large, 384,728 Latinos reside in three 
majority—minority constituencies, meaning that the net 
effect of this variable just on those Latinos produces 
22,737 more voters at the polls (not including additional 
turnout among those living in two or one majority- 
Latino electoral districts) than if they lived in non- 
Hispanic districts.” 


2 In results not presented, we replicated these estimations using 
the multielection indices Voted 96-00 and Voted 98-00 as the depen- 
dent variables. The results of the Poisson regressions on these pooled 
measures of participation were completely consistent with those from 
indrvidual election years. While majority—minority districts did have a 
negative effect on the turnout of voters ın general, for Latinos the net 
effect was positive in both nstances Moving from no majority-Latino 
constituencies to three decreased the average number of votes cast by 
non-Hispanic registrants by .068 for the thres elections between 1996 





American Political Science Review 


TABLE 4. Polsson Estimates for Offlce-Specific Effects on Voter Turnout Indexed Over Elections 
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==- p< .001, two-talled test. 


Estimating Effects from Different Offices 


One remaining question is whether there are differ- 
ences in the effects of each specific office. We have 
been treating these offices as additive by simply cod- 
ing whether the registrant lives in zero, one, two, or 
three majority Latino constituencies. If, however, the 
salience, visibility, or accessibility of one office has a 


and 2000 and by .026 in the 1998 and 2000 elections only. In contrast, 
the same difference resulted m a .116 increase in the average number 
of votes cast by Latino registrants for the entire period and .138 for 
the last two elections. All of the remaining results are consistent m 
magnitude and effect with the earlier specifications 


Assembly Only Senate Only Congress Only 
Coefficient Avg. Change, Coefficient Avg. Change, Coefficient Avg. Change, 
Varlable (S Min > Max (S Min > Max _(SE) Min > Max 
MMA-Latinos .091 .180 
(.002) 
MMS-Latinos .078"" 154 
(.002) 
MMC-Latinos .069™* .135 
(.002) 
MM-Assembly —.045" —.084 
(.002) 
MM-Senate —.012" —.022 
(.002) 
MM-Congress —.028" —.052 
(.002) 
Latino —.022"" —.041 —.013"" —.024 —.018"" —.035 
(.001) (.001) (.001) 
Probability Black —.100°* —.176 —.089™ —.158 —.113" —.199 
(.002) (.002) (.002) 
Party ID 054" 104 .053™ .102 054" 104 
(.001) (.001) (.001) 
Age .008"* 1.643 .008=" 1.839 .008% 1.641 
(1.91 x 10%) (1.91 x 107) (1.91 x 10) 
Aslan-Amerkan —-—.001°™ —.166 —.093"" —.170 —.095"" —.173 
( 002) (.002) (.002) 
Female 014" .026 013" .026 ois" .026 
(.001) (.001) (.001) 
Percent College 073" 116 114 183 075" 120 
(.004) (.004) (.004) 
Median Income 1.79 x 10-6 383 1.72 x 104 .368 1.84 x 10-3" 394 
(3.11 x 10) (3.11 x 10) (3.15 x 10) 
Percent —.188°" —.259 —.188" —.259 —.201"" —.277 
Forelgn-Born (.003) (.003) (.003) 
Safe .003=* .017 —.004"* —.024 —.005"" .031 
Democratic (3.18 x 10-4) (3.16 x 10) (3.25 x 10~*) 
Los Angeles —.01366"" —.026 —.008"" —.015 —.015°" —.029 
(.001) (.001) (.001) 
Orange —.11094" —.203 —.111 —.204 —.110™ —.202 
(.001) (.001) (.001) 
Riverside —.28616" —.485 —.280"" —.475 - —.489 
(.002) (.002) (.002) 
San Bermardino .00045 .001 .002 .002 .003 
(.001) (.001) (.001) 
x? 305,524.90" 303,972.40" 305,263.21 
N 3,391,123 3,391,123 3,391,123 





greater impact on Latino voter efficacy, it could be ex- 
erting a greater effect on turnout than the others. 

To estimate these effects separately, we replace 
the count variable Majority-Minority with an office- 
specific variable (MM-Assembly, MM-Senate, MM- 
Congress) coded one if the district for that office in 
which the citizen resides is majority-Latino. The inter- 
active term for Latinos is similarly replaced with office- 
specific interactions. 

To eliminate any idiosyncratic effects from elections, 
we indexed each citizen’s record of participation in 
all of the elections under study. The dependent vari- 
able, then, is a count of the frequency with which the 
registrant signed in for a general election. We include 
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only registrants who were registered for all three elec- 
tions, so as not to make assumptions about the turnout 
of individuals who may have left the jurisdiction and 
registered elsewhere. Since the dependent variable is 
not continuous but takes on more than two discrete 
values, we employ Poisson regression, the appropriate 
functional form for event-count data neither over- nor 
underdispersed.” The results are presented in Table 4, 
with the estimates accompanied by average changes in 
expected values. 

In terms of magnitude, direction, and significance, 
the results are highly consistent for all variables across 
the three specifications. Turning specifically to the vari- 
ables of interest, being in a majority-Latino district 
is again uniformly negatively associated with turnout 
among all voters, yet uniformly positively associated 
with turnout among Latino voters, even when we con- 
sider the net of the two effects. As indicated by the 
average changes in the expected values reported in 
columns 2, 4, and 6, the number of times non-Hispanic 
voters turn out declines by .084 for Assembly districts, 
.022 for Senate districts, and .052 for House districts. In 
contrast, among Latino voters, the mean frequency of 
turnout across the three elections increases by .096 for 
Assembly districts, .132 for Senate districts, and .083 
for House districts. That is, it is clearly not the case that 
elections for a single office are driving the multioffice 
findings presented earlier. 

The similarity in these results is not surprising given 
the significant overlap in values on these variables. As- 
sembly districts are the smallest units of aggregation 
here and, logically, should have the largest percentage 
of Latinos living in majority-Latino districts, and this 
is in fact the. case. Approximately 34% of Latinos in 
this universe live in Latino Assembly districts, 24% in 
Latino Senate districts, and 29.4% in Latino House dis- 
tricts. Among Latinos, MM-Assembly correlates with 
MM-Senate at r =.7826, and with MM-Congress at 
r = .8217.* This overlap, of course, is precisely the ori- 
gin of our contention that to estimate the effects of 
living in a majority—minority district, one really must 
consider the context of other offices that will almost 
certainly exacerbate this effect. 


CONCLUSION 


In this effort, we set out to determine whether liv- 
ing in majority—minority districts was mobilizing for 
the Latinos and whether these effects would be better 
estimated by considering the larger electoral context. 


21 Tn results not presented here, the same estimations on the other 
mdex, Voted 98-00, and on each election year separately yield results 
consistent with earlier findings and those presented here. 

7 Changes in expected values were calculated unng Clarify 
Tomz, and Wittenberg 2000; Tomy, Wittenberg, and King 2001) 

B The inversion of Senate and Congress values is not surprising. As 
the state constitution specifies the size of the Senate at 40, Senate 
districts are actually larger in population than the state’s 53 House 
seats (52 at the trme of these data). Since smaller and more numerous 
districts tend to increase the share of minorities living ın a majority- 
minority district, the closer association between the Assembly and 
the House is not surprising. 
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Focusing exclusively on legislative elections and using 
turnout data at the individual level, we have demon- 
strated that both are the case. Having the opportunity 
to elect a candidate of your choosing is a consistently 
empowering circumstance. Latinos vote more when in a 
majority-Latino district, contrary to the expectations of 
those who expected or feared minority demobilization. 
In contrast, non-Hispanics living in Latino majority dis- 
tricts have less to cheer about and, apparently, less to 
drag them out to the polls on election day. 

The larger electoral context was found to play 
an important role in establishing the incentives or 
disincentives to vote. Our findings are unambiguous 
in this regard as well. If living in one majority Latino 
district is good for turnout propensity, living in two or 
even three is better. And by devoting our attention to 
Latinos rather than African Americans, our findings 
fill a sizable lacuna in the literature on minority 
voting. 

We need to highlight three potential caveats. First, 
focusing on southern California may limit the general- 
izability of these findings. Nevertheless, testing the ef- 
fects of majority—minority districts requires a focus on 
regions containing significant numbers of such districts. 
Even in the largest concentration of Latino voters in the 
United States, only 36% of Latinos lived in even one 
majority-Latino district. The fairly broad distribution 
of voters across majority-Latino and majority-white 
districts, then, makes this environment well suited to 
our purposes, 

Second, as we reported earlier, voter registration is a 
self-selection process and our results might be different 
if we estimated the effect on all citizens. Fortunately, 
this selection bias actually makes it harder to sustain 
our hypotheses. Since the incentives and disincentives 
for participation undoubtedly affect the willingness of 
Latinos and non-Hispanics to register, at least some of 
the effect of majority-minority districting is absorbed 
by the decision to register—yet our results are still as 
we expected. 

Finally, we have little to say about the impact on 
African Americans since we are not able to identify 
black registrants and since there are only black influ- 
ence districts, rather than majority African American, 
in this region. In order to test our hypothesis among 
African Americans, a data set from another region is 
required. 

Majority—minority districting boosts Latino turnout, 
even as it increases Latino descriptive representa- 
tion. The potential up-ballot advantages of this in- 
crease in participation are profound and suggest that 
Guinier’s (1994) and others’ concerns about the po- 
tentially detrimental effects of these instruments are in 
error. 
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lıbrium outcomes are asymmetric and do not correspond to the median. There are three main 


Wie two candidates of different quality compete in a one-dimensional policy space, the equi- 


effects. First, the better candidate adopts more centrist policies than the worse candidate. Second, 
the equilibrium is statistical, in the sense that it predicts a probability distribution of outcomes rather than 
a single degenerate outcome. Third, the equilibrium varies systematically with the level of uncertainty 
about the location of the median voter. We test these three predictions using laboratory experiments and 
find strong support for all three. We also observe some biases and show that they can be explained by 


quantal response equilibrium. 


andidate quality differences can produce signif- 

icant changes in the nature of political com- 

petition. The equilibrium properties of spatial 
competition between two candidates who differ in 
quality have been analyzed theoretically and the aim 
of this paper is to test them using experimental data. 
The main properties are that higher-quality candidates 
tend to choose more moderate locations and that uncer- 
tainty about the median voter helps lower-quality can- 
didates. We find that all the main qualitative proper- 
ties of this equilibrium are clearly observed in the data 
obtained from laboratory experiments. Moreover, the 
results are robust across subject pools and experimental 
context. 

Candidate quality is widely considered to be a critical 
variable in electoral competition. It affects the deci- 
sions of politicians to run for office, campaign fund- 
raising and advertising, voter behavior, election out- 
comes, and, ultimately, policy outcomes. While direct 
measurement of candidate quality is often elusive, few 
doubt its importance in electoral politics. Quality dif- 
ferences between two candidates can arise for many 
reasons, including charisma, officeholding experience, 
incumbency, advertising, scandal, and other nonpolicy 
dimensions that can affect the relative attractiveness of 
two candidates. Political scientists have demonstrated 
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sional elections identify challenger quality as a critical factor. See, 
e.g. Jacobson and Kernell (1981), Krasno (1994), and Squire (1992). 
Incumbency itself can also be viewed as an indicator of quality. 


over several decades of careful empirical research the 
importance of these and other image factors, or the 
“valence dimension,” as it is referred to in numerous 
articles and books? 

Itis obvious that, all else constant, high-quality candi- 
dates will fare better than low-quality candidates. What 
is less obvious, but equally important, is that quality 
differences produce significant changes in the nature of 
political competition. Recent papers by Ansolobehere 
and Snyder (2000), Aragones and Palfrey (2002), and 
Groseclose (2001) report a number of theoretical re- 
sults about the equilibrium properties of spatial com- 
petition between two candidates who differ in quality. 
The results are striking and suggest that the indirect 
equilibrium effects of candidate quality differentials 
may be even more important in determining candidate 
policies and election outcomes than the direct effects 
of producing more votes for one candidate than the 
other.’ 

The main insight about spatial competition if the 
candidates differ in quality (or along some other va- 
lence dimension) is that the better candidate has an 
incentive to copy the policies of the inferior candidate, 
or at least move in that direction, while the disadvan- 
taged candidate has the opposite incentive, to move away 
from the advantaged candidate. Theoretically, the ad- 
vantaged candidate will win all the votes if the two 
candidates choose sufficiently similar policies. Thus, in 
the standard Downsian model where candidates are 
purely office-motivated, the disadvantaged candidate 
must mix in order not to be predictable. However, in 
order for mixing to be an equilibrium strategy for the 


2 See, eg., Kiewiet (1983), Kiewiet and Zheng (1993), and Stokes 
(1963) 

3 There are also some earlier theoretical papers that studied other 
kinds of asymmetry, such as incumbency and partisanship, e.g., 
Adams (1999), Bernhardt and Ingberman (1985), Ingberman (1992), 
and Londregan and Romer (1993). 

4 Usıng a different approach, Banks and Kiewlet (1989) show that 
candidate quality differentials can have important and surprising 
equilibnum effects on the entry of challengers in congressional elec- 
tions. Dasgupta and Williams (2002) study information aggregation 
through polls in a principal-agent model with incomplete voter in- 
formation about candidate quality. They report experimental results 
supporting their rational expectations hypothesis. 
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disadvantaged candidate, the advantaged candidate 
also must be mixing. 

This implies the first of three key properties of equi- 
librium in these models: The equilibrium makes sta- 
tistical predictions, not point predictions. If both candi- 
dates have complete information and symmetric beliefs 
about voters, then the equilibrium is generally in mixed 
strategies. If candidates have private information with 
continuous types, then this mixed equilibrium can be 
“purified.” That is, there will exist an equilibrium in 
pure strategies, where the equilibrium locations of can- 
didates will vary with this private information. More- 
over, both the pure and the mixed strategies produce 
distributions of location decisions that share similar sta- 
tistical properties (Aragones and Palfrey 2004). 

The second key property is that the distribution of 
location decisions of the two candidates will be differ- 
ent from each other, and the differences are systematic. 
We call this the quality divergence hypothesis. The main 
difference between the two candidate locations is that 
the distribution of locations of the better candidate is 
concentrated in the center of the policy space (i.e., the 
expected location of the median voter), while the loca- 
tion of the disadvantaged candidate tends toward the 
extremes. That is, better candidates tend to choose more 
moderate locations. Groseclose (2001) notes that this 
is consistent with two regularities that have been iden- 
tified in empirical studies of congressional elections. 
One is the lack of support for the marginality hypothe- 
sis, documented in Fiorina (1973). That is, Fiorina finds 
that incumbents who are in marginal districts tend to 
moderate less than incumbents from safe districts. This 
is clearly consistent with the quality divergence hypoth- 
esis. Second, a recent paper by Ansolobehere, Snyder, 
and Stewart (2001) compares the spatial locations of 
three categories of candidates: (1) incumbents seeking 
reelection, (2) candidates for open seats, and (3) chal- 
lengers running against an incumbent. They find that 
incumbents are the most moderate, followed by open 
seat candidates, and that challengers adopt the most 
extreme positions, To the extent that quality correlates 
across these three categories as expected, then this pro- 
vides further corroboration of the quality divergence 
hypothesis. ' 

The third property is that the two candidates’ equi- 
librium distributions of locations varies systematically 
with the degree of uncertainty about the median voter. 
Uncertainty helps the disadvantaged candidate, who 
chooses more moderate locations as a strategic re- 
sponse to greater uncertainty. This implies that the 
quality divergence effect is weakest when there is a lot 


5 An equilibrium in mixed strategies ıs guaranteed to exist (Aragones 
and Palfrey 2002), and has intuitive properties. A pure strategy equi- 
hbrum may exist if candidates obtain utility from wining policies 
as well as from holding office, under certam conditions, Groseclose 
(2001) studies the propertes of stable pure strategy equilibria, un- 
der the maintamed assumption that they exist, but does charactere 
conditions for existence. He presents an example suggesting that ex- 
tence us especially problematic for small to intermediate values of 
the quality advantage and if officeholding is the prrmary motivation 
of candidates. The properties of pure strategy equilibria are smmilar 
to those of mixed equilibria. 


78 


February 2004 


of uncertainty or if the electorate is highly polarized, 
and the effect is strongest when there is little uncer- 
tainty or a high degree of consensus in the electorate. 
We call this the polarization hypothesis. 

Because the nature of equilibrium is very subtle in 
these asymmetric spatial competition games, and be- 
cause the equilibrium (with complete information) is 
mixed, one cannot help but be skeptical about whether 
the features of the theoretical equilibrium might ac- 
tually occur in practice. While the evidence put forth 
by Groseclose (2001) is consistent with the quality di- 
vergence hypothesis, that evidence could be explained 
by other theories. For example, the correlation be- 
tween incumbency (i.e., electoral success) and mod- 
eration is also consistent with the standard Downsian 
model or the more general models by Calvert (1985) 
and Wittman (1983), which include policy motivations. 
Thus, the evidence is suggestive that the theory may 
be on the right track but does not provide a conclusive 
test of the model. Unfortunately, the kind of field data 
one would need to test these predictions is simply not 
available, due to the difficulty of obtaining reliable and 
accurate measurement of the “quality” variable, the 
degree of uncertainty or polarization in the electorate, 
and the location of candidates, and because the statisti- 
cal nature of predictions would require a large number 
of observations. We believe that direct testing of the 
theory is needed. 

With this in mind, we designed and conducted lab- 
oratory experiments to test directly both the quality 
divergence hypothesis and the polarization hypothesis. 
By doing so, we hope to find out if the basic predictions 
of the theory are accurate and, if not, what sort of mod- 
ification of the theory might be required. This paper 
reports and analyzes the data from those experiments. 

There are three main findings. First, all of the qual- 
itative predictions from the equilibrium theory were 
clearly observed in the data. Both the quality diver- 
gence hypothesis and the polarization hypothesis are 
strongly supported by the data. Location decisions were 
statistical, the advantaged candidates located more 
centrally on average, and all of the comparative static 
predictions of changes in the distribution of voters 
were observed. In particular, when the distribution 
of voters was more polarized, there was less diver- 
gence. Second, the results are robust across subject 
pools and experimental contexts. We used two subject 
pools, one from California, USA, and the second from 
Barcelona, Spain. While the subjects were university 
students in both cases, they were from much different 
backgrounds both culturally and politically. The results 
are virtually identical in the two datasets. Most of the 
experimental sessions were conducted in a context-free 
setting, as is standard practice in controlled game the- 
ory experiments. In addition, we conducted one ses- 
sion where subjects played the role of vote maximizing 
candidates in a sequence of two-candidate elections. 
In each election, they selected policies. The number 
of votes they received was a (commonly known) func- 
tion of their policy and the policy of the opposing can- 
didate. This function was constructed to be the same 
as the “point” payoffs in the abstract setting. Subjects 


American Political Science Review 


Vol. 98, No. 1 


AmMernçam OTUCA eee eI a a a 


earned money in direct proportion to the number of 
votes they received.® Third, while the main hypothesis 
were clearly supported, the exact distribution of loca- 
tion choices was somewhat different from the quanti- 
tative predictions of the theory in systematic and sur- 
prising ways. There were two noteworthy differences: 
(a) Both candidates showed a bias toward centrist 
locations, relative to the theoretical predictions; and 
(b) the disadvantaged candidate’s location distribution 
was less responsive to changes in uncertainty than pre- 
dicted by the theory. 

In order to account for these anomalies, we consider 
an extension of Nash equilibrium theory that allows 
for a limited amount of bounded rationality. This ap- 
proach, called Quantal Response Equilibrium (QRE, 
McKelvey and Palfrey 1995, 1996), is based on two 
principles. The first principle is that players of any game 
respond continuously, but imperfectly, to the incen- 
tive structure of the game. While they do not optimize 
perfectly, they will choose on average better strategies 
more often than worse strategies. The second principle 
is that players are aware that other players are also 
imperfect and take this into account when choosing 
their actions. 

This boundedly rational version of Nash equilib- 
rium often leads to surprising and unintuitive predic- 
tions about behavior in games and provides a statisti- 
cal model for data analysis. Recently this model has 
been used in a number of political science applica- 
tions, spanning empirical studies, experimental stud- 
ies, and theoretical modeling. For example, Signorino 
(1999) has applied it in a model of international conflict, 
Guarnaschelli, McKelvey, and Palfrey (2000) apply it to 
jury decision making, and McKelvey and Patty (2002) 
use it as a basis for a theoretical model of candidate 
competition with probabilistic voting. To analyze our 


data using this approach, we fit the data to the Logit ver- 


sion of QRE. We find that the simplest one-parameter 
version of that model provides an excellent fit to the 
data and accounts for the two unexpected findings. 


THE MODEL 


We begin with a synopsis of the model and basic re- 
sults of Aragones and Palfrey (2002). The reader is 
referred to that paper for details. The policy space is 
one-dimensional and consists of the set ofn > 1 equally 
spaced points on the [0, 1] interval.’ There are two 
candidates, A and D, who are referred to as the ad- 
vantaged candidate and the disadvantaged candidate, 
respectively. Each candidate’s objective is to maximize 
the probability of winning the election." Each voter has 
a utility function, with two components, a policy com- 
ponent and a candidate image component. The policy 


6 Alternatively, we could have explained these payoffs as the prob- 
ability of winning, with identical equilibrium predictions. We opted 
for the vote maximization model because it is simpler to explain. 

7 This us just a discrete version of the standard spatial model. 

8 This is equivalent to a model of vote maximizng candidates facmg 
a known distribution of voters. 


component is characterized by an ideal point in the pol- 
icy space with utility of alternatives in the policy space 
a strictly decreasing function of the distance between 
the ideal point and the location of the policy, symmetric 
around the ideal point. We assume that there exists a 
unique median location. Candidates do not know the 
exact location of the median voter but share a com- 
mon prior belief about it. This commonly shared belief 
is represented by a uniform distribution. The image 
component of a voter’s utility is captured by a posi- 
tive constant that is added to the utility each voter gets 
if A wins the election.’ The image component is as- 
sumed to be small, relative to the policy component of 
utility. 

The game takes place in two stages. In the first stage, 
candidates simultaneously choose positions in the pol- 
icy space. In the second stage, each voter votes for 
the candidate whose victory would yield the higher 
utility. 

A pure strategy equilibrium is a pair of candidate 
locations such that both candidates are maximizing the 
probability of winning given the choices of the other 
candidate. A mixed strategy equilibrium is a pair of 
probability distributions over locations such that both 
candidates are maximizing the probability of winning 
given the mixed strategy of the other candidate. 

There are five main results, each of which we state 
without proof.!° 


Result 1: There does not exist a pure strategy equi- 
librium." 

Result 2: There exists an essentially unique equilib- 
rium in mixed strategies. 

Result 3: The distribution of D’s equilibrium strategy 
is U-shaped (with a local minimum at the expected 
median). 

Result 4: The distribution of A’s equilibrium strategy 
is single-peaked (at the expected median). 

Result 5: The probability that A wins is higher than 
the probability that D wins. 


Summarizing, the main results are that there exists 
an essentially unique equilibrium in symmetric mixed 
strategies with no gaps. In this equilibrium, A is the 
more likely candidate to win. That is, A’s quality ad- 
vantage leads to an electoral advantage. The supports 
of the equilibrium mixed strategies are the same, but 
otherwise the two distributions of the two candidates’ 
are much different. The better candidate is more likely 
to locate in the center of the policy space than at the 
extremes, while the opposite is true for the lower- 
quality candidate. The latter property follows from 
results 3 and 4, and we call it the quality divergence 


effect. 


9 The constant could be different for different voters. For smpliaty, 
we have assumed that it is the same. 
10 There are some additional technical assumptions. The reader is 
referred to Aragones and Palfrey (2002) for formal statements and 
fs of these results. 

1 Versions of this proposition can be found in Berger, Munger, and 
Potthoff (2000) and Groseclose (2001). Ansolobehere and Snyder 
(2000) contains some related results 
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THE EQUILIBRIUM EFFECTS 
OF UNCERTAINTY 


This paper conducts an experiment to test a number 
of the predictions of the theoretical model. We use a 
variation on this model that can be solved explicitly 
for arbitrary distributions of the median voter. This 
is important because the equilibrium strategies of the 
candidates are sensitive to the distribution and vary in 
systematic ways with the distribution. This variation, 
in which candidates can choose one of only three lo- 
cations (left, center, right), also has the virtue of being 
quite simple and intuitive to explain to subjects.” This is 
important in the laboratory context, since we use naive 
subjects who have had no real experience in candidate 
location games. 

` Denote the three possible locations, L, C, and R, 
for left, center, and right, respectively, where L < C < 
R. The probability that the median voter is located 
at ideal point L is denoted a; similarly the proba- 
bility of being located at ideal point C or R is de- 
noted $ and y, respectively, with æ +8 +y =1. Sup- 
pose that the utility functions of the voters are as 
described in the previous section, and assume that 
I(R-C)-(C-— L)| <8 < max{(C — L), (R— C)}. To 
maintain symmetry in the problem, we assume that 
a=y <j. Sincea+f+y=1, this implies that a < 4 
and £ = 1 — 2a, so the model is reduced to a single 
parameter, a, which is proportional to the variance of 
the distribution. Thus, we call a the uncertainty (or po- 
larization) index.’ When a > }, the distribution of the 
median voter’s ideal point is bimodal. We refer to this 
as the case of high uncertainty. When a < }, the distri- 
bution of the median voter’s ideal point is unimodal. 
We refer to this as the case of low uncertainty. The 
case of a = § is called the uniform case. The degree of 
uncertainty, low, uniform, or high, is the primary treat- 
ment variable in the experiment. Hence it is important 
to understand how the equilibrium varies along this 
dimension of uncertainty, or polarization. We explain 
this below. 

The payoff matrix for the game is given in Table 1, 
where A is the row player and D is the column player. 
The (unique) mixed equilibrium is solved in the stan- 
dard way and yields the following pair of equilibrium 


12 This method is also used m the Dasgupta and Wilhams (2002) 


t 
13 A completely general solution to this three-location models given 
in Aragones and Palfrey (2002). We present only a synopsis of the 
derrvation here. ` 
14 That is, the quality advantage is large enough, so a C-location voter 
will vote for L, when the two candidates choose opposite extremes, 
but small enough that voters at D’s location will vote for D unless 
Ais also located there If the quality advantage 1s outside this range, 
the equilibria are trivial and uninteresting. 
15 Formally this model is equivalent to a model in which the popula- 
tion distribution of the voters’ ideal points rs given by (a, £, y) and 
each candidate maximizes the expected vote. In this context, a can 
also be interpreted as a measure of the polanzation of the electorate. 
If a > 4, the distribution of voters is bimodal, while if æ < 4, the 
distribution of voters is more concentrated in the center. Because the 
two versions of the model yield results that are formally equivalent, 
we use the terms uncertainty and polanzation mterchangeably, and 
both terms refer to the level of a. 
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To simplify notation, we denote the equilibrium 
(p*,q*), where p* = oé and q* =o are the equi- 
librium probabilities that A and D locate in the cen- 
ter position, respectively. The probabilities of locating 
at L (or R) are therefore (1 — p*)/2 and (1 — q*)/2, 
respectively. Using this notation, the equilibrium is 
p* = (2 —3a)/(2 — a) and q* = a/(2 — æ). 

This equilibrium solution has several interesting 
properties. First, note that sincea < 4, this implies that 
of < of and o? > oẸ. This implies the quality diver- 
gence hypothesis: The advantaged candidate is most 
likely to locate at the center, while the opposite is true 
for the disadvantaged candidate, who is more likely 
to locate on the extremes. This holds generally for all 
values of a. 

Second, the comparative statics with respect to the 
uncertainty index, a, are very interesting and a bit sur- 
prising and counterintuitive. First, 9p*/da <0, so that 
as uncertainty increases (or the electorate becomes 
more polarized) the advantaged candidate becomes 
more likely to adopt an extreme policy. Surprisingly, 
the opposite is true for the disadvantaged candidate. 
That is, dg*/da >0, implying that the disadvantaged 
candidate becomes more likely to adopt a centrist policy 
as uncertainty increases. In other words, as the polar- 
ization index increases, the advantaged candidate tends 
to move away from the center and the disadvantaged 
candidate moderates. This is the polarization hypoth- 
esis. In the extreme case, when a = 5 (i.e., the median 
is at one of the two extremes), both candidates mix 
uniformly over the three locations. 

Finally, uncertainty benefits the weaker candidate. 
The equilibrium probability that D wins is given by: 


2a(1 — a) 

2-a ` 
The change in this equilibrium probability as a changes 
is found by computing the derivative of II% (œ), which 
is given by 
diih (a) a É —2a a(l- | E 


da 2—a (2-a)? 


Mp(a) = 








The derivative is positive because a <}. Notice that 
if there is very little uncertainty (a close to 0), 
the disadvantaged candidate almost never wins. The 
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Uniform (a=) (3, t) 44 : 
Sesslon 1 10 x 200 
Session 2 8 x 200 
Session 3 14 x 200 
Session 44 12 x 200 
Low (a= i) (§, 3) 40 
Session 1 14 x 200 
Session 2 10 x 200 
Session 3 16 x 200 
High (a =§) ($ 3) 38 
Session 1 8 x 166 
Session 2 14 x 200 
Session 3 16 x 200 
a Polttical context. 





probability D wins reaches its maximum value of å 
when a=. 


EXPERIMENTAL DESIGN 
AND PROCEDURES 


We conducted laboratory experiments using three val- 
ues of a, corresponding to three levels of uncertainty. 
The three values were a = 1/3 (uniform), a =1/5 (low 
uncertainty), and œ =3/7 (high uncertainty). The ex- 
periments used students from the California Institute 
of Technology (CIT; Caltech) and Universitat Pompeu 
Fabra (UPF). Nine sessions were conducted, three for 
each value of a, of which two were carried out at CIT 
and one at UPF. Table 2 summarizes the information 
about each session. In addition, there was an additional 
Caltech uniform session run in a political context. The 
procedures are described in the next section. 


Procedures 


The experiments were conducted using software de- 
veloped at the Hacker Social Science Experimental 
Laboratory at Caltech. The interface for the software 
presents each subject with a matrix of payoffs and keeps 
track of the history of previous game outcomes auto- 
matically for each subject. The matrix of payoffs was 
strategically equivalent to the three-location games, but 
constants were added to the payoffs to avoid zero out- 
comes and to approximately equalize the payoff mag- 
nitudes for A and D players. 

Each session lasted 200 rounds, each round being 
one play of the game.'© Between eight and 16 subjects 
participated in each session. Total earnings were equal 
to the sum of all earnings over the 200 rounds. Average 
earnings were $20 and sessions lasted about 90 min. 


16 The first High session was terminated after round 166 due to a 
computer crash 

17 For the experiments conducted at UPF, earnings averaged 
13 Euros. 


Each subject played both roles (A and D). At the 
beginning of the session, subjects were assigned to be 
either row or column players and instructions were read 
aloud. The game matrix was displayed in front of the 
room for everyone to see. It also appeared on their 
computer screen. In each round, row players clicked 
their mouse on a row to make a decision and column 
players clicked on a column to select their decision. 
After everyone had made a decision, the row/column 
outcome of their match was highlighted in the ma- 
trix on their screen. The screen also kept a display 
of the history of their play and the choices made by 
their past opponents. Several practice rounds were con- 
ducted in order to familiarize the subjects with the 
computer interface. During these practice rounds, the 
subjects were not allowed to make any choices on their 
own. 

The subjects then played 100 rounds, being randomly 
rematched into pairs (one column player and one row 
player) after each round of play. After round 100, 
the payoff matrix was changed so that the row and 
column players’ roles were reversed. That is, the col- 
umn player’s payoffs now corresponded to what a row 
player’s payoffs had been in the first 100 rounds, and 
vice versa. This reversal was carefully explained to the 
subjects. They played 100 additional times with these 
reversed roles. This reversal allowed each row subject 
to have 100 rounds of experience as the A player and 
100 rounds of experience as the D player, and the same 
for each column player. For all sessions except the po- 
litical context session, the instructions were worded in 
neutral terms that would not be associated with per- 
sonal political ideology. The three strategies were la- 
beled A, B, and C. A sample copy of the instructions is 
given in the Appendix. 

For the political context session, the subjects were 
told that they were adopting the role of political can- 
didates, and they would be choosing policy platforms 
over a sequence of 200 elections. The number of votes 
they received in an election depended on their policy 
and the policy of the other candidate in the election. 
They were allowed to choose from a menu of three 
policies, generically called A, B, and C. The number of 
votes each candidate received as a function of pairs 
of policies (one for each candidate in the election) 
was determined in exactly the same way as the pay- 
offs from the neutral uniform treatment. Subjects were 
paid a constant amount for each vote they received. 
As in the neutral context experiments, each subject 
played 100 times in the role of the A candidate and 
100 times in the role of the D candidate, Therefore, 
this session was strategically identical to the other uni- 
form treatments, the only difference being the political 
context. 


Hypotheses 


We have the following comparative static hypotheses 
that are derived from the theory. They are summarized 
below, denoting the empirical choice frequencies of 
center (P, 9) and the treatments H, M, and L (for high, 
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medium, and low uncertainty) 


1. Pi> Pu> pu. 
2. åL<ĝu <u. 
3. ĝ< d < f for all uncertainty treatments. 


This implies the following string of inequalities: 
l 1 
9L<4u<4H< 3 <Pu< bu < PL. 


Summarizing, there are four primary hypotheses 
based on the Nash equilibrium model. The first hypoth- 
esis states that disadvantaged candidates will choose 
the center location less than one-third of the time, re- 
gardless of the level of uncertainty. The second hypoth- 
esis states that advantaged candidates will choose the 
center location more than one-third of the time, regard- 
less of the level of uncertainty. Thus hypotheses 1 and 
2 jointly imply the quality divergence hypothesis. The 
third hypothesis states that disadvantaged candidates 
will choose the center location more often as the level 
of uncertainty increases. The fourth hypothesis states 
that advantaged candidates will choose the center lo- 
cation less often as the level of uncertainty increases. 
Thus hypotheses 3 and 4 jointly imply the polarization 
hypothesis. . 


RESULTS 


The main results of the paper have to do with the effect 
of the primary treatment variable (level of uncertainty) 
on the location decisions of the subject candidates. The 
key observation from the experimental results is that all 
four of the main hypotheses of the theory are strongly 
supported by the aggregate data from this experiment. 
The Nash equilibrium order of center choice relative 
frequencies, 91 < ĝu < ĝu < $ < Pu < Pw < Pu, is ex- 
actly what is found in the data. Table 3 clearly shows 
the support for all of these theoretical hypotheses. That 
table displays the differences between pairs of aggre- 
gate choice frequencies.!® The cell entries in the ta- 
ble correspond to the difference in the relative fre- 
quency of center choices for two treatments (or one 
treatment compared with i). For example, the entry 





18 For comparability reasons, the results presented include only data 
from the neutral context sessions. The findings from the pohtical con- 
text session are discussed separately. However, it 1s important to note 
that none of the results change by pooling data from both contexts. 
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Note: * Significant at better than 5% level. = Significant at better than 1% level. 





in the cell with row label pz and column label py is 
Pi— Pu =0.769 — 0.609 = 0.160. Every single one of 
the 21 predicted differences has the correct sign, and 
all except one are statistically significant at better than 
the 1% level. 

With the exception of the bimodal treatment, the 
aggregate fit for the A players to the quantitative pre- 
diction of Nash equilibrium was nearly perfect. The 
quantitative fit for the D players is not nearly as good, 
and the error was in the direction of overplaying the 
center strategy in all cases.° The A players overplayed 
the center strategy in two of three treatments (the ex- 
ception being the L treatment, where the difference 
is very small.) In addition. the D players do not re- 
spond very strongly to the treatment effects. That is, the 
differences between @ in the different treatments was 
always less than predicted by the theory. Thus, while 
the qualitative features of the data are very supportive 
of the theory, the actual magnitudes of (P, 9) in the 
various treatments deviate somewhat from the Nash 
equilibrium predictions in systematic ways. 

To summarize, there are seven main features of the 
aggregate data. 


1. The quality divergence hypothesis is strongly sup- 
ported by the data. 

2. The polarization hypothesis is strongly supported by 
the data. 

3. All of the signed comparative static predictions 
about p and q are strongly supported by the data. 

4. All of these comparative static differences are sta- 
tistically significant. 

5. The A player fits the Nash predictions much better 
than the D player. 

6. Both players tend to overplay the center strategy, 
and this effect is strongest for the D players. 

7. The response of the D players to changes in the level 
of uncertainty is less than predicted. 


QUANTAL RESPONSE EQUILIBRIUM 
ANALYSIS 


The strategic structure of equilibrium suggests the fol- 
lowing possible explanation. If D players begin with 


19 Another quantitative feature of the data 1s that subjects do not 
play Land R with equal frequency. There is a small (but statistically 
i t) difference between R frequencies and L frequencies, for 
both Aand D players, with R played more frequently than L 
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uninformative prior beliefs about the choices by A 
players, then locating in the center is their optimal 
choice. The same is true for the A players. This could 
produce a pattern in which both players initially over- 
play C, then gradually adapt in the direction of their 
equilibrium strategy. Since in this kind of process D 
starts out farther away from his equilibrium strategy 
than A, it is not surprising that A frequencies are closer 
to their equilibrium values than D frequencies. What 
is needed to capture this idea theoretically is a model 
that can predict one player to be farther from Nash 
equilibrium than the other player. 

QRE is an equilibrium model of imperfect play. A 
quantal response function is simply a smoothed out 
single-valued best response function that is monoton- 
ically increasing in expected payoffs. The quantal re- 
sponse functions are continuous and “statistical” in 
the sense that all strategies are played with positive 
probability. Therefore, players do not always play best 
responses. However, the monotone property implies 
that they play better strategies more frequently than 
worse strategies. Formally, for each player, the quantal 
response function maps the vector of expected payoffs 
of feasible actions into mixed strategy, which satisfies 
monotonicity and continuity properties. A QRE is a 
fixed point of the following composed mapping. Let o 
be some (mixed) strategy profile in the game. Given 
o, one can compute, for each player i and for each of 
player i’s possible actions j, the expected payoff from 
playing that action, given ø, denoted U;,. Given these 
vectors of expected payoffs, the quantal response func- 
tions of players then yield a new mixed strategy pro- 
file, ¢ = O(c). A QRE is a fixed point of this mapping, 
that is, a mixed strategy profile, o*, with the property 
that o* = Qo"). McKelvey and Palfrey (1995) estab- 
lish a number of theoretical properties of QRE points, 
including existence and upper hemicontinuity and a 
connection between QRE and Bayesian equilibrium 
of games with payoff disturbances. 

A particularly useful parametric form of QRE is the 
Logit equilibrium. The Logit equilibrium arises when 
all players’ quantal response functions are Logit func- 
tions of the expected utilities that are implied by the 
mixed strategies.” Formally, a Logit quantal response 
function is given by 


eUu (0) 


ô = Fp eUe)’ 


where A is a parameter measuring the responsiveness of 
i to payoff differences between strategies. A Logit equi- 
librium is therefore a mixed strategy profile o* such that 
eUl") 

Oo SS 

Y S, eUe) 

When à =0, behavior is completely random, and the 
unique Logit equilibrium has every player choosing ac- 


for alli and j. 





2 Hence the Logit equilibrium is the game theoretic extension of the 
standard Logit model of discrete choice that is commonly employed 
in empirical estimation of indrvidual choice models See McFadden 
(1976) for a survey of the Logit and related models of quantal choice. 
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tions according to a uniform distribution. When A > oo, 
the Logit equilibria converge to Nash equilibria. The 
Logit equilibrium correspondence for a game is the set 
of all Logit equilibria for the game, for each nonneg- 
ative value of à. Because of its simple functional form, 
Logit equilibria are relatively easy to compute numer- 
ically and, in some cases, analytically. Gambit software 
(1999) was used for the calculations and figures above. 

Figure 1 displays the graphs of the Logit equilibrium 
correspondences for each of the primary treatments. 
Logit equilibrium choice probabilities are on the verti- 
cal axis and À is on the horizontal axis. Each graph has 
two curves, one for the probability of choosing center 
and a second that is the probability of choosing left (by 
symmetry equal to the probability of choosing right). 
From the graphs, one can see how the QRE captures 
the intuition that the A players converge more quickly 
to the Nash equilibrium, while the D players converge 
slowly. Neither converges monotonically. For interme- 
diate values of A, both players overplay C relative to 
Nash equilibrium. 

The Logit equilibrium correspondence provides a 
structural model that permits us to fit the data to QRE 
using standard maximum likelihood techniques. Given 
a dataset consisting of n observations of A and D 
choices in the location game, one can construct the like- 
lihood function as a function of the free parameter, A, 
which is determined by the theoretical choice probabil- 
ities of the unique Logit equilibrium for that value of À. 
The maximum likelihood estimate of A is the value of À 
at which that likelihood function is maximized. This pa- 
rameter estimate in turn implies estimated equilibrium 
choice frequencies (p*, 9*) using the formula above. 

Figure 2 includes the fitted QRE-predicted choice 
probabilities of A and D, as well as the Nash predictions 
and the aggregate data. The QRE model clearly picks 
up the three anomalous features of the data: overplay 
of center by both players, the worse fit of D compared 
to A, and the weaker responsiveness by D to changes 
in uncertainty. 

Table 4 presents the QRE estimates of the data bro- 
ken down by treatment and model. Column 1 lists the 
three treatments, uniform, low, and high, and the num- 
ber of observations of each treatment. We computed 
estimates for three models, which we call the uncon- 
strained model, the constrained model, and the Nash 





FIGURE 2. QRE Estimates and the Data 
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TABLE 4. QRE Estimation Results 

Treatment Model 3 p i -ogl č č 

Uniform (N=6,400) Unconstralned 0.616 0.289 0.609 0.288 1.04 6,510 
Constrained 0.618 0.266 1.43 6,515 8.708 
Nash equilibrium 0.600 0.200 œo 6,580 140" 

Low uncertainty (N=8,000) Unconstrained 0.803 0.261 0.769 0.252 1.55 7,150 
Constrained 0.800 0.272 1.43 7,151 2.66? 
Nash equlllbdum 0.778 0.111 œ 7,441 5828 

High uncertalnty (N= 7,328)  Unconstrained 0.462 0.312 0.514 0.320 1.22 7,817 
Constrained 0.462 0.306 1.43 7,817 92 
Nash equilibrium 0.455 _ 0.273 coo 7,839 44 










® Model restnction rejected at 1% significance level or better 
b Model restriction cannot be rejected at 10% significance level. 








P 














Low uncertalnty 








Inexperlenced 0.800 0.270 0.763 0.260 qs 3,605 
Experlenced 0.804 0.251 0.775 0.245 1. 3,544 

High uncertalnty 
Inexperienced 0.461 0.315 0.516 0.324 1.15 4,051 
enced! 0.462 0.310 0.512 0.315 1.32 3,765 





0.623 
0.594 








model, respectively. The unconstrained estimates allow 
a separate estimate of 4 for each treatment, while the 
constrained estimate forces 4 to be the same for all 
treatments. The Nash model computes the likelihood 
function using the Nash equilibrium choice probabili- 
ties, which correspond to the limit of the QRE choice 
probabilities when 4 — oo. 

Columns 3 and 4 give (f*, 9"), the estimated choice 
probabilities under the various models. Columns 5 and 
6 give (p,q), the empirical relative frequencies ob- 
served in the experiment.*? Column 7 gives the max- 
imum likelihood estimate of à, and column 8 gives 
minus the value of the log likelihood function for the 
model, evaluated at the maximum likelihood estimate 
of A. 

The constrained model is nested in the unconstrained 
model, so we use a likelihood ratio test to test for model 
rejection. The chi-square statistic (twice the log of the 
likelihood ratio) is given in the last column in Table 4. 
While the unconstrained model fits slightly better in all 
three cases, the improvement in fit is insignificant (at 
the 10% level) for two of the treatments (bimodal and 
unimodal). Only with the uniform treatment is it sta- 
tistically significant (at the 1% level), but even for this 
case, the improvement in fit is of little real consequence, 
as the implied differences for choice probabilities be- 
tween the two models are negligible. 


21 The data for QRE estimation mclude only observations from the 
neutral context treatment, m order to avoid confoundmg compar- 
isons across the primary treatments (subject pool and distribution of 
voters). The estimates for those data are virtually the same as the 
estimates for the uniform treatment 


a Two hundred seventy-two fewer observations than previous row. 


The Nash equilibrium model is also (approximately) 
nested in the unconstrained model, so we can again 
use a likelihood ratio test to test for model rejection. 
The Nash model is easily rejected for all treatments, 
and the differences are statistically significant at any 
conventional level. 


EXPERIENCE AND LEARNING 


We investigate learning at a macro scale, simply asking 
whether aggregate behavior was different after subjects 
had a chance to observe the pattern of behavior of 
their opponents. Recall that our design used random 
matching, so that subjects were not trying to outguess 
an opponent based on observation of that opponent’s 
play. Instead, the subjects were receiving information 
about he average play of the population over time. For 
this reason, this subsection focuses on changes in aver- 
age play over the course of a session. 

We divide the data into two data subsets; we call one 
experienced and the other inexperienced. Since subjects 
played 100 rounds in each candidate’s role, we define 
inexperienced rounds to be the first 50 rounds a sub- 
ject was in a particular round and define experienced 
rounds to be the remaining 50 rounds a subject was 
in that role.” There were clear and significant trends 
in the data, with the experienced data being closer 
to Nash equilibrium and also fitting the QRE model 
better. Also, in all cases the estimate of i increases 


Z In the case of the experiment that crashed, we lost 34 rounds of 
experienced data. 
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FIGURE 3. QRE Estimates and the Data: The 
Effect of Experience 
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with experience, and the changes are significant at the 
1% level or better. However, in most cases, the actual 
movement in the aggregate choice probabilities was 
not very large. Table 5 displays the estimates broken 
down separately by treatment and by experience level. 
Figure 3 compares the inexperienced and experienced 
data in a graph similar to Figure 2. 


HETEROGENEITY, SUBJECT POOL 
EFFECTS, AND CONTEXT EFFECTS 


This section examines variation in choice behavior 
across subjects. We find evidence for heterogeneity. 


Even with the heterogeneity, all of the comparative 
statics results are still supported in the data. This is 
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important to note because the statistical tests in the 
earlier section assume that all observations are inde- 
pendent, and therefore the significance levels are in- 
flated. One way to adjust for this is to conduct simi- 
lar tests with the individual-level data, comparing the 
population distribution of choice probabilities across 
samples, using nonparametric statistics. This is what we 
do here, with the pooled sample of individuals. Figure 4 
shows the cumulative distributions of individual choice 
frequencies by treatment and by role. For example, 
in the uniform treatment, there were 32 subjects, so 
the graph shows 32 center choice frequencies for the 
individuals when thèy were A players and 32 choice 
frequencies for the same individuals when they were 
D players. Each point on the graph gives the rela- 
tive frequency (of 100 moves) that a particular indi- 
vidual chose the center strategy in a particular rule. 
The points are ordered by relative frequency (not sub- 
ject), so that the curves represent empirical cumulative 
distribution functions (CDFs) of individual choice fre- 
quencies. There is a clear ordering of these empirical 
CDFs as hypothesized. A second issue with hetero- 
geneity arises because we used two separate subject 
pools. 

This feature of the design was implemented as a ro- 
bustness check. The student populations (and culture) 
at UPF and CIT are different in many ways, but the 
theoretical model is intended to apply to both subject 
pools, so we do not predict a difference. Figure 5 dis- 
plays the UPF and CIT data as well as the Nash pre- 
dictions and the QRE estimates. 





FIGURE 4. Cholce Frequencies: Indtvidual-Level Data 


Individual data: Uniform Voter Distribution 

















Individual data: BImodal Voter Distribution 
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Individual data: Unimodal Voter Distribution 
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FIGURE 5. Subject Pool and Context Effects 
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Indeed the behavior in the two subject pools is very 
similar, with some small quantitative differences. There 
is one reversal of the sign predictions, which occurs in 
the CIT data. That reversal is crr > ĝm,crr, but the 
difference (—0.016) is not significant at the 5% level. 

The uniform session conducted with a political con- 
text yielded results similar to those of the other uniform 
sessions. The relative frequencies for A and D locating 
in the center position were 0.652 and 0.278, respec- 
tively. This datapoint is displayed as a shaded triangle 
in Figure 5. To test for significant differences between 
this and the neutral uniform sessions, we used the Logit 
equilibrium as the structural model and conducted a 
likelihood ratio test between the constrained and the 
unconstrained estimates. The chi-square statistic is .88 
with one degree of freedom, which is insignificant at all 
conventional significance levels (p * .5). 


CONCLUSIONS 


The results of this laboratory experiment provide 
strong support for the theoretical equilibrium effects of 
candidate quality on policy location. The central predic- 
tions of the theory are the quality divergence hypoth- 
esis and the polarization hypothesis. That is, (1) both 
candidates diverge from the center, with the weaker 
candidate diverging more than the stronger candidate; 
and (2) as the distribution of voters becomes more 
spread out, both candidates moderate their positions. 
The design allowed us to test these key predictions 
about how endogenous variables (candidate locations) 
covary with candidate quality and with the distribution 
of voters. All of these predictions were supported by 
the data. Altogether, the design and the model offer 21 
predicted sign differences in the observable choice fre- 
quencies by the candidates, across the three treatments. 
Every single one of these 21 predicted sign differences 
was the correct sign according to the theory, and all of 
these differences were statistically significant. 

The quantitative predictions of the theory were also 
the right order of magnitude, but there were two in- 
teresting biases that were observed. First, we found 
that when subjects were in the role of the advantaged 
candidate they were more responsive to the changes 


in the distribution of voters than when they were in 
the role of the disadvantaged candidate. As a result the 
Nash equilibrium predictions fit the data for advan- 
taged candidates better than the data for the disadvan- 
taged candidates. Second, on average, subjects in both 
roles adopt more moderate positions than predicted by 
the model. We show that both of these observations can 
be accounted for very well by a bounded rationality 
version of Nash equilibrium, called quantal response 
equilibrium. 

The experiment not only shows robustness of the 
model with respect to bounded rationality, but also 
demonstrates robustness of behavior across subject 
pools and with respect to experimental context. The lat- 
ter is important for considerations of external validity 
of the results. Neither of the two secondary treatments 
(subject pool and context) had any effect on the qual- 
itative conclusions of the experiment. The behavior in 
the two subject pools (from two different countries) 
was qualitatively identical and quantitatively very sim- 
ilar. Behavior in the political context treatment was 
not significantly different from behavior in the neutral 
context sessions. This suggests robustness of the basic 
game theoretic predictions of the model, at least with 
respect to these sorts of considerations, and this ro- 
bustness leads us to be optimistic about its relevance to 
electoral politics. 


APPENDIX: SAMPLE INSTRUCTIONS 


Welcome to the SSEL Lab. Please do not do anything with the 
computer equipment until you are instructed to. Please put 
all of your personal belongings away, so we can have your 
complete attention. Raise your hand if you need a pencil. 
Feel free to adjust your chairs so they are comfortable for 
you. 

This is an experiment in decision making, and you will be 
paid for your participation in cash, at the end of the experi- 
ment. Different subjects may earn different amounts. What 
you earn depends partly on your decisions and partly on the 
decisions of others. 

The entire experiment will take place through computer 
terminals, and all interaction between you will take place 
through the computers. It is important that you not talk or ın 
any way try to communicate with other subjects during the 
experiment. 

We will start with a brief instruction period. During this 
instruction period, you will be given a complete description of 
the experiment and will be shown how to use the computers. 
If you have any questions during the instruction period, raise 
your hand and your question will be answered so everyone 
can hear. 

During the computer instruction session, we will teach you 
how to use the computer by going through a practice session. 
We will go through this practice session very slowly and it is 
important that you follow instructions exactly. Do not hit any 
keys until you are told to do so, and when you are told to enter 
information, type exactly what you are told to type You are 
not paid for the practice session. 

We will first pass out the practice experiment record sheet, 
on which you will record all of the results from this exper- 
iment. Please record your name, the date, and your Social 
Security number on the bottom of the sheet. Note that you 
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have been assigned a color, e1ther Red or Blue. The color 1s 
written on top/of the record sheet. 


[PASS OUT RECORD SHEETS] 
[WAIT FOR SUBJECTS TO RECORD 
' INFORMATION] 
[START PLDK SERVER PROGRAM ON 
SERVER, IF NOT DONE ALREADY] 


Please click on the ICON that says “PLDK client.” When 
the computer prompts you for your name, type your full name, 
your Social Security number, and click on your color. Then 
chek OK to cohfirm. If you have any questions about how to 
do this, please raise your hand. 


[WAIT FOR SUBJECTS TO LOG ON] 


You now see the experiment screen. You have each been 
assigned to be ¢ither a RED subject or a BLUE subject in this 
experiment. Your color as well as your subject ID number 1s 
shown in the banner at the top of the screen. Please record 
your subject ID number on your record sheet. 


[WAIT FOR SUBJECTS TO RECORD 
INFORMATION] 


Each of you has been matched by the computer with a 
subject of the opposite color. If you are a BLUE subject, 
you are matched with one of the other RED subjects. If you 
are a RED subject, you are matched with one of the BLUE 
subjects. 

In the upper left part of the screen, you see a table, 

[SHOW TABLE ON OVERHEAD PROJECTOR] 


Wiul all subjects now move the mouse into the table and 
click it, If you are a RED subject, one of the rows will be 
highlighted. If you are a BLUE subject, one of the columns 
will be highlighted. 

Each of you is asked to make a choice, but please do not 
do so at this tihe. If you are a RED subject, on the left of 
the screen you are asked to please choose a row. If you area 
BLUE subject, you are asked to please choose a column. The 
outcome, and your payoff, is determined by the cell in the 
table that is chosen. In each cell of the table, the first number 
is the payoff for the RED subject, and the second number is 
the payoff to the BLUE subject. 


[GO THROUGH A COUPLE CELLS 
IN OVERHEAD TABLE TO EXPLAIN] 
[WAIT FOR SUBJECTS TO HIGHLIGHT 
ROW OR COLUMN] 


Will all RED subjects now please choose “B” and all 
BLUE subjects please choose “A” by clicking the mouse but- 
ton now while the arrow is pointing to the appropriate row 
or column. After you have made your choice, you are given 
a chance to confirm your decision. If it is not correct, please 
change it. When it 1s correct, please confirm by clicking on 
“confirm” now. ' 


[WAIT FOR SUBJECTS TO CHOOSE 
AND CONFIRM CHOICE] 
[WALK AROUND ROOM TO CHECK] 


After all subjects have confirmed their choices, the match 
is over and you are shown the choice of the blue subject you 
were matched with. The outcome of the round, BA, is now 
highlighted in purple on everybody’s screen. Your earnings 
are determined by the entries in the highlighted cell of the 
table that was selected. So the payoff to a RED subject for 
the first match is 6 points and the payoff to a BLUE subject 
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is 14 points. You are not being paid for the practice session, 
but if this were the real experiment, then the payoff you have 
recorded would be money you have earned from the first 
match, in points. 

We will now proceed to the second practice match. Each 
match is the same except you are matched with a new subject 
of the opposite color. Note that the decisions and payoffs 
of the first match are recorded in the experiment history 
at the right side of the screen. The outcomes of all of the 
previous matches will be recorded at the right side of the 
screen throughout the experiment so that you can refer back 
to previous rounds whenever you hke. 


[HIT KEY TO START SECOND ROUND] 


For the second match, each of you have now been re- 
matched with a new subject of the opposite color. All RED 
subjects again choose “B” and confirm. All BLUE subjects 
choose “C” by clicking on the right column. 


[WAIT FOR SUBJECTS TO CHOOSE 
AND CONFIRM CHOICE] 


The payoff to a RED subject for this practice round is 8 
points and the payoff to a BLUE subject 1s 12 points. This 
concludes the second round. Notice that the results are again 
recorded in the history screen. Note also that the history 
screen keeps track of the number of times you have chosen 
each row or column, and of the average payoff you received 
from each row or column. For example, the red subject has 
chosen “A” twice. The first time she received 6, and the second 
tıme 8 points. So the average is 7. 


[DO FOUR MORE ROUNDS, CHOOSING 


(BA), (CC), (BB), (CA)] 
[HIT KEY TO END PRACTICE SESSION] 


This concludes the practice session. The computer screen 
now indicates your total points that you earned in the practice 
session. This is multiplied by the exchange rate to get your 
money Payoff Since this is a practice session, the exchange 
Tate is zero. In the actual experiment, the exchange rate is 
0.01, so that each point is worth one cent. 


[WAIT FOR SUBJECTS TO RECORD 
OUTCOME AND CLICK “OK”] 


Part 1 


The actual experiment consists of two parta. Each part will last 
for 100 matches. When the first part is over, we will give you 
some additional instructions before the second part begins. 
Each match will proceed as in the first practice match, except 
you will be paid one cent for each point. The table will have 
three rows and three columns, the row and column labels will 
be the same as the practice, and the payoffs in the table will 
be the same as in the practice. Also, just like in the practice 
round, you will be randomly rematched with a new subject of 
the opposite color after each match. 

The total amount you earn in this first part of the experi- 
ment is equal to the sum of your earnings in all 100 matches. 
You will be paid in cash at the end of the experiment. No 
other participant will be told how much money you earned in 
the experiment. You need not tell any other participants how 
much you earned. Are there any questions before we begin 
the expenment? 


[TAKE QUESTIONS] 


O.K., then we will now begin with the actual experiment. 
Please lower your chairs to the lowest position, and pull out 
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the dividers as far as they will go. This ensures your privacy 
and the privacy of the others in the experiment. We will now 
begin match number 1. 


[START EXPERIMENT] 
[AFTER FIRST MATCH ANNOUNCE:] 


The first real match of the experiment is over. There will be 
99 more matches in Part 1. Remember that you are randomly 
rematched with a new subject after every single match. After 
match 100 has finished, please record your total payoffs on 
your record sheet and then wait for the instructions for the 
second part of the experiment. 


Part 2 


This is the second and final part of the experiment. The 

amount of money you earn in this part will be added to the 

amount you earned in Part 1 to determine your total money 

earnings for the whole experiment. Just as in Part 1, each 

point is worth one cent. This part of the experiment is similar 

to the first part, except the payoff table has been changed in 
a very specific way. 


[SHOW NEW TABLE ON OVERHEAD 
PROJECTOR NEXT TO OLD TABLE] 


This is a payoff table that reverses the roles of Blue and 
Red. That is, Blue’s payoffs are the same as Reds payoffs were 
in Part 1, and Red’s payoffs are the same as Blue’s payoffs 
were ın Part 1. For example, suppose that Blue chooses A 
and Red chooses B. Then Blue gets 6 and Red gets 14. Now 
compare this to the payoffs in the first table, when Red chose 
A and Blue chose B. 


[PUT UP COMBINED SLIDE 
WITH BOTH PAYOFFS] 


Then Red got 6 and Blue got 14. If you look carefully at 
the new table, you will notice that it is derived from the old 
one by transposing it (that is, flipping it around the diagonal) 
and reversing the Red and Blue payoffs. 


[ILLUSTRATE HOW THIS TRANSPOSITION 
WORKS USING OVERHEADS] 


Are there any questions before we begin? 
[TAKE QUESTIONS] 


O.K., then we will now begin the second part. The second 
part will also have 100 matches. Remember that you are ran- 
domly rematched with a new subject after every single match. 
After match 100 has finished, please record your Part 2 total 
payoffs on your record sheet and then wait for instructions 
for how to be paid. We ask you to refrain from talking with 
each other, not only during the matches, but also while you 
are waiting to be paid. Thank you in advance for your coop- 
eration. 


[BEGIN SECOND PART] 
[END AFTER MATCH 100] 


The experiment is now over. Please record your Part 2 
money earnings and add them to your Part 1 money earnings. 
Enter this sum in the row labeled Total Earnings. You will be 
paid this amount of money in the next room. We will pay you 
one at a time, beginning with subject number 1. We ask you 
not to talk with each other or use the computer equipment 
while you are waiting to be paid. Subject number 1, will you 


please come with us to the next room. Please collect your 
belongings and bring them and your record sheet with you. 
You will be leaving from the outside door in the next room. 
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Citizens, Slaves, and Foreigners: Aristotle on Human Nature 


JILL FRANK University of South Carolina 


most readers, Aristotle's many references to nature throughout the first book of the Politics imply 

a foundational role for nature outside and prior to politics. Aristotle, they claim, pairs nature with 

necessity and, thus, sets nature as a standard that fixes the boundaries of inclusion and exclusion 

in political life. Through readings of Aristotle on the nature of citizens, slaves, and foreigners in the 

Politics, this essay argues, in contrast, that, to Aristotle, nature, especially human nature, is changeable 

and shaped by politics. Through an analysis of Aristotle’s philosophical and scientific treatments of 

nature in the Metaphysics and Physics, this essay demonstrates that in order to preserve what he takes 

to be characteristic and also constitutive of a distinctively human way of living—prohairetic activity— 
Aristotle is especially keen to guard against any assimilation of nature to necessity. 


Man is by nature a political being. 
Aristotle, NE 1097b12, Pol. 125382! 


nature, phusis, throughout the first book of the 

Politics, imply a foundational role for nature 
prior to politics? Politics I is important, they claim, 
because it pairs nature with necessity and sets nature 
as a standard that fixes the boundaries of inclusion 
and exclusion in political life.? Aristotle’s defense of 
natural slavery provides the primary evidence for this 
interpretation. Treating natural slaves as necessary to, 


T: most readers, Aristotle’s many references to 
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ted. 

I use translations of Anstotle’s texts by Jowett 1996 and Rackham 
1977, in the case of the Politics, Ackrill and Urmson 1980 and 
Rackham 1982, in the case of the Nicomachean Ethics, Hicks 1991 in 
the case of De Anima, Tredennick 1980 in the case of the Metaphysics, 
and Waterfleld 1996 in the case of the Physics, at times modified. 

2 Wayne Ambler (1984, 487) counts 86 references in Book I to “words 
based on the root ‘nature.’” 

3 Some scholars find Aristotle’s science of nature to be outdated, 
discredited, and altogether unacceptable, and so reject his account 
of nature and the politics and ethics to which it is linked 

1990, 44, 98-99; Williams 1995, 199, 201; contra Arnhart 1998; Bolotin 
1997, 2; Park 1997). Others reject Aristotle’s science of nature but 
remain committed to his ethics and politics and sever the latter from 
the former (Salkever 1990a, chap. 1). Still others see in Aristotle’s 
science of human nature nch resources for his political and ethical 
philosophy. Of these, some endorse what they take to be Anstotle’s 
elitist exclusion of all but a few aristocratic men from parti on 
in a political life (Miller 1979; Strauss 1964, chap. 1; Winthrop 1975). 
Others argue the opposite, namely, that Aristotle’s understandmg 
of human nature is less hostile than is generally thought to women 
(Nichols 1992, chap. 1; Salkever 1991, 165-90, Saxonhouse 1985, 
chap. 4) or slaves (Booth 1993, chap. 2) Still others split the differ- 
ence, endorsing Aristotle’s philosophical account of human nature, 
while deploring some of his political applications of it (Nussbaum 
1995, 87, 120). The secondary literature on Anstotle’s Politcs I is 
rich, but few find in that text an account of the politics of slavery. 
Instead, they find an awkward preface to politics in slavery. 


but incontrovertibly distinct from, the free life of pol- 
itics, Aristotle maintains that, in virtue of their na- 
tures, natural slaves must be excluded from citizenship.‘ 
Richard Kraut (2002) encapsulates a familiar interpre- 
tation when he claims that “[a]t least this much is clear 
and uncontroversial”: At birth, some are suited to be 
slaves; these individuals lack the faculty by which most 
people reason; and nothing can be done to undo the 
deficiency with which they are born (282). Kraut, along 
with most readers, takes the content and signification 
of “nature” in Aristotle’s defense of natural slavery to 
be necessary and, somehow, given. Malcolm Schofield 
(1999) does as well, claiming that “the natural princi- 
ples from which Aristotle derives his theories of slav- 
ery” are among the first principles of philosophy; “in 
Pol. I [Aristotle] is arguing from first principles” (215 


With the distinction between necessity and freedom 
secured by nature, and with hierarchy thus established 
in Book I, Aristotle can move on, in the rest of the 
Politics, to engage the real business of politics, includ- 
ing citizen identity, regime formation and change, and 
revolution. Where nature appears later in the Politics, 
as it does, for example, in Aristotle’s identification of 
certain foreigners as natural slaves (Pol. 1285a20-24, 
1327b27-28), it simply serves to confirm the lesson of 
Book I, namely, that nature secures the distinction be- 
tween free and unfree (Smith 1991). This reading of 
Politics I is powerful. It seems to explain why Aristotle 
began his treatise on politics with an account of nature 


4 This s true of other household members as well, though in different 
ways and for different reasons. Aristotle’s focus on these relations 
of necesaity in the household have led some scholars to distinguish 
two different beginnings to Aristotle’s Politics: Book I is about “The 
Origin of the City in the Body”; Book II, about “The Origin of the 
City in Thought” (Nichols 1992, chap. 1). 

3 Anstotle claims that “from the hour of their birth, some are marked 
out for subjection, others for rule” (Pol. 1254823-24), but he ends that 
passage with the claim that nature cannot make this mark plain. For 
discussion, see subsection “Slaves,” below. 

6 See, in contrast, Mara (1998), who argues for the importance of 
Aristotle’s dialogic approach in establishing hrs epistemological po- 
sitions. Although Schofield and Kraut both pair nature and neces- 
sity, they differ on the grvenness of Anstotle’s account of nature. 
For Schofield, nature is grven insofar as it is determined by the first 
principles of Aristotle’s metaphysics, whereas for Kraut nature is 
given insofar as it 1s determined at birth I think both accounts are 
mistaken. 
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and it vests nature with a normative force that justifies 
the political exclusions that follow. Aristotle may be an 
ancient thinker, but the model he is said to provide— 
in which nature serves as an incontrovertible ground 
and guide to the boundaries of politics—temains pow- 
erful in our own time. The legacy of this understanding 
may be seen in projects of imperial rule in the name 
of a government of natural superiors over their natu- 
ral inferiors, exclusions from the spheres of labor and 
politics of certain immigrants in the late nineteenth- 
century United States, and explanations of academic 
performance in terms of natural intelligence. Thus, na- 
ture continues to be used to do the work readers of 
Aristotle see it doing in him: setting the boundaries and 
hierarchies of citizenship, participation, membership, 
and belonging. 

I think this reading of Aristotle on nature is wrong. 
Aristotle is, to be sure, concerned with necessity and hi- 
erarchy in Politics I, indeed, with the necessity of hierar- 
chy to politics, and he uses nature to frame this concern. 
However, on my reading, Aristotle does not use nature 
to establish the prepolitical and necessary conditions 
of politics. He treats nature, instead, as a question for 
politics. He thereby divests nature of the moral author- 
ity usually granted to it; subjects to scrutiny the exclu- 
sions said to be secured by that authority; and, placing 
authority in those who establish the hierarchies of pol- 
itics, namely, rulers and citizens, renders them account- 
able for those hierarchies. By challenging the dominant 
appreciations of Aristotle’s account of nature in these 
ways, this essay contributes to Aristotle scholarship. By 
opening a new way of thinking about nature and a more 
complex understanding of the relation between nature 
and politics, it challenges the commitment, still preva- 
lent today, to the naturalization of racial and ethnic 
distinctions. - 

Throughout Politics I, Aristotle uses language from 
his Physics both to secure nature’s ability to underwrite 
politics and also, simultaneously, to call this ability into 
question: What nature wants, he says at least twice in 
the course of his discussion of natural slaves, it may fail 
to achieve (Pol. 1254b26-32, 1255b3). Nature, Aristotle 
implies, cannot stand as a guarantee. Unable to sustain 
itself, it must, rather, be sustained by something else, 
namely, as the context of Aristotle’s discussion suggests, 
by politics. This is not to make politics prior to, or more 
fundamental than, nature or to say that nature is wholly 
political. It is rather to call attention to the complex 
relation Aristotle sets up between politics and nature. 
Human nature may be a measure of politics but the 
fact that we are, in Aristotle’s terminology, naturally 
political beings (NE 1097b12, 1169b20, Pol. 1253a2, 
1253a7-8, 1278b19) suggests that human nature is also, 
at least in part, constituted politically. Nature is thus 
not immutable but changeable, and this means that the 
hierarchy it underwrites, though necessary to politics, 
will be changeable too.’ 


7 Swanson (1999, 225) also argues that Aristotle understands na- 
ture to be changeable but claims that he presents his conclusions 
based on nature dogmatically for two reasons: first, because only the 
few philosophers can properly appreciate nature’s changeability and, 
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That hierarchy is necessary to politics is clear enough. 
Politics depends on rule, arche, which is to say, on ruling 
and being ruled. Without hierarchy, then, there can be 
no political association. Political association also de- 
pends on the freedom of its members, and so a second 
hierarchy is necessary, one that distinguishes free from 
unfree (Pol. 1255b18-19). On Aristotle’s account, this 
second hierarchy is secured by slavery, which is nec- 
essary to free masters from meeting their daily needs 
so that they can, as rulers or citizens, practice politics 
(and philosophy) (Pol. 1255b35—38; see also 1328a34— 
36, 1329a35—36). To some readers, Politics I is about this 
second hierarchy only, whereas the rest of the Politics 
is concerned with the first hierarchy, the one within a 
political association, between rulers and ruled. A brief 
look at the dangers of hierarchy, brought to light in 
Aristotle’s discussion of slavery, suggests a more com- 
plex picture. That hierarchy can be dangerous to slaves 
goes without saying. This is why Aristotle is centrally 
concerned in Politics I with the justice of slavery (Pol. 
1.5-6). Perhaps more important to Aristotle, however, 
is that slavery can also be dangerous to those who 
stand most to benefit from it: rulers and citizens, When 
rulers or citizens act only as masters, Aristotle notes, 
hierarchy ceases to be properly political and becomes, 
instead, despotic (Pol. 1292a14-38, 1295b20-24).8 This 
is why Aristotle makes a point of distinguishing the 
rule of masters from political rule (Pol. 1252a7-17). 
Dangerous, too, is when rulers or citizens act as slaves, 
for then there ceases to be a distinction between free 
and unfree, and hierarchy collapses '(Pol. 1277b5-7). 
Where there is no hierarchy, as where hierarchy is all 
and only despotic, political association ceases. Aristo- 
tle’s account of slavery, this suggests, is not only about 
the necessity of slavery, or only about securing the hi- 
erarchy between slaves and masters. It is also about the 
dangers of slavery to citizens and rulers, which is to say, 
of the politics of slavery. 

Politics I, concerned with the nature of hierarchy 
and its justice, is, on this reading, hardly a false start 
to Aristotle’s engagement with politics. On the con- 
trary, Aristotle raises fundamental questions of politics 
through his discussions of natural slavery. Against those 
who take Aristotle’s account of nature in his ethical 
and political writings to be static and straightforward 
and, also, against those who take it to be equivocal and 
questionable (Annas 1993, 146; Irwin 1985, 416-17), I 
demonstrate that it is, rather, dynamic and complex, 
unified, and continuous with his scientific, metaphys- 
ical, and psychological writings on nature. Aristotle’s 
discussion of natural slaves may, as we will see, be filled 
with inconsistencies but this is no reason to dismiss it as 
incoherent (Garnsey 1996, 107, 125; Smith 1991) or to 
resolve it into “clear and uncontroversial” propositions 


second, to discourage “political challenges to the natural order m 
the name of progress or freedom.” I argue, in contrast, that Aristo- 
tle presents his conclusions based on nature imprecasely. I therefore 
disagree with Swanson’s assessment and explanation of Anstotle’s 
rhetorical approach to nature. 

8 See Davis (1996, 23-24) for what he calls the “tragic implications 
of [hierarchy’s] unlimited extension.” 
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(Kraut 2002, 282). Aristotle’s imprecisions, there, as 
elsewhere in his ethical, political, and natural scientific 
writings, are, I argue, better read as accurate reflections 
of the nature of beings who act through and change 
over time. To develop this relation between human na- 
ture and activity, the first section of this essay turns 
to Aristotle’s discussions of citizenship and slavery. To 
establish Aristotle’s categorical separation of nature 
from necessity, the second section examines his philo- 
sophical treatment of nature. The final section extends 
my interpretation to Aristotle’s treatment of certain 
foreigners and draws some lessons for politics. 


THE NATURE OF IDENTITY 


Citizens 


To ask who is a citizen, as Aristotle does at the start 
of Politics III, is to ask about the identity or nature 
of a citizen.? In Aristotle’s hands, this is to ask who 
deserves to be a citizen or who merits the political good 
of citizenship. Aristotle answers by saying what will not 
qualify someone for citizenship: not place, or location, 
or the capacity to sue and be sued (Pol. 1275a7-11); not 
birth, ancestry, or blood (Pol. 1275b32-34). Rather, a 
citizen is one who participates in ruling and judging 
(Pol. 1275a22-23), one who rules and is ruled in turn 
(Pol. 1277b13-16), one who shares in the judicial and 
deliberative offices of a polity (Pol. 1275b18-20). Place, 
legal capacity, birth, and parentage—as static qualities 
and/or markers of status—do not demonstrate merit 
in Aristotle’s view. Although there may be subtle dif- 
ferences among the formulations Aristotle approves, 
they share an emphasis on activity: “Sharing in a consti- 
tution,” in Malcolm Schofield’s (1999, 144-49) phrase, 
qualifies one for citizenship.’ 

Aristotle’s emphasis on activity has a curiously tauto- 
logical or self-contained quality. Practicing citizenship, 
Aristotle seems to be saying, makes someone a citizen: 
A “citizen is a citizen in being a citizen” (Winthrop 1975, 
407). This circularity is a feature not only of Aristotle’s 
understanding of citizenship but of all human activity. 
In doing, he says, “the end cannot be other than the act 
itself” (NE 1140b6). Activity, energeia or entelecheia, 
is that which has, echein, what is aimed at—an end or 


9 That the “Who 1s?” question 1s one about nature and identity is 
substantiated by the translations Barker (1969, 92) and Rackham 
(1977, 173) ask about the “nature” of the atizen; Jowett (1993, 51), 
Reeve (1998, 65), and Robinson (1995, 3) ask, “Who is a citizen?”; 
Lord (1984, 86) asks “what the citizen is.” 

10 See Nichols (1992, 55-61) for a discussion of the differences among 
these formulations. To call citrrenship a practice of “sharing in a 
constitution” is different from what we today might call a “right” 
to jom m collective deliberation, which, to us, guarantees our dt- 
wenship. Whereas having the power or potentiality (erouna, what 
is sometimes translated as a “rght”) to participate m a constitution 
is certainly part of Aristotle’s account, exousia 1s always referred 
to activity, Le, the practice of citizenship itself As Schofield puts it 
(1999, 149), citing Politics 1275a34. “Those freeborn natives excluded 
from the assembly or court membership may be called citizens in 
an oligarchy, but they are not really so.” On the relation between 
potentiality and actuahty, see subsection “The Power of Activity,” 
below. 
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telos—in, en, itself (Meta. 1050a23—24). Although self- 
contained, human activity is not invulnerable to exter- 
nal influences. There is no carrying out one’s citizenship 
in a vacuum. Indeed, Aristotle insists that there can be 
no citizen qua citizen prior to the regime of which that 
citizen is a part (Pol. 1275a3-4). For this reason, he 
pursues his investigation of citizenship by asking who 
is a citizen of a dernocracy or of an oligarchy. Being a 
citizen is regime-dependent not least because what it 
means to share in a constitution largely depends on the 
laws, education, and other social and political institu- 
tions of that particular constitution.!! These institutions 
all contribute to the making of citizens (Pol. 1275b4). 
Being a citizen, this suggests, is a complex combination 
of doing on the part of citizen practitioners and making 
on the part of social and political institutions. 

At the start of his inquiry into citizenship, however, 
Aristotle says that it is important to leave to one side 
“those who have been made citizens, or who have ob- 
tained the name of citizen in any other accidental man- 
ner” (Pol. 1275a5—7). This sentence is key. It carves out 
what, for philosophical reasons, Aristotle thinks ought 
not to be included in an inquiry into the identity or 
nature of a citizen. To be excluded, as already noted, are 
those who are “made” citizens by the accidents of birth, 
ancestry, parentage, or location. That is clear enough. 
But, against the backdrop of Aristotle’s ready acknowl- 
edgment of the role of social and political institutions 
in the making of citizens, how are we to understand 
Aristotle’s apparently sweeping exclusion of all “made 
citizens”? He offers the following examples. To be ex- 
cluded from consideration of the nature of a citizen, 
Aristotle says, are those who have been made citizens 
“by the magistrates,” a kind of making he analogizes 
to the production of artifacts, specifically, kettles (Pol. 
1275b29-30); and those who have been made citizens 
“after a revolution” (Pol. 1275b35-36).4 As with the 
granting of legal rights under a treaty (which, as “the 
capacity to sue and be sued,” Aristotle rejects as a qual- 
ification for citizenship), these are examples of citizens 
having been made citizens, one might say, ex nihilo: by 
being so named by a magistrate, or by fiat after a revolu- 
tion, or by the force of legal treaty alone. Aristotle does 
not identity those who are made citizens in any of these 
ways as citizens for the same reason he excludes those 
who are made citizens by accident: Their citizenship 
does not come about in virtue of their own activity. It 
is rather granted to them. 





1 This is not always the case: Aristotle calls Theramenes an ex- 
emplary citizen in the Consttution of Athens for refusing to follow 
the laws of the polity. Being a good citizen calls for disobeying the 
laws when there ceases to be a difference between the polity’s laws 
and force, when nomos becomes bia See, for discussion, Frank and 
Monoeon (2003). 

12 Winthrop (1975, 410) explains the pun on Lanssaeans that names 
both the people and the artifact. 

B Aristotle is referring here to the foreigners and alien slaves sup- 
posedly enrolled by the Athenian reformer Cleasthenes, who were 
thus made citizens “in one stroke,” after the expulmon of the tyrant 
Hippias ın 510 B.CE. See, for discussion of the mterpretative con- 
troversies around this example, Manville (1997, 173-209, esp. 191). 
See also Constitution of Athens, 20.1, 21.2, and Politics 1319b19-27. 
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Aristotle ‘includes in the proper making of citi- 
zens, laws, education, and other social and political 
institutions because, unlike treaty, revolution, or mag- 
isterial edict, which like accident or force, make irrel- 
evant the activity of a citizen, a polity’s institutions do 
not make that activity irrelevant but rather supervene 
upon or guide it (Pol. 1258a22-23). Indeed, it is im- 
possible to understand a citizen’s identity without tak- 
ing into account the ways in which it has been shaped 
by these institutions (Salkever 1990b, 176; Smith 2001, 
23-26). Citizen identity is, then, a product of making 
and doing, where doing is a kind of self-making (by 
sharing in the constitution, I make myself a citizen) 
and making, as the guided shaping by laws, educa- 
tion, and other institutions, entails citizenly doing. Ac- 
cident and force must be pushed to one side when in- 
vestigating the nature of the citizen because they make 
irrelevant what is at the heart of both formations of 
citizen identity: the dynamic and reciprocal relation 
between identity and action, between doers and their 
deeds. 

Citizens are made citizens not only by their particular 
or individual activities but by sharing in a constitution, 
in other words, by their collective activity (Pol. 1275b4— 
6). At the same time, collective activity produces the 
social and political institutions that contribute to the 
making of citizens in the first place. If, to be citizens, 
citizens must act as citizens, they do so not only in- 
dividually but also in their collective action, by which 
they make for themselves the social and political in- 
stitutions that also help make them. Taking democ- 
Tacy, with Josiah Ober (1996), to be constituted neither 
by institutions alone nor by popular action alone but 
rather by “dynamic tensions” between institutionaliza- 
tion and participation (31), there is something nicely 
democratic about Aristotle’s understanding of citizen 
identity, read in this way.'* Citizenship is a matter of 
individual self-determining activity and it is participa- 
tory. By acting in concert, sharing in their constitution, 
citizens make the institutions that, in turn, as institu- 
tions, guide, but do not fully determine, their individual 
activity. 

Reading Aristotle on citizenship and political partic- 
ipation calls for attending not only to those he includes 
but also to those he excludes, specifically to those he 
excludes not because of what they do (shopkeepers, 
craftsmen) but, ostensibly, because of who they are: 
women, foreigners, and slaves. For most readers, it is 
because Aristotle takes the nature of these individu- 
als to be essentially and necessarily different from the 
nature of citizens that they must be excluded from po- 
litical participation." I disagree. To explain why, I turn 
next to an extended treatment of Aristotle’s account 
of slavery in'Politics I, the hard case for the claim I am 
making about nature.!¢ 





14 Ober (1998, chap. 6) does not read Aristotle in this way. 

15 Even as they condition its possibihty for others: Politics, 1328a34— 
36, 1328b19-22, 1329a35-38. 

16 Scholars who argue that Anstotle’s treatment of the nature of 
women is more complex than 1s usually appreciated also nicely prob- 
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Aristotle opens his discussion of slavery with the ques- 
tion: Who is a slave? As he does in the case of citizen- 
ship, he analyzes this as a question of justice, that is, 
in terms of desert or qualification. Rejecting parentage 
or ancestry (Pol. 1255b1-3) and convention (which he 
calls nomos and equates with violence or force, bia) 
(Pol. 1255615) as inadequate justifications of slavery, 
Aristotle pushes to one side, as he does when he dis- 
cusses citizen identity, those who have been made slaves 
by accident or by force. The significance of these moves 
on Aristotle’s part should not be underestimated. That 
Aristotle parses slavery as a question of justice, which 
he treats as the key question for politics (Pol. 1255a7— 
17) and “a question for political philosophy” (Pol. 
1282b23), signals that he intends to give it careful po- 
litical and philosophical consideration. That he rejects 
as unjust all forms of enslavement by force shows that 
he is prepared to challenge the predominant form of 
slavery in ancient Greece, which was the enslavement 
of foreigners captured by war or kidnaped by pirates 
and their descendants (Kraut 2002, 280; MacDowell 
1978, 79). For these reasons, those who read Aristotle 
as simply a product of his times (Annas 1993, 153, 155; 
1996; Saunders 1995, 79-83) or as merely an apolo- 
gist for the institutions of his regime (Waldron 1992; 
Williams 1993, 103-29) are mistaken.” 

In the light of the structural similarities between his 
accounts of slave and citizen identity, one might expect 
Aristotle to draw the same conclusion in the case of 
slavery that he draws in the case of citizenship. If being 
a citizen is to be understood in terms of citizen activity, 
then being a slave is to be understood in terms of slave 
activity. If citizen activity (including how this activity 
is guided by a polity’s social and political institutions 
but nothing accidental, forced, or necessary) defines 
the nature of a citizen, then slave activity (similarly 
understood) should define the natural slave. 

These are exactly Aristotle’s conclusions. He says, 
“The good man and the statesman and the good citizen 
ought not to learn the crafts of inferiors except for their 
own occasional use; if they habitually practice them, 
there will cease to be a distinction between master and 
slave” (Pol. 1277b5—7). He warns against including in 
the art of household management knowledge on the 
part of the master of how to do the tasks of slaves 
(Pol. 1255b23-38), and he warns his audience of free 
citizens in the Nicomachean Ethics against engaging in 
slavish kinds of activities (VE 1118a23-b4, 1118b21, 
1128a22).!8 Aristotle demands this sort of vigilance 
on the part of masters, citizens, and rulers because, as 


lematizs the usual ons of Anstotle’s view of the nature of 
slaves (Nichols 1992, 19-24; Saxonhouse 1985, 68-71) 

17 For an excellent challenge to the common views that take Ansto- 
tle’s teaching on natural slavery to support actual slavery, see Ambler 
1987. 

18 Not, to my knowledge, noticed by scholars, Aristotle uses different 
words when referring to those who are slavish through the practice of 
vice (andrapadoudou root) and those whom he calls naturally slavish 
(doulos root). Whereas this may be read as evidence that Anstotle 
assumes an essential and necessary difference between these two 
ways of being a slave, the first being reversible, the second not, it might 
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these examples suggest, performing the activities of a 
slave can make one a slave. The reverse seems to be 
true as well: Presuming the capacity to cease being 
a slave, Aristotle maintains that it is appropriate to 
hold out to slaves the promise of their freedom (Pol. 
1330a33-34). Insisting that friendship is not possible 
with a slave asa slave but that it is with the slave as a per- 
son (NE 1161b5-6), Aristotle holds out the possibility 
that a slave can become a person worthy of friendship, 
Aristotle’s model for free politics. 

Aristotle may take slavery to be necessary to politics, 
but all this suggests that there is nothing immutable that 
singles out any particular person as a slave. Instead, 
slave identity, like citizen identity, is determined by ac- 
tivity. If this is right, then there is no “permanent and 
complete” difference between slave and citizen (con- 
tra Nichols 1992, 6).!° Aristotle’s account of slavery in 
Politics I, accordingly, serves not to describe and set 
apart a domain that is pre- or nonpolitical, but to warn 
his audience of free citizens of their vulnerability, not 
only to accident and force but, more importantly, to 
the power of acting in shaping their political destinies 
(Davis 1996, 22; Mara 1995, 286, 296). It follows that 
when I make myself a citizen or a slave in virtue of my 
own activity, it is just to so treat me. 

There is, however, a fundamental difference between 
citizens and slaves in this regard: The social and political 
institutions that supervene upon self-determining ac- 
tivity to produce citizens as citizens and slaves as slaves 
are the products of citizen activity alone. Insofar as I 
am produced as a slave by social and political institu- 
tions in whose making I have not myself participated, 
I am made a slave independently of my own activity. I 
am, therefore, by the terms of Aristotle’s own account, 
made a slave by accident or, more likely, by force. As we 
have seen, Aristotle insists that the effects of accident 
and force are to be left out of a consideration of the na- 
ture of identity. A study of slaves produced as slaves by 
coercive social and political institutions, then, reveals 
little about the nature of slaves. It does, however, reveal 
something about those who create such institutions, 
namely, that they confuse political rule with mastery, 
a science Aristotle refers to as servile (Pol. 1255b30— 
35). Citizens or rulers who act as masters show 
that they are prepared to rule despotically, which, for 
Aristotle, is the unmaking of their polity (Pol. 1292a14— 
38, 1295b20-24). The practice of slavery and its institu- 
tion, though necessary to free citizens individually and 
collectively, are also, and at the same time, dangerous 
to the very freedom they secure. 

Against the backdrop of this reading of Politics 1, 
how might we understand Aristotle’s defense of natural 
slavery? Appearing to carve out a category of nature 
defined independently of activity, it seems to display 


a a E 
just as well be a symptom of Aristotle’s worry that the phenomena 
are not different; hence the need to enforce, nomunally at least, a 
strict boundary between them. 

19 Though she also says that the distinction is a matter of degree 
(1992, 184 n.2). 

2 For this reason, Aristotle recommends that those in a position 
to occupy themselves with philosophy or politics have stewards to 
attend to the management of their households (Pol. 1255b35—37). 


a lack of parallelism with his treatment of citizenship. 
This appearance is deceptive. Aristotle’s discussion of 
who is by nature a slave imports into the Politics lan- 
guage he has introduced in the Physics. Parsing this 
question along two axes, he asks whether nature as 
matter, meaning physical bodies, will distinguish slaves 
from nonslaves; and he asks whether nature as form, 
meaning soul, will do the trick. In Politics I Aristotle 
has insisted that nature makes nothing in vain (1253a9). 
On the contrary, nature makes things to particular uses 
and so should mark a slave in a way that shows him to be 
fit for use as an object of property by giving him a body 
suited to menial chores. Aristotle notes, however, that 
although “nature would like to distinguish” slaves from 
nonslaves on the basis of physical appearance, nature 
can fail to do so, giving slaves, instead, the bodies of 
freemen (Pol. 1254b26-32). Under its material aspect, 
as body, nature cannot tell us who deserves to be a slave. 

Most scholars agree that under its formal aspect, 
as the soul, nature does a better job of distinguishing 
slaves from nonslaves. They claim that, to Aristotle, it is 
the absence of the faculty of deliberation, a deficiency 
of the soul or, in the terminology of De Anima, a first- 
level incapacity, that makes natural slavery natural. 
Aristotle’s examination of the soul of the slave is not so 
clear, however. He says that slaves lack the deliberative 
element (Pol. 1254b22-23, 1260a12-13) but also that if 
they could not deliberate at all they would not be able to 
execute their masters’ orders (Pol. 1260a1); he says that 
slaves are not capable of self-rule (Pol. 1254b16—21) but 
also that they have the excellence necessary to prevent 
them from failing in their function through lack of self- 
control (Pol. 1259b22-28, 1260a1-3); he distinguishes 
slaves from children on the ground that, unlike slaves, 
children possess the deliberative element (albeit in an 
immature form) (Pol. 1260a13), but he also insists that 
the proper response to slaves, even more so than to chil- 
dren, is admonition rather than command alone (Pol. 
1260b5-7); he says that slaves are essentially not-form 
and instead simply matter or bodies waiting for minds 
as form to impose order upon them (Pol. 1252a31- 
34, 1254b15~20) but also that, as human beings, they 
are constituted by matter and form (Pol. 1254a32-34) 
and share in the capacity to reason (Pol. 1259b29). On 
the basis of these inconsistencies, some scholars dis- 
miss Aristotle’s account of natural slavery as incoher- 
ent (Garnsey 1996, 107, 125; Smith 1991). Aristotle is 
not, however, unaware that his examination of the soul 
of the slave pulls in different directions. He maintains, 
by way of response, that “beauty of soul is not seen” 
(Pol. 1255a1). Because the soul is not visible to the eye, 
Aristotle’s answer to the question of who is a slave by 
nature in terms of soul can be no more conclusive than 
was his answer to who is a slave by nature in terms of 
body. 

Aristotle, nonetheless, concludes that “it is clear, 
then, that some men are by nature free, and others 
slaves, and for these latter slavery is both expedient 
and just” (Pol. 1255a1-2). If natural slavery is not de- 
termined by an immutable physical deficiency and the 
soul’s invisibility makes it impossible to know whether 
natural slaves suffer from an immutable psychological 
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deficiency, what, then, makes slavery natural? Aristo- 
tle’s answer is that the master-slave relation is natural 
when it benefits both individuals involved, when the 
interests of the slave and of the master are the same 
(Pol. 1255b13-14, 1252a35-b1). The circumstances in 
which slavery might benefit a master are clear enough. 
In what sorts of circumstances might someone benefit 
from being a slave?” Aristotle says that a person whose 
soul is so disordered that it fails to guide his body might 
well be better off guided by someone else’s soul than 
left wholly unguided (Pol. 1254b16-20). Because it is 
difficult to see and therefore to know whether a soul is 
well ordered, evidence of a disordered soul is provided 
by the activities in which the person engages. The in- 
ference that a person’s soul is disordered is justified 
not when he acts, every now and then, “as most slaves 
act,” as this would give the status of slavery too much 
weight. Nor is there a biological standard for deter- 
mining when a soul is disordered: none, at least, in the 
sense of a necessary one. Rather, what might be called 
characteristically human activity itself provides a kind 
of internal standard sufficient to allow judgment about 
which activities and ways of living are more slavish than 
others. 

What is characteristically human activity? At the 
start of the Politics, Aristotle distinguishes human be- 
ings from all other natural beings on the ground that 
human beings alone possess logos, the capacity for ar- 
ticulate speech or reason (Pol. 1253a10). All human 
beings, by virtue of being human, possess this first- 
level capacity, including slaves (Pol. 1259b29). It is in 
virtue of logos that human beings make choices about 
the useful and harmful, the just and unjust, the good 
and the bad (Pol. 1253a14~-18), and it is characteris- 
tic of human beings that, with regard to these ethical 
and political matters, we act “according to thoughtful 
or deliberate choice” (Salkever 1990b, 195; 1991, 182; 
Saxonhouse 1985, 66), kata prohairesin (Pol. 1280a31- 
34). Choice, prohairesis, charts the course of a human 
life, It is the act of choosing one action instead of (or 
before, pro) another, namely, making a judgment about 
what to choose. It is, Aristotle says, the starting point or 
tule, arche, of action (NE 1113a4-9). As signaled by the 
prefix “pro,” prohairesis, in the Greek understanding, 
has an embedded character: The choices that initiate 
the actions people undertake are determined by their 
habits, which reflect who they have been and therefore 
who they are.” Prohairetic activity, combining desire 
and intelligence (NE 1139b5-7), is characteristically 





2) Kraut (2002, 295-301) offers an excellent account of “Why Slavery 
Benefits Slaves,” which also, however, undermines his insistence that 
what distinguishes slaves by nature is the complete absence of the 
capacity to acquire practical wisdom. Insofar as Kraut agrees that a 
slave can develop sufficently good habits and a sufficient measure of 
moderation to someday deserve his freedom (a postion Kraut also 
rightly attributes to Artstotle), and insofar as to Aristotle, there can 
be no moderation without practical wisdom, mdeed no virtue without 
practical wisdom, and vice versa (NE 1144b30—33), attributing to nat- 
ural slaves even a “modicum of virtue” is, eo ipso, to attribute to them 
practical wisdom and, thereby, to call into question the rmmutabtlity 
of their slavery. ' 

Z For this reason, Anstotle distinguishes acting by choice and acting 
voluntarily: In the absence of external constraint, as we will see, all 
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human activity insofar as it discloses the character, 
the soul, and, thereby, the nature of the one who acts, 
specifically by revealing the degree to which, in the ac- 
tions he undertakes, the actor is using the capacity for 
logos he possesses. 

The one who possesses the capacity for logos but 
consistently does not use it, engaging, instead, in ac- 
tivity that falls short of prohairetic activity, is a natural 
slave. Such a person can have no share in “a life based 
on choice” (Pol. 1280a34—35) but rather must have his 
choices made for him by someone who, in contrast, 
uses foresight to choose thoughtfully (Pol. 1252a32). 
Slavery thus benefits the person who consistently fails 
to engage in prohairetic activity by bringing that person 
into a relation that allows him to mirror or approxi- 
mate it. The deficiency of a natural slave is, then, his 
failure to actualize the first-level capacity for logos he 
possesses A natural slave thus lacks what Aristotle 
calls a second-level capability. To say this is not, how- 
ever, to ascribe to natural slaves an immutable nature in 
the sense that I have been challenging. A second-level 
capability is an actualization of a first-level capacity 
that comes about by virtue of the activity of its use (De 
Anima 1.4-5): I actualize my first-level capacity for 
logos through prohairetic activity. Similarly, a second- 
level incapacity results from a failure to actualize the 
first-level capacity: My consistent failure to exercise my 
capacity for logos produces my deliberative deficiency. 

Those who are deliberatively deficient owing to their 
consistent failure to use their Jogos are, for that rea- 
son, worthy of slavery and are, therefore, in Aristotle’s 
terms, natural slaves. In contrast, those who are pre- 
vented from using their Jogos owing to conquest or 
coercive institutions, or those whose capacity for logos 
is damaged from birth or incapacitated later in their 
lives (through no willing nonuse of their own), are 
made slaves by force or accident and are, therefore, to 
Aristotle, not natural slaves at all. Understood by way 
of prohairetic activity, nature thus distinguishes slaves 
from nonslaves but secures no absolute boundaries and 
offers no permanent foundations. Guided and deter- 
mined by activity, nature is changeable. It is for this 
reason that, when he discusses the capacity of nature, 
under its formal aspect—the soulto distinguish slaves 
from nonslaves, Aristotle’s statements are imprecise. It 








acts count as voluntary. That not all voluntary acts are, however, 
chosen (NE III 2), sets Arstotle’s understanding of choice apart from 
more voluntanst and cognitive conceptions. 
P On the importance of the relation between mimesis and logos for 
Anstotle, see Davis 1992 and Lear 2003, chap. 4. This ıs not to impose 
upon the master any obligation to teach the slave how to engage in 
prohairetie activity On this ıssue, I agree with Kraut (2002, 298-99). 
Kraut argues that what distmgurshes natural slaves from freemen is 
that the former can achieve only a low-level capacity for deliberation, 
a capacity that allows them to be skilled only at menial craftsmanship. 
Kraut cannot be arguing that this is a first-level incapacity (although 
he sometimes seems to, as when he argues that alaves “lack the faculty 
by which most people reason” from birth) because to have even 
a lowdevel capacity is to have a first-level capacity. If, ın contrast, 
he is arguing that natural slaves lack the second-level capability to 
actualize their first-level capacity, this can change over time. As a 
practical deficiency rather than an immutable one, a second-level 
incapacity may be activated through practice 


? 


American Political Science Review 


Vol. 98, No. 1 


Fae hr a ne nee Te 


is 80 as to preserve the prohairetic activity he takes to 
be characteristic and also constitutive of a distinctively 
human way of living that Aristotle is especially keen, in 
his more explicitly philosophical treatments of nature, 
to safeguard nature’s changeability. As I show next, this 
he does by guarding against the assimilation of nature 
to necessity and also to chance. 


THE NATURE OF NATURE 


To enquire whether being is single and 
unchanging is no part of an enquiry into 
nature. 


Aristotle, Physics 184b25-185a1 


Between Necessity and Chance 


Aristotle understands the natural as what happens usu- 
ally and for the most part, epi to polu. What happens 
usually and for the most part is a “modal” middle be- 
tween what is always and what is rare (Frede 1992). 
What is always corresponds to what is by necessity, and 
what is rare to what happens by accident. There is much 
to learn about Aristotle’s understanding of nature from 
this tripartite division. In an Aristotelian fashion, I be- 
gin my investigation of what is distinctive about the 
natural by looking first at what he counterposes to na- 
ture: the necessary and the accidental. 

The primary signification of the necessary, ana- 
gkaion, or the sense from which “all others are some- 
how derived,” is “that which cannot be otherwise” 
(Meta. 1015a34—b1). The necessary also includes the 
compulsory or forced, that which is opposed to im- 
pulse or purpose (Meta. 1015a27-28); and what is 
true by demonstration, the first principles of knowl- 
edge (Meta. 1015b7). The category of the necessary in- 
cludes a range of significations across different fields 
of inquiry—ontology, epistemology, ethics—held to- 
gether by a kind of family resemblance.» What these 
significations share may be explored by looking at Aris- 
totle’s epistemological and ethical treatments of the 
necessary in NE VI3 and NE 111.1, respectively. Dis- 
cussing the intellectual virtue of scientific knowledge, 
episteme, Aristotle says it studies what is eternal, un- 
generated, and imperishable (NE 1139b24~25), num- 
bers or figures, for example. Always and invariable, 
they are necessary in that they are out-of-time and, 
hence, without motion. As form without matter, num- 
bers and figures may be precisely and scientifically stud- 
ied by the intellectual virtues of science, episterne, and 
philosophic wisdom, theoria. Although they may be 
studied by human beings, what is necessary or always is 
independent of human being. This feature is present as 
well in Aristotle’s understanding of necessity as com- 
pulsion or force: “Actions are forced when the cause is 
in the external circumstances and the agent contributes 


nothing” (NE 1110b1ff). 


B For other sgnifications that belong here, see Metaphysics, 
1015a20—23 


The necessary, understood as that which cannot be 
otherwise in its ontological, epistemological, and ethi- 
cal senses, shares a kinship with the past. What is past, 
Aristotle says, is not capable of not having taken place 
(NE 1139b7-9). Once past, what has happened cannot 
be otherwise. People may study the past, but owing to 
its invariability, no one deliberates about the past (NE 
1139b7-9). Likewise, no one deliberates about eternal 
things, for these cannot be brought about by our own 
efforts (NE 1112a20ff.). The past, like the first prin- 
ciples of knowledge, and like force or compulsion, is 
independent of human being; human agency cannot 
change it (NE 1140a32-34). 

Counterpoised to the necessary and flanking the nat- 
ural on the other side is the accidental or the rare. 
“Accident” is what applies to something “but neither 
necessarily or usually” (Meta. 1025a15). It is what can 
always be otherwise and so is never, at least not in the 
way the invariable is. The accidental is contingent. It 
is what Aristotle associates with chance, tuche, which 
he calls the indefinite, aorista, and cites as the cause of 
accidents (Meta. 102525). If what is necessary can be 
studied precisely and scientifically, what happens acci- 
dentally or by chance cannot be studied at all. Inexpli- 
cable, indeterminate, and random, chance or accidental 
events have no account of their own (Physics 197a18- 
19). If there is a kinship between the necessary and the 
past, there is one as well between the accidental and the 
future: The accidental is the always possible, what lies 
uncertainly ahead. Although in most ways unlike the 
necessary, the accidental and the necessary have one 
thing in common: What happens by accident, like what 
happens necessarily, is independent of human agency. 

Between “necessity and contingency,” between what 
is always and what is never, “between past and 
future,”” lies what is by nature: What happens usually 
and for the most part. Unlike that which can never be 
otherwise and unlike that which can always be other- 
wise, that which is by nature is both variable and sta- 
ble: What is by nature, Aristotle says in the Physics, 
has within itself a principle of change and resistence 
to change (192b13-14). Neither motionless nor per- 
petually in motion, natural beings are. They can also 
be otherwise. Owing to their relative stability, what is 
by nature, unlike what is by accident, can be studied. 
Owing to their relative variability, studies of what is by 
nature, unlike studies of the necessary, will be impre- 
cise. It is because their possibilities may be actualized in 
any number of unpredictable ways that only by looking 
at what natural beings do can anything be known about 
what they are (Salkever 1990a, chap. 3). Claims about 
the identity of natural beings will, therefore, be claims 
about their activities. This is why, in his inquiries into the 
natures of citizens and slaves, as we have seen, Aristotle 
rejects status claims and focuses instead on what they 
do. It is also why, in his accounts of the natures of citi- 
zens and slaves, Aristotle, as we saw, is often imprecise. 





26 The phrases, “between necessity and con and “between 
past and future,” belong to Hampshire (2000, 30) and Arendt (1961), 
respectively. For both, as for Aristotle, these phrases refer to the 
domain of prohasretic, that 1s, ethical and political, acnon 
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The imprecision arises because the nature of a natu- 
ral being will change should its characteristic activities 
change. There is thus an iterative quality to Aristotle’s 
studies of natural beings that accommodates the revi- 
sion of his own conclusions in cases of change (Lear 
1988, 43-54; Salkever 1990, chap. 1).7 

Even if the identity of a natural being is given by its 
characteristic activities, and is therefore variable, most 
commentators claim that Aristotle is, nonetheless, com- 
mitted to there being something necessary about na- 
ture, something that somehow lurks behind or beyond 
a natural being’s characteristic activities to disclose its 
true identity (Kraut 2002; Swanson 1999). As we have 
seen, this is not true of those whom Aristotle calls natu- 
ral slaves. It is, indeed, not true of any natural beings, for 
to claim a necessity to nature, especially in the case of 
human nature, is to undermine precisely what Aristotle 
aims to preserve, namely, the characteristically human 
activities of prohairesis, and, relatedly, of accountability 
and responsibility. To demonstrate this, I turn next to an 
exploration of the boundaries of Aristotle’s category of 
the natural. 

Aristotle often includes in the category of the natu- 
ral features that seem to belong more properly to the 
categories of the necessary and the accidental. Aris- 
totle sometimes uses the phrase “what is always or 
for the most part” instead of “what is usually or for 
the most part” to refer to what is by nature (replacing 
“usually” with “always,” the term he associates with 
the necessary) (Physics 199b15~18). Other times he 
seems to hold up as natural examples that are more 
rare than usual. Some commentators take these to be 
signs that Aristotle’s account of the natural is equivocal 
or that he is ‘adjusting it to fit circumstances that do 
not readily mold to his principles (Annas 1993, 146; 
1996; Irwin 1985, 416-17). I disagree. Starting first with 
the apparent elision of the distinction between nature 
and necessity, and looking next and more briefly at the 
relation between the natural and the accidental (be- 
cause the tendency is to read nature as necessity not as 
chance), I argue that although Aristotle takes neces- 
sity or force and chance or accident to affect natural 
beings—indeed, he sees natural beings as singularly 
vulnerable to both—neither has a part, and they must 
have no part, in the definition of what is by nature. 
We have seen'this already in Aristotle’s exclusion from 
considerationof force and accident/chance in his inves- 
tigations of the nature of citizens and slaves. We must 
now look more carefully at what underlies Aristotle’s 
exclusion of force and chance and his inclusion of what 
I call the “as if” necessary and accidental. 

The nature, of natural beings is discerned, we have 
seen, by looking at their activities. Although this is not 
to say that nature is determined by activity alone, these 





77 Lear (1988, 45) argues that Anstotle’s scientific method can re- 
vise its own conclusions. This 1s true as well about his psychological, 
ethical, and pohtical methods. 

78 As in the case of certain forms of money-making he discusses in 
Politics, 18-10, and calls natural but that are, in reality, extremely 
rare, Annas (1996, 733) treats this example as evidence of Aristo- 
tle’s inconsistent use of nature and of his illegitimate ehsion of the 
distinction between the natural and the ideal. 
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activities do form the patterns of our lives. Because 
natural beings usually act thus and so, it is not hard 
to imagine patterns of activity becoming so ingrained’ 
as to be justifiably treated as predictable and precise, 
that is, as characteristic: A human being who has acted 
courageously when confronted with danger will always 
do so, we might say, for she is that sort of person. We 
often speak in this way, and it might be said that the 
stability and security of our daily lives depend on the 
trust in the world presupposed by this way of speaking. 
We think, speak, and act, in other words, as if usual 
patterns of activity were compelled, as if they were 
necessary. Aristotle’s definition of “the usual” in the 
Physics makes this plain: What is by nature “always 
[tends] toward the same end, unless something inter- 
venes” (199b18). Although we think, speak, and act as 
if the usual were necessary, we do so knowing that it 
is always possible that things will turn out otherwise. 
The courageous person may meet a danger she cannot 
face down. We speak of “the always” in the case of 
human beings and all natural beings only as if it were 
necessary. This is as it should be. Human beings are 
stable insofar as we have within ourselves a principle 
of staying the same or rest. Insofar as we have within 
ourselves a principle of change, we are also changeable 
and in constant interaction with our circumstances. In 
different circumstances, human beings act differently. 
When nature is understood not in terms of stability but 
in terms of necessity, invariability controls beings who, 
for Aristotle at least, are defined as well by movement 
and possibility (Arnhart 1998, chap. 9). 

Aristotle’s vigilance against any more than an “as 
if” assimilation of nature to necessity is on display in 
his discussion of responsibility in Nicomachean Ethics, 
II.1, where nature and necessity are brought into a con- 
frontation. As noted earlier, Aristotle takes actions to 
be compelled when a person contributes nothing to the 
action; the cause of action is rather in external circum- 
stances. Only two kinds of situations meet Aristotle’s 
definition of force or compulsion, so understood: when 
an agent acts without knowledge of the circumstances 
of action (and his ignorance is honest or innocent); 
or when a third party physically effects the action in 
the agent’s place by, to use an anachronistic example, 
putting his hand over the agent’s when she is holding 
a gun and using his finger over hers to pull the trigger 
despite her efforts to resist. For all other actions, the 
agent, in Aristotle’s view, is responsible, for, as he puts 
it, the origin of action is in the agent (NE 1111023). In 
comparison to modern legal definitions, Aristotle’s ac- 
count of force is exceedingly limited: It does not include 
actions taken under conditions of duress, debilitating 
drunkenness, pent-up rage, and the like, when these 
conditions are brought about by the agent himself. For 
Aristotle, an agent’s actions are forced only when he is 
effectively prevented from acting voluntarily. 

If Aristotle’s understanding of force is narrow, his 
account of responsibility is remarkably expansive.” As 





® He does not consider duress, for example, as an excuse for bad 
action He does, however, advocate taking justifications and other 
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the following passage from the Nicomachean Ethics in- 
dicates, people are responsible for all of their voluntary 
actions, virtuous as well as vicious: 


Virtue is up to us. And so also is vice. For where we are free 
to act we are also free to refrain from acting, and where 
we are able to say No we are also able to say Yes. If it is up 
to us to act when doing a thing is good or noble, not acting 
will be up to us when acting would be shameful or wrong; 
and, if not acting when inaction is good is up to us, so, too, 
acting when action is shameful is up to us. But if it is in 
our power to refrain from doing right and wrong, and if... 
being good or bad is doing right or wrong, it consequently 
depends on us whether we are good or bad (1113b6-14 
trans. modified). 


Prohairetic activity is at stake in Aristotle’s vigilance 
against assimilating responsibility to force. Prohairetic 
activity is also at stake in his vigilance against assimi- 
lating nature to necessity. In the Nicomachean Ethics, 
necessity or compulsion—as that to which the agent 
contributes nothing, because the cause of action lies 
altogether outside the agent—stands opposed to pro- 
hairetic agency as that which has within itself its own 
principle, arche, of action. Absence of force or necessity 
is a necessary, although not a sufficient, condition of 
choice, prohairesis. In the Metaphysics, the Physics, and 
elsewhere, force or necessity stands opposed to what is 
by nature as that which has within itself its own princi- 
ple of motion and rest. Eliding the distinction between 
nature and necessity makes virtue no more voluntary 
than vice, a position Aristotle rejects (NE 1113b14). 
When nature and necessity are paired, prohairetic ac- 
tivity, responsibility, and self-determination disappear. 
What is by nature, then, may reach toward necessity 
but it must remain distinct from the necessary so as to 
preserve the activity characteristic of human beings. 

What is by nature also reaches toward but remains 
distinct from the accidental. Aristotle may recognize 
the role of chance or luck in the lives of natural be- 
ings and the ameliorative effects of contingent external 
goods throughout the Nicomachean Ethics and Politics 
(Nussbaum 1986, chap. 11). As with necessity, however, 
Aristotle refuses to understand the identity of natural 
beings in terms of chance, luck, or accident. The stakes 
in keeping nature and chance distinct are no different 
from what they were in keeping distinct nature and 
necessity. Understanding the nature of natural beings 
in terms of chance, like understanding it in terms of 
necessity, makes prohairetic activity and responsibility 
irrelevant. 

If, on the side of the necessary, Aristotle enfolds into 
the category of the natural what I called the “as if” 
necessary, on the side of the accidental, he enfolds into 
the category of the natural, the “as if” accidental, what, 
in the Nicomachean Ethics, he calls art, techne. Art is 
concerned neither with things that are or come into 
being by necessity nor with things that do so in accor- 
dance with nature. Art is rather “concerned with the 
same objects” as chance (NE 1140a14~-20). Both are 





mitigating factors into account when determiming how to respond to 
actions taken under difficult conditions (Rhet. 1374b13-16). 


concerned with possibility, with how something may 
come into being which is capable of being or not being 
(NE 1140a13). Moreover, in art, as in chance, the fin- 
ished work is not completely governed by the activity 
of producing (NE 1140a18). 

“Art loves chance and chance loves art” says Aristo- 
tle, quoting Agathon approvingly (NE 1140a20). But 
art is only “as if” accidental. Unlike chance, where 
the cause is altogether indeterminate, in art, it is the 
blueprint in the soul of the maker, eidos, that is the 
cause of action (Meta. 1032b22ff.). There is, in other 
words, prohairetic agency in art but not in chance. This 
is not to deny the key difference Aristotle is at pains 
to underscore between the making of art and the doing 
that belongs to activity proper: In art, he says, the end is 
outside the activity of making; whereas doing is activity 
that has within itself its own end (NE 1140b4-6). Even 
if the end product in art is outside the agent’s control, 
nonetheless, making, like doing, but unlike chance or 
necessity, crucially involves activity and responsibility. 

In keeping distinct what is by nature from necessity 
or force and also from chance or accident, as Aristotle 
does in his discussions of the nature of citizens and 
slaves and in his account of the nature of nature itself, 
Aristotle preserves the prohairetic activity that char- 
acterizes and distinguishes human nature, In including 
the “as if” necessary and the “as if” accidental in his 
account of the natural, Aristotle reveals the expansive- 
ness of his conception of prohairetic activity. It is pol- 
itics, itself an art and so a product of human activity, 
that produces the institutions that help make citizens 
and slaves. Politics is the art that Aristotle takes to be 
integral to any understanding of the practices of human 
beings and their natures (NE 1094a27-1094b11). It is 
because what is by nature is, in Aristotle’s view, defined 
by the practice and effects of activity under all three 
aspects, self-determining activity, guided making of cit- 
izens by institutions, and making by citizens of institu- 
tions, that he can, without inconsistency, treat human 
beings and, indeed, the polity itself as both natural and 
made (Pol. 1253a19-31, 1252b30££). 


The Power of Activity 


Is human nature all and only activity? Aristotle’s an- 
swer seems to be “yes”: yes, in that, as we have seen, 
there is nothing necessary lurking behind activity; yes, 
in that natural beings are distinguished by their activi- 
ties; yes, in that even the stability characteristic of hu- 
man nature is based on activity. Aristotle insists, how- 
ever, that activity alone cannot produce the movement 
and change or stability that characterize human nature. 
Activity alone does not account for the nature of natu- 
ral beings because the nature of a natural being is not 
simply a description of what it tends to do. Activity, 
we have noted, also sets a standard, an internal and de- 
manding one. To see how requires attending not simply 
to activity but also to what gives rise to activity in the 
first place, the conditions of its possibility. 

Aristotle calls that which makes activity possible 
dunamis, which is translated variously as capacity, 
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power, capability, potentiality. Dunamis is, Aristotle 
explains, the capacity or power activity has to regu- 
late itself. He analyzes this relation between capabil- 
ity, dunamis, and activity, energeia, in his discussion 
of the Megarians in Metaphysics, [X.3.% The Megar- 
ians, Aristotle recounts, say “that a man who is not 
building cannot build, but only the man who is build- 
ing, and at the moment when he is building” (Meta. 
1046b30ff). This means that it is only when a dunamis 
(or capability) is actually at work that the ability to 
do something is present. When it is not at work, the 
dunamis, as a capacity, is only potential and, therefore, 
absent. For the Megarians, as for some contemporary 
post-Nietzscheans, this means, that activities emerge 
ex nihilo. 

Aristotle thinks that this account of activity is absurd. 
When dunamis is treated as only present when it is in 
action, he argues, there can be no change or movement 
at all (Meta. 1047a15). Change or movement must hap- 
pen from one thing to another. It may be true that to be 
capable means to have a dunamis, and that not having 
the dunamis means not being capable, but, Aristotle 
insists, dunarnis has its own energeia or activity. The ac- 
tivity or actuality of dunamis (the activity or actuality of 
capacity) lies in its being possessed even when it is not 
at work. The builder can have the capability to build, 
this means, even when he is not actually building. Not 
building, then, does not necessarily signal the absence 
of dunamis, though it can, as when, for example, the 
builder loses the capacity to build through bad luck (he 
loses his hands, say) or he forgets how to build owing 
to the passage of time. Under these conditions of ac- 
cident or nonuse, there can be no building activity at 
all. Where thére is activity or energeia, it emerges not 
from something only potential, i.e., absent, but from 
dunamis understood, in Heidegger’s (1995) words, as 
“the withdrawal into itself of the capability such that 
it is primed for release, i.e., primed for activity” (158). 
Dunamis, as “the source, arche, of change in some other 
thing, or in the same thing qua other” (Meta. 1046a11, 
1019a19-21), is the “power” of activity: It is what makes 
activity possible. As we saw earlier, without the capacity 
for logos, there could be no activity of its use. 

If dunamis powers activity, this does not make ac- 
tivity itself any less important. Dunamis may be that 
from out of which change occurs, but it is not some- 
thing inert, waiting to move to action. Rather, it effects 
change by way of its actualization, by doing its work, 
through practice. It is by performing the activities for 
which it is holding itself in readiness that a dunamis 
becomes capable in the first place. It is, in other words, 
by building that a builder becomes capable of building 
(Meta. 1046b34—36). The activity of building actualizes 
the builder’s capability to build. One is a builder in the 
way one is a courageous person or a deliberative per- 
son, for example, that is, only as long as the disposition 
to build or to act courageously or prohairetically shows 


% Because I am interested m what Aristotle’s analysis teaches about 
his understanding of activity, I leave to one side whether Amstotle’s 
description of the Megarian position is fair. My analysis here draws 
on Heidegger (1995, chap. 3). 
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itself from time to time in the relevant activity. Along 
similar lines, Aristotle remarks that “distance does not 
break off friendship absolutely, only the activity of it. 
But if the absence is lasting, it seems actually to make 
men forget their friendship” (NE 1157b6-13). 

Human nature is, then, not determined all and only 
by activity on Aristotle’s understanding, for activities 
come out of capabilities. There are, so to say, doers 
behind deeds. Who the doer is, the nature of the doer, 
is stable. This is not to say, however, that nature is once 
and for all determined. Rather, who the doer is, his 
nature, is continually informed by the activities he has 
performed and continues to perform. This means that if 
a deed or action is the product of activity, it is not activ- 
ity’s sole product. For even as activities emerge out of 
a stable character, activities themselves are formative 
of character. The more courageously I act, for example, 
the more courageous I become. There can be no activity 
without capability, but there can also be no capability 
without activity. Each depends on the other. It is this 
interdependence between energeia and dunamis that 
makes possible the changes over time and movement 
that define the nature of human beings and also their 
stability. This interdependence also defines soul, the 
part of natural beings that contains their principle and 
source of motion. If the ordering or constitution of soul 
and of human nature itself is given by the interdepen- 
dence between energeia and dunamis, and is change- 
able, it is nonetheless possible, at any given moment, 
to distinguish among those who are and those who are 
not actualizing their potential. This distinction, as we 
have seen, rests on the practice of prohairetic activity. 

Some commentators have taken Aristotle’s defini- 
tion of nature in Metaphysics V.4 to be equivocal. They 
say that he defines nature in two senses and they criti- 
cize him for sometimes favoring one and sometimes the 
other (Annas 1993, 146; Irwin 1985, 416-17).*! As “the 
source of motion in natural beings, which is somehow 
inherent in them, either potentially, dunarnei, or actu- 
ally, entelecheia” (Meta. 1015a18-19), Aristotle’s defi- 
nition of nature in the Metaphysics involves duality, to 
be sure. In one sense, he says, nature is the primary 
stuff—matter—and, in another sense, it is form. In one 
sense, it is the immanent thing from which a growing 
thing first begins to grow, dunarnis; in another sense, it 
is the genesis of growing things, their activity, energeia. 
He takes form, or energeia, to be primary and guiding 
(Physics 193b17) (Lear 1988, chap. 2; Nussbaum and 
Putnam 1992), and he also understands form to take 
its guiding orientation from matter, much in the way 
the shape of a statue may appear to its sculptor from 


31 Annas (1996, 735 n.12; 1993, 146) understands Anstotle here to 
be “adding to” his Physics’ account of nature as the internal source 
of change “the point that a thing’s nature is both the matter from 
which the change begms and also the substance or form which is the 
telos of the completed change.” Calling the matter from which the 
change begins “mere nature” and the form or telos of the completed 
change the “strong sense of nature.” Annas disaggregates what, for 
Aristotle, coconstitutes natural beings—matter and form or dunarnus 
and energeia (De Anima 412a10). Irwin (1985, 416-17) makes the 
same mistake. Arnhart (1998, 36-39), in contrast, treats nature as 
“both origmal potential and developed potential.” 
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out of the clay. But this is not to equivocate. For in the 
Metaphysics, as in the Physics, the Nicomachean Ethics, 
and also the Politics, where Aristotle both establishes 
the necessity of hierarchy or teleology (among species 
and human beings in Politics I and among regimes in 
the rest of the Politics) and also alerts us to its dangers, 
the sense of nature that Aristotle takes to be primary 
and guiding is nature neither as an origin nor as an 
end separable from growth.** It is both, and it is cap- 
tured in the process of growth itself. A natural being 
becomes and reveals its nature as it grows, changes, and 
moves through time. Nature, understood as an end and 
a beginning all at once, is an ongoing process.* As the 
domain of activity, it is also, and crucially for Aristotle, 
the domain of ethics and politics (NE 1103b26-30). 


THE NATURE OF FOREIGNERS 


If, to most readers, Aristotle takes nature to be im- 
mutable, necessary, and prepolitical, I have argued, in 
contrast, that, to Aristotle, human nature is constituted, 
in large part, by the practice and effects of prohairetic 
activity. Nature, so understood, has both a more limited 
and a more expansive role than is usually granted to 
it by Aristotle’s readers. It has a more limited role in 
that it is not necessitarian; changeable as well as stable, 
and shaped by the arts and practices of politics, nature 
can establish no permanent and secure hierarchies or 
foundations. It has a more expansive role in that human 
nature, understood via the practices and effects of pro- 
hairetic activity, makes human beings responsible and 
accountable for the hierarchies they create and sustain. 
This not only is the case within a given polity but also 
extends to the boundaries and hierarchies among poli- 
ties. To see this, I turn, by way of conclusion, to what 
might be called the hardest part of the hard case for 
my account of nature, Aristotle’s treatment of certain 
foreigners as natural slaves. 

Aristotle is usually read as treating certain foreigners 
as justly enslaved based on an immutable inferiority 
he is said to associate with those non-Greeks (Kraut 
2002, 290-95). If this is right, then my account of human 
nature in terms of activity, even if true about Greeks, 
would falter in the face of Aristotle’s xenophobia. Its 
relevance would also become questionable, as most 


32 Notice that in the case of what is by nature, form may be the telos 
of natural beings but form is not itself static. Itself defined ın terms of 
entelecheia and energeia, the form of natural beings, as thear telos, 1s no 
less (and no more) kinetic than actrvity itself For illuminating discus- 
sions of Aristotle’s self-consciously complex treatment of teleology 
in Politics J, showing how he both establishes and problematizes any 
straightforward claims about teleology, see Salkever 1990, chap. 1; 
Davis 1996, chap. 1; and Nichols 1992, chap. 1 

3 Contra Villa (1996, 42-52), according to whom Aristotle’s tele- 
ology robs action of its imtiatory power and gives action a prede- 
termined “authoritarian” future or end. Insofar as the telos is itself 
kinetic, it may be embedded in a past and oriented by a set of capa- 
bilities, yet predetermines nothing. The telos of a bonsai sapling will 
be appropriate to ıt and different from the telos of an acorn. And the 
telos of a particular bonsai sapling will not be the same as the telos 
of another. The same is true of people: The telos of one human beng 
will not be the same as that of another. Both may aim at excellence, 
but what excellence amounts to in each will differ. 


slaves in Athens were non-Greeks. However, rather 
than posing a challenge to the account I have devel- 
oped, the passages in the Politics on foreigners confirm 
it. Aristotle’s distinction between Greeks and certain 
non-Greeks, it turns out, rests not on nature as some- 
thing immutable, not on his conviction that Greeks 
were superior to foreigners, but on his observations 
about the (political and nonpolitical) behaviors of those 
foreigners. 

In Politics 1, Aristotle maintains that “among foreign- 
ers no distinction is made between women and slaves, 
because there is no natural ruler among them: they are 
a community of slaves, male and female.” Immediately 
following this statement, he quotes “the poets” as say- 
ing, “‘It is meet that Hellenes should rule over non- 
Greeks’; as if they thought that the foreigner and the 
slave were by nature one” (Pol. 1252b5-9). Aristotle 
first reports what he sees among foreigners and then 
quotes the words of the poets, who proclaim the justice 
of Greek rule over non-Greeks on the ground that for- 
eigners are natural slaves. If Aristotle’s observations 
are accurate, and the foreigners to whom he refers do, 
indeed, act as a community of slaves, then, in the terms 
of the analysis offered so far, he is justified in calling 
them natural slaves in virtue of that behavior. In light 
of the fact that Aristotle puts the identification of (all) 
foreigners as natural slaves into the mouths of “the po- 
ets,” however, it is not clear, in this passage at least, 
whether he would himself endorse this identification 
(Ambler 1987, 393).¥ 

Also in Politics I, Aristotle says, “It must be admit- 
ted that some are slaves everywhere, others nowhere” 
(Pol. 1255a31-32). Aristotle is usually read as main- 
taining that there are some, namely, certain foreigners, 
who (because they are natural slaves) are slaves every- 
where, and that there are others, namely, Greeks, who 
(because they are naturally free) are slaves nowhere. 
The rest of the passage, however, suggests a different 
reading. Aristotle is exploring the question of whether 
the enslavement of foreigners conquered in war is just. 
His answer, as we have seen, is that conquest, as a mode 
of force, cannot justify slavery. What can? Aristotle 
answers that worthiness determines one’s qualifica- 
tion for slavery (Pol. 1255a25~-26, also 1255b21-23). 
When, just after this, Aristotle says “It must be admit- 
ted that some are slaves everywhere, others nowhere,” 
he should be read as saying that those who are igno- 
ble are slaves everywhere, and those who are good, 
nowhere (Saxonhouse 1985, 70-71). Aristotle notes 
that there is a tendency among Greeks to regard for- 
eigners as ignoble and themselves as good and, there- 
fore, to treat foreigners as justly enslaved. In response, 
Aristotle reiterates that the proper determimant with 
regard to slavery is not foreignness but worthiness or 


M For discussion, see Davis (1996, 17), who argues that Aristotle in- 
vokes the passage from Euripides with knowledge ofits context to call 
into question any too-casy opposition between foreigners as natural 
slaves and Greeks as naturally free: “Iphigencia, who is speaking, is 
about to be sacrificed by her father, Agamemnon to propitiate the 
gods so that the Greeks can continue their expedition against Troy. 
Is this less barbaric than treating women as slaves? Iphigeneia is a 
living instrument used for the sake of an action.” 
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character (Pol. 1255b1), itself a function of activity (NE 
I.1-2). Character, Aristotle continues, will not justify 
slavery in perpetuity: Whereas nature intends that from 
good men a'good man will spring (and from a slave 
will spring a slave), this desire is often thwarted (Pol. 
1255b3). A person’s character can therefore justify only 
his own enslavement, not that of his children. 

Toward the end of the Politics, Aristotle uses spirit, 
thumos, the source of the love of freedom and the 
power of command (Pol. 1328a1-8), to distinguish 
free from unfree, calling Europeans comparatively free 
and Asians natural slaves (Pol. 1327b25-29). Aristotle 
frames this discussion by referring to meteorological 
conditions: Europe is cold and Asia is hot. Aristotle’s 
references to climate suggest right off that his distinc- 
tion between free and unfree rests on something other 
than a fixture of foreign psychology. Aristotle seems, 
rather, to be saying something like, “Where it is often 
extremely hot, people act listlessly or without spirit.” 
To say this is'not to announce a necessary, immutable 
feature about the Asian soul, which, like any soul, is 
unobservable and hard to speculate about. Aristotle, 
instead, calls Asians natural slaves based on what he 
sees as their ‘apparent lethargy, which is to say, their 
tendency to forget how to act on their own initiative, 
or “inactivity” (OED, 963). 

Discussing 'the fact that certain monarchies among 
foreigners sometimes resemble tyrannies, Aristotle 
claims that “such kingships have the nature of tyran- 
nies because the people are by nature slaves” (Pol. 
1285a22~23). Aristotle goes on to say that these tyran- 
nies are in no danger of being overthrown because, 
unlike other tyrannies, they are hereditary and legal, 
legal in that the subjects acquiesce voluntarily in the 
tyrannical rule (Pol. 1285a25—29). It is possible to read 
Aristotle, in this passage, as ascribing to certain foreign- 
ers, in this instance, Asians once again, an immutable 
inferiority that explains their willing acquiescence in, 
and responsibility for, the despotic regime that governs 
them. The rest of the passage suggests another possi- 
bility. Aristotle stresses that the tyrannies of Asia are 
not only legal‘but hereditary. In the same discussion, he 
counterposes these hereditary tyrannies to the elective 
tyrannies that, from time to time, governed the ancient 
Greeks (Pol. 1285a30-33). If an immutable inferiority 
is to be held responsible for the tyrannies governing the 
Asians, then Aristotle would have to conclude that the 
ancient Greeks, who were also governed by tyrannies, 
were similarly inferior. Further, if the ancient Greeks 
were immutably inferior, then it would follow that 
Aristotle’s contemporary Greeks were too, because, 
when nature is understood in terms of necessity, to 





3 Unlike Hippocrates, for example, Aristotle does not, to my knowl- 
edge, mvoke residency in the mother’s womb or the quality or nature 
of the mother’s conception m his account of the effects of climate on 
human psychology. In support of my argument about the key role 
of human activity in human nature, it 1s to be noted that although 
Anmstotle has a lot to say about the mechanics of reproduction, he 
does not seem to draw any clear causal lines between the biology of 
birth and reproduction, on the one hand, and the psychology of the 
born person. 
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be immutably inferior at one point in time is to be so 
always. 

Aristotle does not, of course, ascribe to Greeks the 
status of natural slaves and this suggests that explaining 
regime type by reference to an immutable inferiority is 
not his purpose. By focusing on regime, specifically, on 
the difference between the forms of the tyrannies gov- 
erning Asians (hereditary) and those governing Greeks 
(elective), Aristotle seems rather to imply that human 
nature is as much a product of the regime under which 
one lives as it is a regime’s cause. Insofar as they have 
long been habituated to living under tyrannies and 
acting according to the habits fostered by tyrannies, 
Asians are naturally slavish and so acquiesce in and, 
thereby, reproduce the regime that produced them. In 
contrast, it is because Greeks experienced tyrannies 
only sporadically, if willingly, that they did not become 
habituated to slavish behavior and so cannot be called 
natural slaves. This is not to say, however, that they 
cannot become natural slaves, and that seems to be at 
least part of Aristotle’s point in alluding to both Greeks 
and Asians as being governed by tyrannies, albeit in 
different forms. 

The passages on foreigners in the Politics, then, like 
Aristotle’s early discussion of natural slavery, reinforce 
the idea that human nature is changeable. Vulnerable 
to, and shaped by, both politics and self-determining ac- 
tivity, human nature cannot be fixed and can never func- 
tion as the determining ground for political hierarchy. 
In keeping with the lessons of Politics I, the passages 
on foreigners reinforce the ways in which Aristotle’s 
defense of natural slavery at the same time serves as a 
warning about the dangers slavery poses to politics. In 
both settings, Aristotle’s intent is didactic: Greeks can 
become slaves if they act like slaves; they have no free, 
rational nature to guarantee against that. 

If nature offers no guarantee against slavery, it also 
offers no guarantee against despotism. Absence of thu- 
mos may be a sign of slavishness in Asians, but Aris- 
totle also warns that thumos, the love of freedom and 
power of command that makes free politics possible, 
can, like the art of mastery, orient its possessors toward 
despotism (Pol. 1324b19~26), thus rendering them no 
less unfree than those with no thumos at all. It is only 
by avoiding activities that may lead (back) down the 
paths to slavery and despotism that Athens’ citizens 
and rulers can navigate between these twin dangers to 
freedom. Doing so requires vigilance in foreign rela- 
tions, in the political relations of one’s own polity, and 
even, or perhaps, especially. in domestic life. There, in 
the realm of domesticity, hierarchy can often seem most 
given and, therefore, most secure. This sense of secu- 
rity is, however, false. The hierarchies of the household 
and polity are reversible. Aristotle offers not the guar- 
antees of immutability but rather a call to ethics and 
politics understood as perpetual and ongoing activities 
of boundary-setting and keeping. That hierarchies are 
natural and made, necessary and dangerous (dangerous 
not only to those at the bottom but also to those who 
are [for now] at the top), is a lesson no less important 
for contemporary politics than it was for fourth-century 
democratic Athens. 
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Between Eros and Will to Power: Rousseau and “The Desire to 


Extend Our Being” 
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desire to extend our being.” Yet the concept is foundational to his political philosophy. Though 


Me scholars, following Rousseau himself, have had little to say about what Rousseau calls “the 


less compelling a formulation (and deliberately so) than Plato’s “eros” or Nietzsche's “will 
to power,” “the desire to extend our being” is comparable to these in its strength and reach, and hence 
in its power to unlock the full meaning of Rousseau’s thought. My purpose in this paper is to uncover 
Rousseau’s view of the meaning and significance of this desire; its pervasive influence on moral and 
political life; and the means through which it can and cannot, and should and should not, be fulfilled. 


e all want what is good. But what is good? 

Or, to begin with only slightly less ambitious 

a question, How can we discover the good? 
Classical philosophy taught that the route to know- 
ledge of the good must begin with what is widely be- 
lieved to be good: The good is the desirable, and the 
desirable either is, or is somehow suggested by, what 
we actually desire. In neither case is the inquiry an easy 
one, if only because we desire many mutually exclu- 
sive things. If the good is suggested by or even if it is 
included among what we actually already desire, we 
still must discover the principle by which to discern the 
“valid” aspect of our desires from those parts that, for 
reasons of accident or ignorance or misapprehension, 
do not point toward the truly desirable. Yet however 
difficult, there is a coherence and an intuitive sense to 
the philosophic enterprise thus conceived. The good 
is the desirable and the desirable is to be determined 
through an examination of our actual desires. Mod- 
ern and postmodern philosophy have not always main- 
tained this approach to discovering the good. Many, 
following Kant, have separated the moral good from 
considerations of desire and happiness, arguing that 
even if virtue does contribute to happiness, its demands 
must be determined and met without any regard for 
one’s own well-being. Far from representing the ful- 
fillment of desire, Kantian autonomy (i.e., adherence 
to the moral law) consists precisely in subordinating 
desire to duty. 

Yet no one, not even those who deny that the good is 
somehow contained in our desires, would shrink from 
calling the good desirable. This is important: To call 
something desirable is to hold that it either satisfies or 
would satisfy some desire; and to call something most 
desirable (e.g., Kantian autonomy) is to hold that it 
would fulfill the highest or worthiest (though not nec- 
essarily the deepest or most powerful) desire (e.g., the 
desire for freedom and dignity). My point is simply that 
all serious moral and political thought, whether it would 
like to or not, is made by its own terms to concern itself 
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with the factual question of desire. If even the least 
naturalistic, most deontological view of the good has 
a vital link to desire, then so does all moral and po- 
litical thought.! The good, as the desirable, is fulfilling 
either of our deepest desire (Plato) or our highest desire 
(Kant, and also Plato). What, then, are our deepest and 
our highest desires? 

Rousseau has an answer worth considering. The 
structure of his moral and political thought is natural- 
istic in several ways and, thus, is more akin to Plato’s 
than to Kant’s. First, he avows that things are good 
only if and to the extent that they fulfill our natural 
desires. If men had the natures of wolves, he pointedly 
suggests, the man who resisted preying on his fellows 
would be depraved (Rousseau 1979, 287). Second, for 
all that his prescriptive political thought seems to es- 
chew natural desire—Rousseau goes so far as to assert 
that the best political institutions are those that most 
denature man—that departure from nature refers to 
the means by which society can artificially replicate 
something of the natural man’s soul and satisfy what, at 
bottom, are deep natural desires (Rousseau 1979, 40); 
and any adequate political solution must take its bear- 
ings from natural right, knowledge of which depends on 
knowledge of “the nature of man,” which itself includes 
and arguably is essentially constituted by the desires 
(Rousseau 1964, 93). Finally, Rousseau holds that there 
is one good, arising from one desire, that outranks all 
others and indeed comprehends them, in the sense that 
these other goods are good only to the extent that they 
participate in or contribute to the primary good. In this 
Rousseau’s good is comparable to Plato’s Good, and 
his understanding of the desire for this good is com- 
parable to Plato’s eros. Yet Rousseau’s good is not, 
in its content, presented as synonymous with Plato’s, 
and Rousseau does not endorse or otherwise indicate 
that he subscribes to the Platonic conception of eros. 
The kinship between his thought and Plato’s appears to 
be formal or structural more than substantive. Rather 
than suggest that there is some particular condition or 
state of being or content of consciousness that consti- 
tutes the good for human beings, he holds that the good 
consists in maximized existence, that is, felt existence, 


1 It is worth pointing out that, even if there were no connection be- 
tween the good and desire, one would still need to know what we 
desire, if only to know what morality is up against. 
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or, as he calls it, the sentiment of our being, a good 
whose rather formal and abstract name points to the 
fact that it can be gained through a number of means. 
Rather than suggest that we all long for or are drawn 
to transcendence of our finitude and mortality (which 
would seem to be the core meaning of eros), he seems to 
hold that our desire for the good, i.e., our desire to exist, 
a desire whose name is self-love, is more a push than a 
pull, and an intrinsically directionless, nonteleological 
push at that.? Self-love is inherently expansive, seeking 
to extend one’s very being or self, but not, apparently, 
toward any particular end. 

Whether Rousseau’s divergence from Plato is as con- 
siderable as it appears is debatable, for Rousseau’s cho- 
sen standpoint, his determination to examine phenom- 
ena in strictly empirical and phenomenological ways, 
arguably would lead him to eschew Plato’s more “meta- 
physical” language even if he believed in something 
like Plato’s Good and Platonic eros. Yet precisely be- 
cause he adopts this more modern-scientific perspec- 
tive, his thought commends itself to those inclined to 
the Platonic view, for this perspective allows Rousseau 
to offer a supplementary articulation, an articulation 
“from below,” as it were, of the same phenomena 
treated by Plato. Interestingly, it does something com- 
parable for the Kantian view of the good (though here, 
I think, there is definite disagreement regarding the 
status of that good), for Rousseau’s good allows for 
and explains the goodness of virtue and autonomy (or, 
in Rousseau’s terminology, moral freedom). For these 
reasons, Rousseau’s notion that existence is the good 
and his notion that, as creatures of self-love, we all de- 
sire the maximization of this good, are worthy of our se- 
rious consideration. There are other reasons as well, not 
least among which are the surface plausibility and the 
explanatory power of these ideas. Indeed, Rousseau of- 
fers a credible basis for a comprehensive interpretation 
of the human problem. His thought on these matters 
is not in the least metaphysical but it is metapsycho- 
logical, and as such it speaks to the wellsprings of all 
sorts of behavior, including political behavior, and to 
the question of moral and political standards and their 
basis in nature. 

The focus of the following inquiry will be less exis- 
tence itself (i.e., less the sentiment of existence) than 
the desire for existence, that is, the desire to main- 
tain and extend our being. This emphasis matches 
Rousseau’s own: perhaps because his perspective is 
phenomenological, concentrating more on our inner 
experience than on that toward which our experience 
seems to point, and perhaps too because he sees that 
this desire attaches itself to myriad objects, he says more 
about the desire than the thing desired.? Nevertheless, 
Rousseau does speak about existence, both directly and 


2 Self-love of course goes by two names m Rousseau, amour de soi 
and amour-propre As will be discussed below, though, each of these 
versions of self-love is fundamentally constituted by the desre to 
maintain and extend existence, or the sentiment of existence. In the 
rest of this paper the words “bemg” and “existence” will always refer 
to the feeling or sentiment of existence. 

3 Tf this single desire can attach itself to various objects, why call it 
a stugle desire? The answer is that these various objects are pursued 
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by implication. Indeed, it is not clear that one can speak 
coherently about a desire without at least a-brief pre- 
liminary look at the object, even if that object is a kind 
of meta-object that is rarely experienced as the focus 
of longing. Let us then look briefly at what existence 
means to Rousseau and why we may conclude that he 
considers it the good for human beings.‘ 


BEING 


To experience the sentiment of existence seems to 
mean nothing other than simply to feel alive, and to 
experience the sentiment of existence to a great extent 
means to feel intensely alive. Rousseau nowhere ad- 
dresses the matter systematically, and it is not clear that 
one could capture such a thing systematically. He nei- 
ther catalogs the modes of existence or its dimensions 
nor states the relation between existence and lesser 
or more proximate goods. Fortunately, though—so, at 
least, it seems to me—he does not need to. For we all 
have some sense of what it is to feel alive, to feel particu- 
larly alive, and of the variability of the degree or extent 
to which we feel alive; and we would probably all agree 
that some people, in some basic way, seem “larger” than 
others, seem to experience life more intensely and act 
with greater verve than others. The way to convey this 
phenomenon is indirectly, either through poetic repre- 
sentation of inner experience or through descriptions of 
behavior that are novelistic enough to convey the inner 
life behind the behavior. Rousseau (1979) offers ex- 
amples of both—the former in various autobiographi- 
cal passages (to be considered below), the latter in his 
portrayals of certain literary characters (Emile chief 
among them) and in the occasional general statement 
such as this one: “To be something, to be oneself and 
always one, a man must act as he speaks; he must always 
be decisive in making his choice, make it in a lofty style, 
and always stick to it” (40). Immediately prior to this 
explanation of what it means to “be something” (note 
the existential language) Rousseau had described “the 
men of our days,” the bourgeois, as “nothing.” 

We learn more about the sentiment of existence from 
Rousseau’s discussions of what does and does not con- 
tribute to or enable us to feel it. “To live is not to 
breathe,” he writes, “it is to act; it is to make use of 
all our organs, our senses, our faculties, of all the parts 
of ourselves which give us the sentiment of existence” 
(Rousseau 1979, 42). Rousseau’s various depictions of 





for the sake of a single, unvarying aim or “meta-object,” namely, 
maximized existence. This expansive use of the word “desire” 1s con- 
sistent with Rousseau’s usage: It 1s he who speaks of “the desire to 
extend our being” (1979, 168). Compare to Aristotle’s coinage of 
the relatively expansive “ores,” or “reaching-out” (Aristotle 1986, 
433a—b; Arnhart 1998, 19), usually rendered as “degre.” 

4 The meaning and mgnificance of existence, or the sentiment of 
existence, m Rousseau’s thought have been addressed by a num- 
ber of interpreters, among whom I would cite two as particularly 
worthy of note: Grace (2000) and Melzer (1990, 39-46, 64-69, 103— 
105) (For bref surveys of other interpreters, see Grace 2000, 37 
n. 38, and Melzer 1990, 40 n. 20.) I have explored these issues myself 
in Cooper 1999, chap. 1. In its investigation of “the desire to extend 
our being” the present paper goes beyond my earlier discussion, but 
the following brief survey 1s based on that longer account. 
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impressive and strong-souled individuals, from the sav- 
age to Emile to the exemplary citizen to the legislator 
to himself, serve to confirm this statement and to artic- 
ulate its various permutations. As this range of the per- 
sonalities suggests, these permutations are many. The 
faculties chiefly employed by, say, Rousseau himself 
are far removed from those of the savage (and each 
of the others, for that matter), and so, presumably, is 
the manner in which he experiences the sentiment of 
existence: through reverie and meditation Rousseau 
experiences a pure version of the sentiment of exis- 
tence, “stripped of any other emotion” and apparently 
all the more intense for that, whereas others seem to 
enjoy a less pure experience of existence through the 
exercise of more ordinary physical, social, and mental 
faculties (Rousseau 1959-69b, 1063-64, 1992-69). One 
experiences being through doing, though the modes of 
doing are innumerable. 

That Rousseau considers existence good in and of 
itself is incontrovertible. In the passage just cited from 
the Reveries of the Solitary Walker, he speaks of the 
peace and contentment and sweetness of the sentiment 
of existence, and in the letter to Voltaire he famously 
proclaims that, “for anyone who feels his existence, it is 
better to exist than not to exist” (Rousseau 1959-69b, 
1070).5 That existence is the highest good, indeed the 
good or the comprehensive good for Rousseau (in that 
it is the source or standard by which other goods are 
good), is less obvious but no less true. That existence 
holds this status can be established by examining how 
Rousseau describes, and that for which he praises or 
recommends, other goods. Virtue, for example (which 
various neo-Kantian interpreters mistakenly consider 
Rousseau’s highest good),° is treated as praiseworthy 
because it in some way satisfies the desire for more 
being. (In what way will be addressed below.) Even 
happiness, the only other plausible contender for high- 
est good, proves to be second to existence in two senses. 
First, the content of happiness consists in feeling, which 
always means in enjoying, one’s existence. This is true 
at the highest reaches, the happiness, described in the 
Reveries and cited above, which consists in the sweet 
sentiment of existence “stripped of any other emotion”; 
but it is equally true of more commonly available kinds 
of happiness. The latter is implied both in Rousseau’s 
various depictions of happiness and in his argument 
that happiness depends not so much on pleasure or 
any other positive thing but on the relative absence 
of pain and psychic conflict: That the mere absence of 
pain yields happiness can only mean that the positive 
element of happiness is already present by virtue of 





5 Rousseau’s belief in the goodness of “enlarged” exstence and his 
disdain for smallness of soul (or being) are perhaps most strikingly 
displayed in his famous comparmon between religious fanaticism and 
the “philosophic indifference” of “the allegedly wise man.” The for- 
mer, though destructive, at least manifests “a prodigious energy that 
need only be better directed to produce the most sublime virtues” 
The latter, in contrast, is irredeemably contemptible for exemphfying 
and encouraging an existentially constricting egoism. See Rousseau 
1979, 312n. 

6 For the neo-Kantian interpretation, see Casairer 1954, 1963, 
Gurvitch 1932; and Levine 1976. 
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one’s being and feeling alive. The second important 
sense in which existence is a good prior to happiness is 
that it can be attained, and can thereby give meaning 
and quality to life, apart from happiness. Happiness, as 
Rousseau (1992, 137) uses the term (bonheur), depends 
partly on our own conduct and internal state but also on 
external conditions beyond our control. Sadly, external 
conditions are often worst for those whose character is 
most admirable. Too often the best among us have been 
made to suffer precisely for what is best about them. 
Yet those individuals are not foolish or without earthly 
compensation, for, though obviously not happy, their 
contentment with themselves and their intensity of ex- 
perience yield them a magnificent share of existence 
and render their lives exemplary. To be sure, happiness 
is a great good. Probably we ought to think of it as 
the crowning good, since true happiness does not en- 
tail the compromise of any other good. Nevertheless, I 
call existence the greatest good since it is the source or 
substance of other goods. 

The means and conditions required for attaining a 
“large” existence are many and complex and in some 
ways ultimately mysterious. It is possible to say some- 
thing now about the “shape” of the endeavor, however, 
and it will be possible to say something more once we 
have examined Rousseau’s treatment of the “desire to 
extend our being.” In brief, an examination of the vari- 
ous character types admired or endorsed by Rousseau 
reveals that, for all their (enormous) differences, they 
all share two basic features, or meta-features.’ First, 
they all approximate the “negative” perfection of the 
first man, whose soul was troubled neither by inner con- 
flict nor by a painful excess of desire. None of them, it 
should be noted, quite attains that ideal. Yet the means 
by which they approximate it more than compensate 
them for that imperfection, for these means themselves 
derive from the second, and “positive,” shared feature 
of all good human types—namely, the harmonious de- 
velopment and employment of faculties which in and 
of themselves constitute extended or enlarged being. 
This is seen most clearly in Emile, whose imagination 
and passion keep him from quite attaining the savage’s 
equality between desire and faculty but whose will is 
kept from outpacing his power. The part of his desire 
that exceeds his faculties is mastered by virtue, which, 
being both a kind of rationalized passion and a faculty 
(will), yields a certain satisfaction and an extension of 
being (Rousseau 1979, 80). 

Existence, then—which, we recall, means felt 
existence—requires, first, a sufficient freedom from in- 
ner division and distraction so that one’s being may 
be felt and, second, that there be something to be felt. 
The extent of one’s existence depends on the degree to 


7 The character types to which I refer include the virtuous citizen; 
“the natural man living in the state of society [1.e., Emile]”; the con- 
templative Rousseau as he portrays himself in his autobiographical 
writings; and even the tribal savage, the demzen of “nascent society” 
(Rousseau 1964, 150-51). These four hardly exhaust the list of types 
approved of by Rousseau. But they do seem to me to occupy the 
“pure” positions and thereby to constitute the “markers” by which 
to understand all other worthy individuals. 
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which these requirements are met. The second require- 
ment is the one which admits of greater variability: The 
faculties which can contribute to the enlargement of 
existence are numerous and diverse. To the extent that 
most of us fail to achieve a desirable level of existence, 
though—and it is quite clear that Rousseau would find 
the vast majority of us wanting, just as he found the vast 
majority of his contemporaries wanting—the primary 
locus of the failure would seem to lie in the first or 
“negative” part of the project. Our smallness is first 
and foremost a product of inner conflict and excessive, 
unfulfillable desire; it is they, along with the consequent 
tendency to obsess over the past and to fret over the 
future, that keep us from living with whole hearts in 
the present moment. The ultimate source of this evil 
is society. By making us dependent on one another, 
society puts our most urgently felt needs and interests, 
themselves the product of socialization, into contradic- 
tion with the requirements of our desire—our need— 
to enhance the degree of our existence.’ (We see now 
why liberal or bourgeois society comes in for partic- 
ular scorn from Rousseau: It deliberately intensifies 
personal dependence.) Society’s existence-diminishing 
effects are not inevitable, however. In principle, they 
can be checked either by the construction of a regime 
based on the principles of political right as articulated in 
the Social Contract or by isolated individuals, through 
education or moral reform. Rousseau (1978) sees the 
former as highly unlikely, for reasons explained in the 
Social Contract (70-75). What about the latter? 
Whether in the case of the naturally exceptional in- 
dividual as exemplified by Rousseau himself or that of 
an ordinary person as exemplified by Emile, the key to 
a high level of existence is the proper ordering of self- 
love. (Note that both men were raised amid an unre- 
generate society.) Neither case seems very encouraging 
from a practical standpoint. The exceptional individ- 
ual by definition is not a model for most, and Emile, 
practically, is no proof at all, for Emile was not only 
conceived by a philosophic genius, but raised by one 
as well. Nevertheless, improvement is possible, as indi- 
cated by Rousseau’s various efforts to stimulate social 
and moral reform. One of these efforts is the Lettres 
morales, whose rhetoric is designed to persuade indi- 
viduals to self-reform and whose prescriptive content is 
designed to yield a more natural life and hence a greater 
degree of existence (Rousseau 1959-69b). Yet even 
this more modest but accessible possibility is remote 
for most people. Most of us, Rousseau would say, fail 
to understand—in fact, resist understanding—that we 
might increase our happiness and existence by shedding 
illusory hopes and false attachments. Here we arrive at 
a key cause of our diminished existence and return to 
the primary focus of our inquiry. For it turns out that the 
proximate cause of this tragic blindness—and, indeed, 
the cause of much of our existential diminishment in the 
first place—is precisely the desire to extend our being. 


_—— 

8 For an illuminating discussion of the “contradiction of society,” see 
Melzer 1990, 69-85 Regarding the connection between this contra- 
diction and the sentiment of existence specifically, see Grace 2000, 
22-24. 
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“THE DESIRE TO EXTEND OUR BEING” 


It is a peculiar ethical naturalism that identifies our 
greatest good with the object of our deepest desire but 
then identifies that desire as the source of our diversion 
from the good. Yet that is Rousseau’s view—in part. In 
any number of forceful passages he warns against the 
desire to be more than we already are. This is one of 
the more pronounced strains in Emile, from the rather 
Stoical discourse on happiness at the start of Book I 
(“O man, draw your existence up within yourself, and 
you will no longer be miserable” [Rousseau 1979, 83]) 
to the account of the young man’s study of history in 
Book IV (“He will be afflicted at seeing his brothers... 
turn into ferocious animals because they do not know 
how to be satisfied with being men” [242]) to the tutor’s 
counsel near the end of the book: 


Ihave only one precept to give you, and it comprehends all 
the others. Be a man. Restrain your heart within the limits 
of your condition. Study and know these limits. However 
narrow they may be, a man is not unhappy as long as he 
closes himself up within them. He is unhappy only when 
he wants to go out beyond them. (446) 


To be sure, the latter two of these admonitions concern 
not the desire to extend our being in toto but only the 
desire to be more than human. Yet the latter, inherently 
dangerous desire is arguably the logical endpoint of 
the desire to extend our being : and in fact is the hidden 
desire of “every civilized man” (Rousseau 1964, 195).? 
Therefore Rousseau’s counsel against wanting to ele- 
vate our being above the realm of the human effectively 
comes down to counsel against, or at the very least an 
urgent plea for caution regarding, the desire to extend 
our being as such; and often the counsel he gives is 
directly concerned with extension as such. 

Yet if Rousseau counsels against the desire to extend 
our being—and if he would leave his readers above all 
with a sense of the danger of that desire—that does not 
mean that he views this desire in an altogether negative 
light. I wish to argue, in fact, that Rousseau views the 
desire to extend our being as a highly ambiguous thing. 
Belonging to the essence of self-love, the desire to ex- 
tend our being is natural in its origins and potentially 
good even for civilized human beings, even if that po- 
tential is rarely realized. Indeed, it is the motive force 
directing us to the greatest good, just as, as we have 
just seen, it also directs us to the greatest evils. In this 
the desire to extend our being resembles Platonic eros, 
which is the motive force of both the best and the worst 
of men.’ One of my hopes in this paper is to distinguish 


9 «[A]fter long prosperity, after having swallowed up many treasures 
and desolated many men, my hero will end by ruining everything 
until he is the sole master of the universe. Such ın brief is the moral 
picture, if not of human hfe, at least of the secret pretensions of the 
heart of every civilized man” (emphasis added). 

10 The ambiguous potential of eros ıs one of the great themes of the 
Republic, in which the philosopher, the most self-sufficient and just of 
men, and the tyrant, the most avidly insatiable and unjust of men, are 
identified as erotic beings. See, respectively, 475e, 485a—b, 501d, 573b- 
579d. Also see 491d—492a, where Socrates links these two extremes 
by suggesting that those with highly erotic natures are destined for 
one or the other, with little likelihood of a middle position. 
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between these divergent potentials, that is, to ascertain 
the defining criteria of what, for lack of more elegant 
terms, I will simply call “good” and “bad” versions of 
the desire to extend our being. But before considering 
versions or species we would do well to look into the 
genus. What is the desire to extend our being? 

I remarked above that this desire belongs to the 
essence of self-love. That statement requires elabora- 
tion and qualification. Rousseau famously divides self- 
love into two types, amour de soi and amour-propre, the 
latter being the unnatural but inevitable outgrowth of 
the former in civilized human beings." It is not difficult 
to discern the centrality of the desire to extend one’s 
being to amour-propre, which, being a relative passion, 
always seeks preeminence over others and typically 
wishes for mastery over others (recall “the secret pre- 
tensions of the heart of every civilized man” [Rousseau 
1964, 195]). Preeminence and mastery bring, and are 
surely sought for the sake of, recognition by others; and 
for social man this recognition is the very stuff of being 
(179). Although born in response to the frustration of 
desire, amour-propre, much like Plato’s thymos, soon 
grows beyond the proportions demanded by its initi- 
ating provocation and takes on a very large (indeed, a 
typically tyrannical) life of its own.” Rather than seek 
merely to overcome a particular obstacle to a particular 
desire, it comes to see the world as full of obstacles and 
to view everything outside itself, including, especially, 
other people, as little else than obstructions or threats 
or objects of conquest: 


As soon as th[e] absolute love [of oneself] degenerates 
into amour-propre and comparative love . . as soon as 
one adopts the habit of measuring oneself against others 
and moving outside oneself in order to assign oneself the 
first and best place, it is impossible not to develop an aver- 
sion for everything that surpasses us, everything that low- 
ers our standing, everything that diminishes us, everything 
that by being something prevents us from being everything. 
(Rousseau 1990, 112; emphasis added) 


Whatever the initiating provocation, then, amour- 
propre comes to experience it as merely the edge or 
“presenting symptom” of the real provocation, namely, 
the fact that we are limited or finite beings, the fact that 
we are not “everything.” How far we have traveled from 
the benign self-love of the indolent natural man. 

Yet amour-propre is not injected into the soul by 
an alien force. It is born when self-consciousness, the 
awareness of one’s separateness and relatedness to oth- 
ers, triggers the transformation, the relativization, of 
amour de soi. That self-consciousness should relativize 
self-love, making self-esteem problematic and giving it 
a moral dimension, seems logical enough. (One would 


11 Rousseau’s most succinct articulation of the distinction between 
the two types of self-love appears ın note “o” of the Second Discourse 
(1964, 221-22). When I use the English term “self-love” I will be 
speaking of both amour de soi (or amour de sot-meme) and amour- 
propre 

12 Rousseau gives several accounts of the genesis of amour-propre 
out of amour de soi. The one that most nighlights the triggering role 
of obstacles or frustration appears at Dialogues (1990), 9. Regarding 
Plato’s account of the birth of thymos, see Cooper 2001; Newell 2000, 
34 et passim; and Pangle 1976. 
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like to say, “seems natural” enough, but that would be to 
settle an important question without first asking it.) But 
that self-consciousness should ignite infinite passion 
where before there had been only contentment with 
one’s level of being is baffling—so much so, in fact, that 
we might question our premise and wonder whether 
the longing for infinity was not in fact already present 
in amour de soi, either in latency or in some live but 
benign way. For it would be much easier to make sense 
of the birth of amour-propre, of the sudden rebellion 
against finitude, if the amour de soi out of which it is 
born were already expansive. And so it is. We might 
recall that Rousseau’s (1979) single explicit reference 
to “the desire to extend our being” occurs in connection 
with the preadolescent Emile, in whom amour-propre 
had yet to be born (168). In Rousseau’s view self-love, 
by its very principle, is expansive—which is to say that 
human beings, who, after all, are motivated exclusively 
by one kind of self-love or another, seek to be all that 
they can be. Human being as such is expansive. 

To register the precise meaning of this expansiveness 
it would help to take note of what it does not mean. That 
self-love and human being are expansive does not mean 
that the longing to be everything belongs to amour de 
soi. “Everything” is a relative term and as such can be 
the object only of the relative form of self-love. Nor 
does it mean that the desire to extend our being is an 
essential or universal feature of amour de soi. In the 
strict sense, at least, the desire to extend one’s being 
would seem to require that one be capable of desiring 
more than just the satisfaction of physical needs, a con- 
dition that excludes men and women in the pure state 
of nature as well as (perhaps) very small children amid 
civilization. (True, I just said that self-love, by its very 
principle, is expansive; but this principle can be opera- 
tive only where a certain level of cognitive development 
has taken place; more on this below.) Nor does it mean 
that every person, at every moment, somehow or other 
seeks to be more than he or she already is. Those who 
are weak or weakening seek only to preserve the being 
they already enjoy. For them, seeking to be all they can 
be is more in the nature of a defensive enterprise than 
an expansive one. Yet even this defensive posture tes- 
tifies to the positive inclination of self-love and human 
being, i.e., testifies to the reach for maximum existence. 
Let us then examine this inclination and let me specify 
what I think it does mean. 

Stated positively, my interpretation of Rousseau’s 
view of self-love and the expansiveness of human being 
consists in the following points, the first of which has 
already been stated. (1) Self-love of every variety, and 
human beings in whatever condition, seek the maxi- 
mum possible degree of existence. (2) In those who 
are capable—those who are sufficiently energetic and 
sufficiently advanced cognitively—the pursuit of maxi- 
mized existence manifests itself as the desire to ex- 
tend their being. This desire for extension appears both 
(a) in amour-propre, in which case it manifests itself as 


13 Regarding self-love as the source of all human action, consider 
that passion 1s the source of all action and that self-love is the source 
of all passion (Rousseau 1979, 183, 212-13; 1990, 9). 
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the desire to be everything, and (b) in amour de soi, 
in which case it manifests itself as a more benign reach 
for extension. (I am assuming that amour-propre and 
amour de soi are in principle distinguishable in the soul 
of civilized human beings and that they can, and do, 
coexist in the same soul.) (3) In those who are weak, the 
pursuit of maximized existence manifests itself in the 
desire only for preservation, not expansion. (4) In those 
who may be sufficiently strong but who lack the req- 
uisite cognitive capacity to conceive and desire things 
that are superfluous to their physical well-being (i.e., 
human beings in the pure state of nature and the very 
young amid civilization), the inclination to maximize 
existence manifests itself not as the desire to extend 
being but simply as a love of being that manifests as ex- 
uberance or overflowing activity and that, being love— 
indeed, being a passion (Rousseau 1979, 213)—would 
seek extension if it knew enough to be able to. (Perhaps 
one could say that this primitive self-love is latently 
expansive.) Finally, (5) what transforms the not-yet- 
expansive self-love and being of the savage or the infant 
into one or more versions of the desire for extension 
is the advent of self-consciousness, or the awareness of 
oneself as a separate self related to other selves. With 
self-consciousness comes the capacity to desire more 
than what would satisfy one’s physical needs, and the 
principle of self-love (the fact that it is love) dictates 
that this capacity will be employed. 

(1) The first and most basic point is nowhere cate- 
gorically stated by Rousseau, but it is suggested, defini- 
tively I think, by the principle of self-love. Self-love, 
we recall, is not only the source of passions but a pas- 
sion itself, whether in the form of amour de soi or as 
amour-propre (Rousseau 1964, 195, 221-22; 1979, 213). 
Passion implies desire. When one loves something one 
desires either to obtain it (or to obtain more of it) or 
to maintain possession of it, or both. When the ob- 
ject of love is something as unending as being itself, 
one never stops wanting more of it, at least not until 
one feels oneself somehow in possession of it (or more 
likely, union with it—about which more below). That 
most people do not consciously seek or desire extended 
being does not count against my interpretation. It does 
indicate, though, that most of us are ignorant of what we 
most deeply want. Now this analysis by itself is hardly 
proof of anything. It does, however, represent a key 
premise of Rousseau’s moral and political philosophy, 
which will become apparent as we consider some of 
the particulars of Rousseau’s treatment of the expan- 
siveness of self-love and the desire for extended being. 
Let us therefore proceed to those particulars, begin- 
ning with what I take to be Rousseau’s most complete 
summary statement of the nature and sources of human 
action (and hence human being). Besides testifying to 
Rousseau’s belief in a universal tendency of self-love 
to maximize existence, this passage from the Dialogues 
also lays out the distinction between the two basic ver- 
sions of the desire for extended being: 


Sensitivity is the principle of all action. . . . There is a purely 
passive physical and organic sensitrvity which seems to 
have as its end only the preservation of our bodies and 
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of our species through the direction of pleasure and pain. 
There is another sensitivity that I call active and moral 
which is nothing other than the faculty of attaching our 
affections to beings who are foreign to us. This type, about 
which study of nerve pairs teaches nothing, seems to offer a 
fairly clear analogy for souls to the magnetic faculty of bod- 
ies. Its strength is in proportion to the relationships we feel 
between ourselves and other beings, and depending on the 
nature of these relationships it sometimes acts positively 
by attraction, sometimes negatively by repulsion, like the 
poles of a magnet. The positive or attracting action is the 
simple work of nature, which seeks to extend and reinforce 
the feeling of our shed the negative or repelling action, 
which compresses and diminishes the being of another, is 
a combination produced by reflection. From the former 
arise all the loving and gentle passions, and from the latter 
all the hateful and cruel passions. . . . Positive sensitivity is 
directly derived from amour de soi. It 1s very natural that a 
person who loves himself should seek to extend his being 
and his enjoyments and to appropriate for himself through 
attachment what he feels should be a good thing for him. 
This is a pure matter of feeling ın which reflection plays 
no part. But as soon as this absolute love degenerates into 
amour-propre and comparative love, it produces negative 
sensitivity, because as soon as one adopts the habit of mea- 
suring oneself against others and moving outside oneself ın 
order to assign oneself the first and best place, it is impossi- 
ble not to develop an aversion for everything that surpasses 
us, everything that lowers our standing, everything that di- 
minishes us, everything that by being something prevents 
us from being everything. (Rousseau 1990, 112; emphasis 
added) 


Prior to and cutting across the distinction between 
amour de soi and amour-propre is the distinction be- 
tween the (passive, physical) sensitivity that tends to the 
preservation of our being and the (active, moral) sensi- 
tivity that tends to the extension of our being. The point 
of immediate relevance is that the expansive tendency 
belongs to self-love as such—i.e., that it is expressed, 
however differently, by both amour de soi and amour- 
propre—and that, belonging to the nature of self-love, 
it does not arise only in nse to some threat. (See 
also Rousseau 1959-69a, 1324-25; 1979, 79.) 

(2, 3) I noted above that expansiveness, or the de- 
sire for extended being, requires sufficient force; ab- 
sent such force self-love or the desire for maximized 
existence manifests itself defensively, as the desire for 
preservation only. “In the state of weakness and insuf- 
ficiency concern for our preservation concentrates us 
within ourselves. In the state of power and strength 
the desire to extend our being takes us out of our- 
selves and causes us to leap as far as is possible for 
us” (Rousseau 1979, 168).‘4 Although in this line from 
Emile, Rousseau seems to speak of a singular “state of 
power and strength,” the more comprehensive analysis 
in the Dialogues indicates that the force requisite for 
the desire to extend one’s being can come in either of 
two variants—either strength of soul, in which case the 





14 Compare to Nietzsche 1974, sec. 349: “The wish to preserve one- 
self is the symptom of a condition of distress, of a hmitation of the 
really fundamental instinct of hfe which arms at the expansion of 
power and, wishing for that, frequently risks and even sacrifices self- 
preservation” (emphasis m the original) 
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result is a “positive” expansiveness, or the energy of 
those less strong of soul but nevertheless full of desire, 
in which case the expansive tendency is “negatives 
seeking its own fullness only by diminishing others. 
Examples of the former include the adolescent Emile 
and indeed all adolescents who have not fallen into 
dissipation: “a superabundance of life seeks to extend 
itself outward” and brings about friendship, compas- 
sion, and romantic love (Rousseau 1979, 220). Of the 
newly compassionate Emile, Rousseau observes, "He 
feels himself to be in that condition of strength which 
extends us beyond ourselves and leads us to take else- 
where activity superfluous to our well-being” (229). 
Examples of the expansiveness born of the desire or 
energy of those who are not strong of soul are found 
throughout Rousseau’s corpus, from the “agitated” so- 
ciable man of the Second Discourse to the assiduous 
conspirators persecuting Rousseau (1964, 179; 1990, 
passim). 

(3) Just as there is a distinction among those with the 
psychic force to desire extended existence, so there is a 
distinction to be made among the weak. Those who are 
weak from old age, husbanding what they understand 
to be waning resources, seek only calm, whereas those 
who are weak because they are still too young to pro- 
vide for themselves are active and rambunctious in an- 
ticipation of greater power (and perhaps too because, 
unlike the old, they are confident that their preserva- 
tion needs will continue to be met by others). See, in 
this regard, Rousseau’s (1979) comparison of the child, 
whose strength is not great but whose “activity ... is 
superabundant and extends outward,” to the similarly 
(but of course not similarly) weak “old man,” whose 
“failing activity” is “concentrated in [his] heart” (67). 
Also note Rousseau’s observation that it belongs to ev- 
ery age but especially to childhood “to want to create, 
imitate, produce, give signs of power and activity,” a 
propensity that can be seen in children as a harbinger 
or even direct evidence of the desire to extend their 
being (98). Though weak in an absolute sense, children 
typically have the energy and confidence to let them 
take preservation for granted and seek extended being 
according to their own lights and capacities. 

(4, 5) I suggested above that the discovery of amour 
de soi’s expansiveness makes the birth of amour-propre, 
with its aggressive expansiveness, somewhat less baf- 
fling. What occurs with the birth of amour-propre is 
not the creation but rather the transformation, the rel- 
ativization, of a preexisting tendency. Amour-propre 
and amour de soi prove to have more in common than 
is initially apparent. Yet precisely by grasping this link 


15 In absolute terms, one who expands his being only by cimmushing 
others (even assuming such a thing 1s possible) rs not expanding one’s 
being at all. But that is just the point: Such a person is entirely relative 
in his being; caring only about his standing vis-a-vis others, he only 
cares about proportions and, thus, feels himself more for their being 
less. 

16 Rousseau does not claim that strength of soul and healthy expan- 
stveness belong only to the young. But it is a bit discouraging that 
his most compelling portrart of adult strength of soul and healthy 
expanstveness is found in his fantastic depiction of an “ideal world” 
(Rousseau 1990, 9-12). 


we come to see a different disjunction and are faced 
with another baffling evolutionary step: How are we to 
understand the development of an expansive self-love 
(Le., self-love as it appears in civilized human beings, 
whether in the form of amour de soi or amour-propre) 
out of the apparently nonexpansive self-love of the 
savage? Why did contentment give way to infinite and 
usually insatiable longing? 

Rousseau famously describes the savage as idle and 
lazy (1964, 118, 208; 1986c, 266n.)—and not only lazy, 
but content: What little he wanted was in his power to 
procure (1964, 116). Rousseau does not even hint that 
there was in the soul of the savage any deep-seated agi- 
tation or any impulse to overflow himself. The extent of 
the savage’s expansiveness was his repugnance at see- 
ing sensible beings perish or suffer, a phenomenon that 
does denote a kind of extensive impulse but one whose 
smallness Rousseau (1964) underscores by attributing 
to horses as well as to men (95, 130). Yet the savage was 
a creature of self-love, and self-love, as noted earlier, is 
a passion that carries a desire for maximized existence. 
This desire is the principle of self-love as such, as appli- 
cable to the savage as to civilized humanity. (It is impor- 
tant to recall Rousseau’s insistence that the inhabitant 
of the pure state of nature was indeed a human be- 
ing.) Because of his limited cognitive development, the 
savage was not able to conceive or (therefore) desire 
more than he needed, and what he needed was normally 
available to him—hence the contentment. With the ap- 
pearance of incipient self-consciousness, however, he 
became capable of conceiving new objects of desire and 
began to be aware of his finitude and vulnerability. That 
is to say, he began to feel, however inarticulately, both 
that his existence could be extended and that, if he was 
to be happy, his existence needed to be extended. Self- 
consciousness is what triggered the desire to extend 
one’s being. 

This logic, though nowhere simply stated by 
Rousseau, is borne out by his depictions of the vari- 
ous human types. While the savage is depicted as un- 
interested in, because not able to conceive, things that 
would extend his being, all sufficiently energetic self- 
conscious or civilized human beings are depicted as 
desiring extended existence. Those who are dominated 
by unruly arnour-propre—tink here of “the secret pre- 
tensions of the heart of every civilized man”—are de- 
picted as insatiable in their desire to outstrip others, 
to dominate everyone, to be everything. Those gov- 
erned by an orderly amour-propre—citizens of a well 
constituted republic, for example—while well disposed 
toward fellow citizens, still seek preeminence both for 
themselves and for the nation. Finally, those in whom 
amour-propre is not predominant, such as the young 
Emile and the older Rousseau, also evince a desire to 
extend their being. Emile’s case has already been dis- 
cussed: His education in compassion is explicitly said 
to consist in the channeling of an overflowing or ex- 
pansive strength, and it is in connection with Emile’s 
scientific education that Rousseau (1979) speaks of 
“the desire to extend our being” (167-68). But can 
one really maintain that the dreamy-contemplative and 
self-describedly idle Rousseau exemplifies a desire to 
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extend his being? (Rousseau 1992, 64; 1995, 537). Yes, 
in fact. And with this we come to the next part of our 
discussion. 


HOW, AND HOW NOT, TO “EXTEND OUR 
BEING” 


No one but a philosopher would speak of “the desire 
to extend our being,” and most philosophers have pre- 
ferred to speak of some other goal—power, or plea- 
sure, or preservation, for example—when postulating 
the ruling desire of human beings.’ Even Rousseau 
agrees that extended being is not the immediate object 
but rather the ultimate object—or, to use a psycho- 
analytic distinction, the aim—of many other desires. 
(Undoubtedly this is one reason for the infrequency of 
the language of “being” in his work: One who wishes 
to reform judgment, as Rousseau clearly does, needs 
to speak to people’s desires in more familiar terms.) 
These other desires are many indeed, their foci ranging 
from material possessions, prestige, and mastery over 
others and over oneself to compassion (the desire that 
others not suffer), friendship, patriotism, familial and 
romantic love, and knowledge and understanding.!® In 
many, probably most, of these cases, it is intuitively 
plausible that the desired object is desired for the sake 
of extended being (this is especially true where and to 
the extent that amour-propre is seen to be at work), 
though to say that things are pursued for the sake of 
extended being is not to say that they necessarily bring 
extended being when they are attained. Possessions, 
for example, are typically experienced as extensions of 
oneself, even if many would follow Hobbes and Locke 
in holding that this extension is sought as a fence of 
our preservation and not as an end in itself. The desire 
for prestige, though again thought by many to serve 
some other end, can also rather easily be interpreted as 
an expression or manifestation of the desire to extend 
one’s being, for one who is recognized and admired by 
others sees himself reflected in them and knows himself 


17 The examples are too numerous to analyze, but the mention of 

such names as Machiavelli, Nietzsche, Epicurus, Hobbes, and Spinoza 

should make the point. Plato immediately comes to mind as one who 

does seem to say that human beings desire more, or a more real, being 

Their shared emphasis on being constitutes an important affinity and 
haps an mstance of influence between Plato and Rousseau. 

8 This list is mcomplete, and the explanations that follow will be 
more so (with the mportant exception of my treatment of the last 
item). J am permitting myself this incompleteness partly because 
of space constraints; partly because some of the points are obvious 
enough to require minimal explanation; and partly because some of 
them, especially those belonging to the category of moral pheno- 
mena, have been treated well by other scholars; but also because 
the full meaning of the demre for extended being will be iuminated 
less by a necessarily superficial survey of its many manrfestations 
or subsiciary desires than by a detailed examination of that activity 
that best succeeds at attaining extended exstence—hence the more 
extended treatment of the last item on the lst. Finally, let me stress 
that I am not proposing that the “mechanisms” whereby extended 
being sought or found are the same in each case, or that Rousseau 
thinks they are Indeed, quite the contrary: While pomting existence 
as the comprehensive good for human beings, Rousseau, it seems, 
allows for as many modes of existence and thus of extending exstence 
as there are faculties. 
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to exist somehow in and through their consciousness. 
Perhaps mastery over others is the clearest case of those 
cited. Mastery over others consists in the extension of 
the effective range of one’s will, an extension believed 
by the lover of such power to constitute an extension 
of his existence, since he views his will as the essential 
part of himself or the thing he most is. 

The desire for self-mastery can be interpreted in this 
vein as well. The most remarked-upon kind of self- 
mastery in Rousseau, and certainly the most important, 
is virtue, Virtue, which by definition entails a kind of 
self-overcoming, is pursued for the sake of satisfaction 
with oneself, a satisfaction born of the very fact of over- 
coming, i.e., born of the exercise of strength by one part 
of oneself—presumably the truer part of oneself, the 
part one considers the core of one’s identity—against 
another part. This mastery of one part of oneself by 
another part can be seen as rule by, and therefore as 
the extension of the being of, the active, rational, moral 
part of the self, the part one considers one’s true self. 
(How else to explain why, when the will successfully 
governs appetite, we say that we have won a victory 
over ourselves? Why not say that we have been de- 
feated by ourselves or that, as always, we have both 
won and lost?)!9 

Not only virtue, but the other key element of the 
moral life, compassion, can also (and more easily) be 
seen to entail the extension of one’s being. Compas- 
sion, according to Rousseau, consists in the extension of 
our being to, and consequently our suffering in, others 
(1979, 221, 223; 1986c, 261). And if compassion entails 
extension, so must that whose source is compassion— 
friendship, most notably (Rousseau 1964, 131-32). Pa- 
triotism, too, can be interpreted along these lines. Cit- 
izens who identify with one another are in some sense 
extending their being to one another, and of course 
the imperialism that is frequently born of this collec- 
tivization of amour-propre is self-evidently a case of 
attempting to extend being. If citizens identify with one 
another, so do members of a family, which in fact serves 
as both the prototype and the training ground of civic 
identification and devotion (Rousseau 1979, 363). Thus 
family love too qualifies as a manifestation of the desire 
for extended being. The next item on our list is romantic 
love, which, with its intense mixture of (a) the desire 
to possess the beloved and (b) sympathetic devotion 
to him or her, seems to me to qualify on two grounds, 
the former exemplifying the desire for extended being 
as expressed through amour-propre, the latter through 
amour de soi. Finally, the desire for extended being 
can often be discerned as the source of the desire for 
knowledge and understanding. After all, it is in connec- 
tion with the young Emile’s new education in science 
that Rousseau actually uses the phrase, “the desire to 
extend our being” (168), and it is easy enough to see 


19 Cf Plato 1968, 430e-431b. For more on the satisfactions of virtue, 
see Melzer 1990, 100-104. 

2 Not only virtue and compassion but a whole range of moral phe- 
nomena can be seen as expressions of expansive self-love, including 
benevolence, generosity, gratitude, and promise-keeping; see Grace 
2000. 
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that knowledge of the way the world works grants some 
power, some extension of oneself, over and in the world. 

Now science as Emile learns it is not the science 
(the botany) of the older Rousseau, and it is certainly 
not the same as Rousseau’s philosophizing and reverie. 
Emile’s study, at least in its origin, is undertaken in 
the service of his immediate and palpable interest, that 
is, for the sake of some end beyond itself. Yet I wish 
to suggest that not only his but also Rousseau’s in- 
tellectual activity is pursued for the sake of extended 
being. Indeed, I would suggest that Rousseau’s activ- 
ity is a more direct means of extending being than 
that of Emile—and not only more direct, but truer: 
I propose that the intellectual activity recounted in 
and exemplified by Rousseau’s autobiographical writ- 
ings, and especially the Reveries, is the truest manifes- 
tation of the desire for extended being of all those 
that appear in Rousseau’s work in that it seeks— 
and therefore gains—maximized existence where it is 
most amply available. It—or, rather, Rousseau himself, 
Rousseau the dreamer/thinker/artist—recognizes what 
would best satisfy the desire for extended being, just as 
the Platonic philosopher recognizes what alone would 
satisfy the deepest human longing. Rousseau’s intellec- 
tual activity thus demands our attention, if only because 
a desire cannot be truly understood without some grasp 
of the thing that would best satisfy it. 

In taking up the case of Rousseau himself I must 
begin by responding to an obvious objection. Like the 
savage, Rousseau leads an idle life (1992, 64 et passim; 
1995, 537). How, then, can he be seen as pursuing and 
gaining extended existence? Would it not be more ac- 
curate to see him as having shed the desire for extended 
being along with, or as a part of, his having (for the most 
part) shed amour-propre (Rousseau 1992, 117-18)? 

The force of this objection dissipates when we take 
note of the peculiar character of Rousseau’s idleness. 
Idleness, as Rousseau conceives of it, is not all of a 
piece. Some varieties are active and even involve much 
thinking: “The idleness I love is not that of a do-nothing 
who stays there with his arms crossed in total inactivity 
and thinks no more than he acts. It is both that of a 
child who is ceaselessly in motion while doing nothing 
and, at the same time, that of a dotard who strays when 
his arms are at rest” (Rousseau 1995, 537) The refer- 
ence to thinking is not insignificant. In the very next 
paragraph Rousseau recounts his forays into botany, 
which he describes as “precisely an idle study,” yet one 
that consists in “analysis” and eventuates in written 
documents of considerable sophistication (537; also see 
the “Seventh Walk” of the Reveries). “Idleness,” it 
would seem, means nothing more than that the activity 
is pursued for its own sake, or for the sake of immediate 
enjoyment. 

Yet, as even so empirically oriented a thinker 
as Aristotle (1962, 1097a—b) teaches, things that we 








21 For more on the character of Rousseau’s botanizing, see Cantor 
1985. Also see Davis 1999 for a provocative discussion of the philo- 
sophic significance of botany in Rousseau’s work. Rousseau’s botani- 
cal writings include Dictionnaire des termes d'usage en botanique and 
Lettres Elementaires sur la botanique à Madame de Lessert, each of 
these appears in Rousseau 1959-69b. 


choose for their own sake or for the sake of our im- 
mediate enjoyment are also chosen for the sake of 
that one thing that we seek only for its own sake: for 
him, eudaimonia; for Rousseau, the maximization of 
being. Indeed, Rousseau seems to teach that we want 
everything we want exclusively for the sake of this one, 
ultimate good. However that may be, it seems to me 
quite clear that Rousseau’s intellectual activity as re- 
counted in and exemplified by his autobiographical 
writings, however “idle,” is pursued precisely and even 
consciously for the sake of extended being, and that 
its success indicates that it is propelled by the most en- 
lightened version of the desire for extended being.” At 
their peaks, the Reveries and even the Confessions re- 
count episodes in which Rousseau overcomes “civilized 
self-consciousness” and thereby enlarges or extends his 
existence to extraordinary proportions.” (For the sake 
of economy I am passing over Rousseau’s botanizing 
and will focus instead on the reveries. Not that botany 
is unimportant—indeed, its accessibility to persons of 
ordinary intelligence makes it particularly important 
from a practical standpoint—but as a lesser peak than 
the reveries, botanizing reveals less about the desire 
for and possibility of extended being and is thus less 
theoretically significant than the reveries.) That his be- 
ing is enlarged or extended is not an interpretation I 
have imposed but Rousseau’s own explicit claim. It is 
presented as the immediate source of the pleasure and 
therefore (I would argue) the deliberate goal of the 
activities he describes. 

Rousseau depicts at least two types of extraordinary 
existence-enlarging reveries. Sometimes the enlarge- 
ment of existence occurs through the identification of 
the self with the whole of nature in its beautiful har- 
mony of diverse parts: “I never meditate, I never dream 
more deliciously than when I forget myself. I feel ec- 
stasies and inexpressible raptures in blending, so to 
speak, into the system of beings and in making myself 
one with the whole of nature” (Rousseau 1992, 95). 
Although the raptures of which Rousseau speaks are 
not rational experiences, his use of the word “system” 
suggests that they may somehow include or otherwise 
entail rational awareness. Yet if they include rational- 
ity, they also supersede it, thus rendering them supra- 
rather than irrational; and the result is something 





2 The reader will have observed that my initial references to the 
desire for knowledge and understanding have given way to the more 
vague term, “intellectual activity,” as the discussion has moved from 
the “ordinary” or interested pursut of scence (e.g., that of the young 
Emile) to the extraordinary practice of Rousseau himself Precisely 
what constitutes the latter is not easy to say, for Rousseau not only 
departs from ordinary science, but departs, or at least seems to depart, 
from the traditional modes of philosophy as well For now I would 
simply point out that what I have been calling “Rousseau’s intel- 
lectual actrvity as described and exemplified in the autobiographical 
writings” consists in at least four distinct activities—namely, botany, 
reverie, philosophic meditation on the reveries (and on other things 
beades), and the artistic rendering of the preceding three. The in- 
terrelations of the four are far too complex to be addressed in this 
paper, except for a brief comment below. 

“Crilized self-consciousness” 1s Christopher Kelly’s (1987, 243) 
term. 
24 “Supersession . . is at once a negating and a preserving” (Hegel 
1977, 68). 
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beyond discursive knowledge: “The more sensitive soul 
a contemplator has, the more he gives himself up to the 
ecstasies this harmony arouses in him. A sweet and 
deep reverie takes possession of his senses then, and 
through a delicious intoxication he loses himself in the 
immensity of this beautiful system with which he feels 
himself one. Then, all particular objects elude him; he 
sees and feels nothing except in the whole” (92). From 
the standpoint of ordinary or civilized consciousness, 
there is a paradoxical quality to these experiences. On 
the one hand, the self seems to expand, to incorporate 
something far greater than its original content. This 
is reflected in Rousseau’s continued use of personal 
pronouns to describe the experience: It is always “I” or 
“he” who feels himself one with nature. On the other 
hand, though, he refers in both passages to a forgetting 
or submergence of the self (“I forget myself,” “he loses 
himself”). 

The second kind of existence-enlarging experience 
described in the Reveries seems on its face to contain 
neither side of this paradox. Rather than expand its 
boundaries, the self seems to narrow: It identifies it- 
self not with nature but only with its own innermost 
core. And far from being forgotten or submerged in 
something larger, the self, even in its now narrow form, 
occupies the whole of consciousness: 


If there is a state in which the soul finds a solid enough base 
to rest itself on entirely and to gather its whole being into, 
without needing to recall the past or encroach upon the 
future; in which time is nothing for it; in which the present 
lasts forever without, however, making its duration no- 
ticed and without any trace of time’s passage; without any 
other sentiment of deprivation or of enjoyment, pleasure 
or pain, desire or fear, except that alone of our existence, 
and having this sentiment alone fill it completely; as long 
as this state lasts, he who finds himself in it can call himself 
happy, not with an imperfect, poor, and relative happiness 
such as one finds in the pleasures of life, but with a suff- 
cient, perfect and full happiness which leaves in the soul 
no emptiness it might feel a need to fill. Such is the state 
in which I often found myself during my solitary reveries 
on the fle de Saint-Pierre, either lying in my boat as I let it 
drift with the water or seated on the banks of the tossing 
lake; or elsewhere, at the edge of a beautiful river or of a 
brook murmuring over pebbles. 

What do we enjoy in such a situation? Nothing external 
to ourselves, nothing if not ourselves and our own existence. 
As long as this state lasts, we are sufficient unto ourselves, 
lıke God. (Rousseau 1992, 68-69; emphases added) 


Yet if this second experience does not partake of the 
paradox of the first experience, it nevertheless shares 
much with it: In each of the experiences civilized self- 
consciousness and the separate-self sense have given 
way to a kind of thoughtless but not unconscious feel- 
ing of existence.” In fact, the two experiences even 
share the greater paradox of which the other paradox, 





3 Far from unconsciousness or regression, these experiences are pre- 
sented as being above and beyond ordinary consciousness Although 
they partake of the savage’s inner balance and harmony, Rousseau 
all the while retams and employs a level of awareness, an aesthetic 
sensibility, and a scientific expertise that outstrip those belonging to 
the typical civilized person, let alone the savage. 
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the one found only in the first experience, is an effect. 
Namely, in each of the two experiences we find both 
an enhancement of the self (“we are sufficient unto 
ourselves, like God”) and a simultaneous letting-go of 
or disinvestment from all that belongs to the self in its 
separateness. In fact, though, this will seem a paradox 
only to those who believe that the road to enlarged 
existence is through the enhancement or aggrandize- 
ment of the separate self. To Rousseau, in contrast, it is 
no paradox at all, for he teaches through the Reveries 
(and other texts) that the transcendence of the sepa- 
rate self along with its possessions and encumbrances 
is a legitimate way, probably the best way, to enlarge 
one’s being.* (What I described as the narrowing of 
the self in the second experience is a narrowing only 
with respect to space and time. The self narrows until it 
includes nothing but the sentiment of its own existence, 
yet this sentiment (as experienced by the cultivated 
Rousseau), with its perfect and conscious satisfaction, 
seems to me to constitute a great enlargement of ex- 
istence.) Whereas amour-propre, at least in its typical 
form, inevitably fails to win the satisfaction it seeks (it 
is not easy to achieve preeminence, much less to “be 
everything”), the desire for extended being that arises 
from amour de soi can win greater existence for the self 
by liberating it from identification with the ego.”’ That, 
it seems to me, is the lesson of the reveries. 





25 By “transcendence of the separate self” I mean that the self ceases 
to be conscious or mindful of its separateness. Objectively speaking, 
of course, the existence the self enjoys 1s still its own. 

77 As the comparison to God may indicate, Rousseau thinks that 
this means of enlarging one’s being lies beyond the power of most 
people. It would seem to require a highly improbable combination of 
natural genius and a drfficult-to-choose solitude. Here, as elsewhere, 
though, some may doubt Rousseau’s pessimism while embracing his 
basic theoretical insight. An interesting and very important question 
1s whether amour-propre can in any way serve the cause of its own 
reform or overcoming: One can imagine people reading Rousseau 
(or someone else), coming to the conclusion that the greatest happi- 
ness entails a transcendence of amour-propre, and then seeking that 
happiness .. for reasons of amour-propre (“Why shouldn’t I have 
what he has?”). The question is whether such a project (an egoistically 
propelled attempt to overcome egoism) could ever succeed There 
are perhaps some grounds for thinking that it might. 

78 The lesson of the Revenes (the book, not just the activity recounted 
therein) is far more complex and multifaceted than this one part. 
As noted above, the book recounts and/or exemplifies at least four 
distinct activities. Aside from the reveries themselves (and botany), 
we also encounter Rousseau’s meditation on them and his artistic 
(te)presentation of both the reveries and the meditation. It is beyond 
the purview of the current paper to pursue this further, other than 
to say that it may well be that all four activities contribute to the 
enlargement of being, and even that the latter two—the reporting, as 
it were—are more mportant than the expenences reported. Why else 
would Rousseau “waste” his time in meditation and writing? This 
suggestion has important implications for the status of philosophy 
in Rousseau’s thought. We have already seen Rousseau teach that 
reflection diminishes existence by deflecting passion (1990, 112), and 
we may recall his famous near-suggestion (“I almost dare affirm”) 
“that the state of reflection 1s a state contrary to nature and that the 
man who meditates is a depraved animal” (1964, 110). These state- 
ments seem to make philosophy at best a necessary evil—necessary, 
that ıs, for those who are ginded by the philosopher (Rousseau 1979, 
219), but an evil for the one chosen by fate and a sense of responsi- 
bility to do the guiding. Yet the Revers, with its philosophic portrait 
and analysis of Rousseau’s happiness (and unhappiness), may well 
paint to the pleasure, the enlarged existence, of a certain kind of 
philosopher. 


American Political Science Review 


Vol. 98, No. 1 


American Political Science Review S m a a amama amaaa 


Have we then arrived at the decisive distinction be- 
tween good and bad versions of the desire to extend 
our being? Shall we simply conclude that, in Rousseau’s 
view, the desire for extended existence is good, because 
successful, when born of amour de soi, and bad, be- 
cause unsuccessful (and destructive of others), when 
born of amour-propre? Not quite—which is good news, 
given the overwhelming and inevitable predominance 
of amour-propre in the souls of civilized men and 
women. It does seem to be true that the most per- 
fect expansion of existence comes from a state of mind 
born of amour de soi and that the most ill-fated and 
tyrannical attempts at extended existence arise from 
amour-propre. More broadly, Rousseau undoubtedly 
sees a strong correlation between amour de soi and 
good desire for extended existence on the one hand and 
amour-propre and bad desire for extended existence on 
the other hand, a correlation that makes sense in light 
of the respective origins of these desires: Recall that the 
expansiveness of amour de soi arises from overflowing 
strength, while that of amour-propre arises from weak- 
ness and aversion. Recall too the passage from the Di- 
alogues quoted above (Rousseau 1990, 112), in which 
“all the loving and gentle passions” are said to arise 
from the “positive sensitivity” born of amour de soi, and 
“all the hateful and cruel passions” are attributed to the 
“negative sensitivity” produced by amour-propre. Nev- 
ertheless the correlation is not perfect. Amour de soi, 
though “a good and absolute feeling,” can lead people 
into conflict.2? Conversely, amour-propre, though rival- 
rous and irascible, is educable and can be made to serve 
both civic well-being and one’s own happiness (one’s 
own enlarged existence). All the hateful and cruel pas- 
sions come from amour-propre, but amour-propre does 
not produce only hateful and cruel passions: One need 
only recall my earlier list of phenomena that arise from 
the longing for extended being and then notice the in- 
volvement of amour-propre in some of the best among 
them, including virtue, compassion, romantic love, and 
patriotism.” 

Certainly the relative proportions of amour de soi 
and amour-propre in one’s soul will deeply influence 
the way one pursues extended existence: The person 
governed by amour de soi lives within him- or herself 
and will therefore presumably seek extension differ- 
ently from, more inwardly and/or directly than, the per- 
son governed by amour-propre, who, already “liv[ing] 
only in the opinion of others,” will likely seek extension 
through rule or recognition (Rousseau 1964, 179). To 
determine how Rousseau thinks one should or might 
best extend one’s existence, however—to determine 
the criteria of good and bad versions of the desire 
to extend our being—one needs to look beyond the 
crisp distinction between the two kinds of self-love to 
Rousseau’s many particular judgments of specific in- 





D At rts peak, however, 1¢., as it appears in the highly culuvated 
Rousseau, amour de so: seeks extended bemg through the kind of ın- 
ward or spiritual activity discussed above and so does not risk conflict 
with others. 

30 I have addressed the role of amour-propre in these and other hu- 
man goods in Cooper 1999, 122-30. 


stances of the desire. Such a survey yields the following 
results. 

We begin with two qualifications. First, with the pos- 
sible exception of the sort of reverie discussed above, no 
particular activity aimed at extended existence can be 
called good in an unqualified sense: Social or political 
considerations might well argue against this or that ac- 
tivity at this or that time. Even Rousseau’s kind of rever- 
ies might prove inconsistent with social well-being if too 
many people were drawn to them. Second, how one 
might best pursue extended existence surely depends 
on who one is—on one’s station in life as well as one’s 
abilities and inclinations. That having been said, it is 
possible to discern in Rousseau’s thought some general 
themes regarding good and bad, or better and worse, 
ways of pursuing extended existence. The most promi- 
nent of these, perhaps, involves the distinction between 
having and doing. By “having” I mean to refer (more 
or less) to the pursuit of what Aristotle (1962, 1098b, 
1099a-b) calls “external goods”; by “doing,” I mean 
activity undertaken for its own sake or for the sake of 
the experience itself. Being, Rousseau (1979) teaches, 
is attained or realized through doing: “To live is not to 
breathe, it is to act; it is to make use of all our organs, our 
senses, our faculties, of all the parts of ourselves which 
give us the sentiment of existence” (42). This dictum 
effectively suggests that, in general, the best way to 
pursue maximized existence is to develop and exercise 
our faculties to the maximum possible extent. Of course 
this advice is hardly without peril. The development of 
the various faculties, both in the species at large and 
in innumerable individuals, has more often than not 
resulted in a dimminishment of existence and happiness, 
for the growth of one’s faculties has typically stimulated 
even greater growth of one’s desires (Rousseau 1964, 
147; 1979, 80-81). Yet let us consider wherein this peril 
consists. The danger is that the pursuit of more capacity 
for doing will lead (for reasons that are complex but 
familiar to Rousseau’s readers) to an even greater pur- 
suit of having. Having—or, rather, excessive desire for 
having—is the problem. It would be simplistic but not 
altogether wrong to ascribe to Rousseau the Marxian 
view that having is the enemy of being. 

When is the desire to possess excessive? One could 
say that it becomes excessive simply when it outstrips 
our power to satisfy it, which is true enough. This, how- 
ever, occurs when we lose control over it—when it gains 
control over us (Rousseau 1979, 445).>! And that hap- 
pens, I would argue, precisely when and to the extent 
that these things become the primary vessels through 
which we seek extended existence. Wanting external 
goods—wanting wealth, or prestige, or rule, or any 
other possession—is not inherently problematic, either 
for ourselves or for others. Making these things one’s 
primary route to extended being, however—pursuing 
the soul’s deepest longing through having rather than 
through doing—is problematic, for it invests these 
desires with more weight than they can bear. The 
consequences of this ill-fated investment include the 





at “All passions are good when one remains their master, all are bad 
when one lets oneself be subjected to them.” 
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incitement of ever new and unfulfillable desire; ever 
greater vulnerability, anxiety, and dependence as one’s 
possessions expand (the rich fear losing their wealth, 
the powerful are dependent on those they rule [83-84]); 
and, in consequence of this vulnerability, anxiety, and 
need, an increasing inability to live in the here and now, 
which is a kind of vacating of the self, and an inability 
to act with a whole heart. Each of these consequences 
significantly diminishes existence. 

A comparison of (a) the doomed enterprise of seek- 
ing extension through possessions with (b) the success- 
ful attempt at extension recounted in the Reveries sug- 
gests an even more fundamental distinction than the 
one between having and doing: In Rousseau’s view, 
the attempt to extend our being is doomed when it 
consists in the attempt to aggrandize or empower the 
self as it already is, i.e., to dress up or equip the current 
self, the social self so famously depicted at the con- 
clusion of the Second Discourse, with more things. In 
contrast, the attempt to extend one’s being stands some 
chance of success when it involves certain kinds of in- 
ner transformation, whether the further cultivation of 
existing faculties or the development of new abilities, 
provided that such cultivation or development does not 
incite excessive new desires. It is in this proviso that 
the difficulty and the enormity of the challenge come 
to light. Typically, the cultivation of our faculties does 
incite excessive desire; indeed, it typically leads to all 
the unfortunate consequences outlined above (exces- 
sive desire; vulnerability, anxiety, and dependence; and 
an emptying of the self). It does so because the self, 
governed predominantly by an unruly amour-propre, 
sees itself as an embattled entity whose very substance 
consists in its external goods or possessions. 

I pointed out above that the pursuit of being through 
doing often fails because it degenerates into the pursuit 
of being through having. We see now that the source 
of this degeneration lies in the way one experiences 
and conceives the essence or ground of the self. The 
self whose desire for extension manifests in the futile 
pursuit of being through having is the self that believes 
itself or its being to be constituted by external goods; 
this is the fearful self, the self governed by disorderly 
amour-propre—in a word, the Hobbesian self; in an- 
other word, one that helps illuminate Rousseau’s polit- 
ical program, the bourgeois self. Thus a healthful desire 
for extended being would seem to require either that 
amour-propre be transcended (à la Rousseau himself) 
or that it be educated in such a way that one looks 
to a larger community as the source of one’s identity 
and ground of one’s being. The latter course, which 
would lead the individual to seek extended existence in 
a context of devotion to others, is available through par- 
ticipation in a well-constructed polity. In such a polity 
law and education would reform amour-propre by gen- 
eralizing or collectivizing it: Much of its self-seeking 
character would manifest itself on behalf of the com- 
munity, and the personal ambition of citizens would 
lead them to pursue honor through civic service. Such 
a polity, Rousseau believes, is difficult to the point of 
implausibility in the modern age. Nevertheless, some 
degree of reform does seem to be possible and worth 
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attempting, as witness his efforts in Poland and Corsica 
(Rousseau 1986a, 1986b). 

A third course is available as well, a kind of mid- 
dle course between the rare transcendence of amour- 
propre and its radical reform through politics. That 
course is the life of devoted domesticity of the sort for 
which Emile is educated and which, to varying degrees, 
we find elsewhere in Rousseau’s work (e.g., the idyllic 
life at Clarens in Julie and the rustic life of the free pea- 
santry referred to in various writings). This life would 
be characterized by a moderation of amour-propre, 
both quantitatively and qualitatively: Amour-propre 
would become less “large” and powerful a force in the 
soul, and would be generalized or extended to the fam- 
ily as a whole, so that relations within the family would 
be free of fractious self-seeking. In the best of circum- 
stances, wherein the individual feels him- or herself 
relatively well off, amour-propre would be “humane 
and gentle” (Rousseau 1979, 235). Like the political 
solution (though to a lesser extent), this course has the 
weight of modern sensibilities against it, in Rousseau’s 
view. It is still attainable for those so inclined, however, 
and it can be encouraged, or at least protected where 
it already exists, through political means 2 


BETWEEN EROS AND WILL TO POWER 


On the basis of the analysis I have presented, it is pos- 
sible to offer a few reflections on Rousseau’s thought 
from a broader, comparative perspective. Early in this 
paper I suggested that, in positing the existence of a 
single, comprehensive good and a multiply expressed 
desire for that good, Rousseau is similar to Plato. We 
have now encountered an additional affinity between 
the two philosophers, for in holding that there are good 
and bad and better and worse ways of trying to extend 
our being, Rousseau effectively attests to something 
like a natural hierarchy of ends and desires. In neither 
thinker’s case is the hierarchy easily known or simply 
expressed, yet each appeals to nature as the ground 
of judgment regarding the human good. Plato teaches 
nothing if not the difficulty of apprehending human 
nature, a difficulty that stems from the fact that, as part 
of the whole, the human being cannot be fully known 
without knowledge of the whole. Nevertheless, Plato 
points to nature as the appropriate standard for a sound 
philosophic anthropology. 


32 My sketch of Rousseau’s three “solutions” must for reasons of 
space remain just that. But one further point ıs too important 
to go unmentioned: Each of three specified routes to a new self- 
understanding entails, at its core, a new stance toward death, i.e., 
an overcoming of the desperate desire to resist the mevitable, this 
desire being the chief incitement if not the animating nerve of amour- 
propre. As Christopher Kelly has noted, Rousseau seems to present 
three different ways of dealing with the problem of death: an accep- 
tance of necessity, à la the savage; a forgetting of death, such as we 
see in Rousseau’s reveries; and an equanimous acceptance of death 
arising from so great a fulfillment of our being that we ask nothing 
more from life and are ready to die. Perhaps the list ıs even longer 
than this, for the citrren arguably “escapes” death by imagining that 
the nation will live forever. 

* Plato’s presumption of the existence and (in principle) the knowa- 
bility of nature is implied by the entirety of his corpus, which would 
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Rousseau’s appeal to nature is even more complex. 
How can I suggest that Rousseau appeals to nature at all 
when he endorses several different and even opposing 
versions of the good life, one of which in fact is charac- 
terized by its being denatured (1979, 39-40)}—and 
when, consequently, we can specify the features or re- 
quirements of the good life as Rousseau sees it (Le., the 
good life as such, including all versions) only in the most 
formal language? (Those features or requirements, we 
recall, are psychic unity and absence of inner conflict, 
which together constitute what I have called the “nega- 
tive” side of the project of the good life, along with the 
development and employment of the various faculties 
of body and soul, so long as these do not come at the 
‘cost of the first two features.) Substantively, the various 
versions of the good life entail the development and 
employment of different faculties in pursuit of differ- 
ent ends. These differences, however, are not the whole 
of the story, and they should not blind us to the com- 
mon elements. There is in fact a substantive common 
ground between these lives, consisting, for example, in 
wholesome sociability.* Moreover, and more impor- 
tant for the present argument, the formal requirements 
themselves have definite and unalterable substantive 
implications, Even if one must speak largely in formal 
terms when speaking generically of Rousseau’s multi- 
ple versions of the good life, the requirements of living 
well, of enlarging one’s being, whether for the citizen 
or for the person trying to lead a more natural life, are 
definite and substantive, so that a Rousseauan thera- 
pist, as it were, taking account of one’s aspirations and 
current condition, could in principle advise each of us in 
anonarbitrary way regarding the best way to live. True, 
the therapist’s advice would vary according to which 
type of good life one is attempting to approximate, and 
would in effect appeal to a few (though only a few) 
different hierarchies of ends; but these ends are sub- 
sidiary to and derive from universal and nonnegotiable 
requirements of human well-being. What should one 
call this if not a normatively prescriptive human nature, 
however indirect the prescribing? 

Nevertheless, Rousseau still seems to differ from 
Plato—from the “standard” reading of Plato, at any 
rate—in two decisive regards, one concerning (a) the 
character or content of the good and its “location” vis- 
a-vis the self, the other concerning (b) the consequent 
character or phenomenology of the desire for the good. 
With respect to both of these considerations, Rousseau 


on eee ee 
be rendered mcoherent without the presumed posnbility of appre- 
hending nature. For mstances in which nature is overtly appealed to 
in ways that speak both to its necessity and to 1ts opacity, or at least to 
the misuses to which the concept 1s put, see Republic (1968, 369c-72b, 
451c-66d) and Gorgias (1998, 499e-500a). 

H See Marks 1998 for a very fine elaboration of this point. 

35 Compare to Aristotle’s (1984, 118) likening of the legislative art 
to that of the sports trainer. 

% That nature’s prescriptions are indirect does not prove that nature 
is incoherent or only a fichon, but only that philosophy 1s necessary. 
I have elsewhere (Cooper 1999, 37-65) offered an extended argu- 
ment contending that each of Rousseau’s versions of the good hfe is 
grounded in nature and that his conception of nature is not merely 
formal. For an even stronger version of such an argument, see Marks 
1998. 


seems to occupy a position somewhere between Plato 
and Nietzsche, who too suggests that human activity 
is driven by a single, comprehensive and multiply ex- 
pressed desire or will (Nietzsche 1954, 226; 1966, sec. 
13, 36) but who depicts both the character of the desired 
object and the character of the will itself in terms that 
he understands to be radically opposed to Plato’s public 
teaching. 

(a) Rousseau’s “existence,” attainable as it is through 
numerous means and signifying a feature of experience 
rather than something independent of the self, is more 
subjective a thing than what Plato’s good is typically 
understood to mean. (Few readers claim with confi- 
dence, let alone with credibility, to have understood 
the nature of the good as treated by Plato; but most 
would agree that, as the source of both being and in- 
telligibility [Plato 1968, 504d-08b], Plato’s good has a 
power and relation to human life, and thus a claim to 
human regard, that Rousseau’s “existence” does not 
have.) By the same token, the fact that existence can 
be enlarged only through a finite range of lives or ways 
of being, such that some of the most popular means 
through which it is sought only serve to diminish it, 
renders it in some ways less subjective, or more be- 
holden to the stubborn demands of a prescriptive na- 
ture, than Nietzschean power is typically understood 
to be. 

(b) Precisely because it is a feature of experience 
rather than an objective (even if mysterious) thing that 
lies beyond ourselves, enlarged existence cannot be 
thought to pull or attract us. In this, too, Rousseau 
seems decisively different from Plato. Whereas Plato 
seems to suggest that the Good exerts an erotic attrac- 
tion, a pull, Rousseau seems to portray the desire for 
extended being and the self-love of which it is an ex- 
pression as an intrinsically directionless, nonteleologi- 
cal push, a kind of inchoate expansiveness or overflow- 
ing. Rousseau may well hold with Plato’s suggestion 
that the good is that for which we do all that we do, but 
the subjective way in which he conceives the good nec- 
essarily results in—or perhaps follows from, but in any 
case “goes with”—a very different understanding of 
the phenomenology of the desire for the good. Platonic 
eros admits of innumerable forms, but common to all of 
them is the experience of attraction to something beau- 
tiful outside the self. Indeed, the experience of intense 
eros is marked by self-forgetting. The lover loses him- 
or herself in the beloved. Rousseau, in contrast, while 
he does acknowledge that the desire for extended exis- 
tence manifests itself in various kinds of attraction and 
even employs the metaphor of magnetism (see the ex- 
tended quote from the Dialogues above), nevertheless 
does not portray the desire for extended being as essen- 
tially a self-forgetting attraction to a beautiful object. 
The magnet metaphor, with its absence of beauty and 
with its essentially “horizontal” character—there is no 
sense of uplift or transcendence—is instructive in this 
regard. Here too, as he distinguishes himself from Plato, 
Rousseau seems to land somewhere on the way to 
Nietzsche, whose account of will to power and of the 
origin of human action emphasizes the psychophysics 
of the actor, ie., his need to discharge energy, over 
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and against the particular attraction or meaning of any 
given object (Nietzsche 1974, 315-16).7 

Then again, if we put aside the passage from the 
Dialogues—a passage that, as an attempt to divide 
the whole of human motivation into a few categories, 
is necessarily highly abstract—and consider instead 
Rousseau’s accounts of the experience of healthy souls, 
we remember that Rousseau does allow both for the 
attraction of beauty and for self-forgetting. Emile’s love 
for Sophie comes immediately to mind: Explicable in 
terms of the desire for extended being, it is even more 
easily explicable in terms of eros. Rousseau’s asocial 
reveries, with their sweet self-forgetting, also come to 
mind, even if the self-forgetting was not the result of 
attraction to a beautiful beloved; and the self-forgetting 
may have been the result of absorption in the beauty 
of nature or the whole, in which case it seems not so 
very different from the purportedly erotic experience 
of philosophy as Plato (1968, 484a—90b; 2001, 210a— 
12a) presents it. This is not to say that the distance 
between Rousseau and Plato disappears. There is still 
the sticking point of the status or “location” of the good, 
i.e., whether it lies beyond the self or somehow within 
it. It does seem to suggest, though, that the distance is 
smaller than has so far seemed to be the case. 

The distance is perhaps even smaller yet: for if 
Rousseau emerges from close scrutiny as somewhat 
more of a Platonist than he had appeared, Plato argu- 
ably closes some of the distance from the other side. 
Here I cannot even sketch what would need to be an 
elaborate case. I will suggest, though, that a reading 
of the phenomenology of eros as depicted by Plato, 
i.e., an investigation of what, on Plato’s treatment, eros 
actually seeks, reveals something not so different from 
Rousseau’s desire for extended being. For that mat- 
ter, the same may well be true of Nietzsche and will to 
power if one considers that, despite ostensibly eschew- 
ing Nature or any other source of value or valuation 
beyond the human will, he somehow lands at the side 
of Plato—and Rousseau—both in his ranking of the 
philosophic life and, at least somewhat, in his explana- 
tion of the goodness of that best of lives. The nature of 
a desire is most clearly revealed in that which would 
most satisfy it. If what would most deeply satisfy eros 
(Plato), the desire to extend our being (Rousseau), and 
will to power (Nietzsche) is the activity of the philoso- 
pher, then, however differently one might understand 
that nonvity an important similarity would seem to be 
indicated. 





37 Nietzsche’s will to power is characterized precisely as an eruption 
rather than a drawing-out or -up. The eruption always takes some 
direction or other and can even be sublimated or spiritualized, but 
nature does not direct our inchnations toward any such end, let alone 
toward the Good 

3 I would consider these suggestions regarding the similarity among 
Plato, Rousseau, and Nietzsche too httle developed to merit mention 
here but for three facts. First, to develop them adequately would 
require as much analysis and exegesis of Plato and Nietzsche as this 
paper has provided regardmg Rousseau. Second, it is my intent to 
undertake just that: The present paper is to be a part of a series 
of studies addressing Platonic and post-Platonic conceptions of eros 
and its successors. Finally, even without further evidence, I do think 
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CONCLUSION: ROUSSEAUAN DESIRE 
AND ROUSSEAUAN RHETORIC 


Before ending this investigation let us take note of one 
other general distinction between Rousseau’s “good” 
and “bad” ways of pursuing extended existence—a dis- 
tinction so sharp that it might seem to challenge my 
premise that for Rousseau there is such a thing as a good 
(meaning healthful and prosocial) desire for extended 
existence. I have suggested throughout this paper that 
Rousseau imputes “the desire to extend our being” to 
all sufficiently strong or energetic human beings, not 
only to those who pursue property or power or repu- 
tation but also to those (such as himself) who have de- 
tached themselves from these passions. There can be 
no question about the former. In his heart of hearts, 
as we have seen, the pursuer of property or power or 
reputation—the “civilized man”—wants to be “the sole 
master of the universe”; he resents “everything that 
by being something prevents [him] from being every- 
thing” (Rousseau 1964, 195; 1990, 112). Yet we see no 
such avidity and agitation in Rousseau. He never, to my 
knowledge, voices a felt need to be more than he is; and 
neither, for that matter, does Emile. On the contrary, 
Rousseau demonstrates contentment and equanimity 
(at least during the three quarters of the time that he 
is not overtaken by amour-propre), and of himself and 
Emile he states, “No one knows better than we do how 
to keep in our place, and no one has less desire to leave 
it” (1979, 467). Might it not be the case that Rousseau’s 
extraordinary expansion of his being occurs without 
being preceded by—even because it is not preceded 
by—a desire for expansion? And might that be another 
reason for the overwhelmingly negative thrust of his 
thetoric on the subject of wanting to be more than we 
are? 

The answer to both questions, it seems to me, is no, 
albeit a qualified no. That Rousseau chooses to cul- 
tivate certain experiences, that he devotes himself to 
botany and reverie, is sufficient to establish that he de- 
sires the content of those experiences, and there can 
be no denying that that content consists, in the ways in 
which we have seen, in an expansion of being. That he 
undertakes this activity without avid desire or agitation 
does not prove that no desire is operative. What it sug- 
gests, simply, is that the desire for an experience that is 
gotten by doing something that is within one’s power 
has a different character, a calmer and more assured 
character, than the desire to acquire something exter- 
nal to the self. Nevertheless, the questions have a real 
basis, and they point to an important truth. For while it 
may be true that, like everyone else, Rousseau is moved 
exclusively by passion or desire, his success is as much 
a result of what he does not desire as of what he does 
desire. Declining to pursue extended being through the 
acquisition of external goods is perhaps the crucial step 
in the progress toward expanding one’s being, more 





that the reader who is reasonably well acquainted with these three 
philosophers will be able to appreciate the family relation that I have 
pointed out. Whether this is a happy family, however, 1s not so easy 
to see. 
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crucial even than the positive step of conceiving one’s 
identity in more healthful terms. This is so not because 
the positive step is less important but because it may 
well follow the negative step as a natural consequence. 
If self-love and human being really are by nature ex- 
pansive, then closing down the external route may well 
force the selfs expansive energies inward (using that 
term loosely, to refer to the cultivation and exercise of 
one’s faculties for their own sake), where they could 
succeed in enlarging one’s being. This, arguably, is the 
shared source of both Rousseau’s and Emile’s happi- 
ness. 

Thus the first question points to a truth: It would 
be incorrect to say that Rousseau’s extraordinary ex- 
pansion occurs because it is not preceded by a desire 
for expansion (it is preceded by a desire for expan- 
sion), but it is fair to say that the expansion occurs 
because it is not preceded by a certain kind, the most 
common kind, of desire for expansion. And (to take 
up the matter of the second question) that would be 
reason for the harsh rhetoric Rousseau aims against 
the desire to be more than we are. Rousseau’s teaching 
about the good is not always good to hear. Most of us, 
“always outside of [ourselves]” and believing our being 
to consist in external goods, would probably respond to 
a promise of extended being only by intensifying our 
ill-directed pursuits. More needful than the truth about 
being—and the proper preparation for the safe use of 
that truth—would be a program of moral reform that 
reshapes people’s self-understanding and aspirations. 
Such a program is precisely the legislative agenda of 
Rousseau’s political philosophy. 
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Realism, Narrative, and Happenstance: Thucydides’ Tale of Brasidas 


THOMAS HEILKE University of Kansas 


corealism and some versions of realism seek to furnish nomothetic theories of the international 
Nove at the same time that they also strive to prescribe policy for political leaders. Insofar 
as practical advice is insufficiently articulated by means of either nomothesis or the structural 
theoretical framework that (neo-)realist paradigms supply, these two aspirations seem contradictory. 


This essay is an examunation of what contemporary realism 


and, especially, neorealism require to make 


practical wisdom available for practitioners. It argues that narrative, which is exemplified in the so-called 
classical realism of Thucydides, remains a crucial component of practical realism and neorealism. 


ealism and, particularly, neorealism both hold 

to a pair of seemingly contradictory aspirations. 

On the one hand, modern realist and, especially, 
neorealist theorists seek to furnish theories of the inter- 
national system that establish law-like axioms concern- 
ing the nature of that system (Kegley 1995, 27; Waltz 
1979, 65-67, 116-17, 127, 186-87, 1995, 71ff.). On the 
other hand, both also strive to prescribe policy for polit- 
ical leaders. Accordingly, realism and neorealism must 
be understood not merely as academic “theories of in- 
ternational relations,” but also as attempts to acquire 
a kind of “practical wisdom” concerning interstate or 
international matters. What is meant by practical wis- 
dom is that realists and neorealists (including the most 
ardent neorealist defender of realism as a scientific the- 
ory, Kenneth Waltz) ultimately are interested in com- 
municating their ideas to practitioners of politics and 
prescribing to such practitioners the best course(s) of 
action.! The seeming contradiction between these two 
aspirations comes to view when we consider that practi- 
cal wisdom is insufficiently articulated either by means 
of positing law-like generalizations (nomothesis) or 
the structural theoretical framework that the realist 
paradigm supplies. Appearances notwithstanding, the 
search for a nomothetic theory of world politics hinders 
neither realists nor neorealists from attempting to of- 
fer practical advice. It is therefore valuable not only 
to consider the many and important theoretical and 
doctrinal differences between various realists and neo- 
realists, but also to consider the differences in the modes 
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1 Thus, according to Kegley (1995, 8), “a theory of international 
relations needs to perform four principal tasks. It should describe, 
explain, predict, and prescribe.” In a more strongly moral tone, Waltz 
(1954) proposed in his early work that “to explain how can be 
more readily achieved [which, he unplies, we all desire] requires an 
understanding of the causes of war” (2). For a critique of neoreahsm 
from the perspective of policy prescriptions, see Holsti 1995, 57-58. 
For purposes of this argument, realism can include any of the four 
types that Doyle (1997, 41-193) identifies. 


in which their theories and doctrines can be delivered. 
More specifically: What, if anything, must we add to 
nomothetic statements and structural theories to pro- 
vide practically useful (neo-)realist “policy” prescrip- 
tions, and why?” 


REALIST WISDOM AND MORAL WISDOM 


My argument is that the practical wisdom of either re- 
alism or neorealism must, by its nature, be commu- 
nicated not merely through axioms, but also through 
narrative, and that an example and justification of the 
latter kind of communication may be found in Thucy- 
dides’ realism as it is revealed in his History of the 
Peloponnesian War.? Nomothetic statements and theo- 
retical frameworks remain insufficient. Two objections 
against this claim immediately come to mind. First, 
practical wisdom seems a questionable term to use in 
the context of realist or, especially, neorealist theories 
for two reasons. In the first place, realism (and here I 
include neorealism) has been described on the moral 
side as “skepticism regarding the applicability of eth- 
ical norms to international policy” (Forde 1992, 373). 
If we understand wisdom in a Platonic sense as the 


2 A leading critical alternative to the realist and neorealist schools, 
constructivism, is “interested in the construction of identites and 
interests” in the international arena (Wendt 1994, 385). Because such 
identities are constructed in large part through narratives, we might 
expect narrative to appear more strongly in constructivist theory. The 
renewed emphasis on culture and formative ideas in constructivism 
would lead to the same expectation, since both are either formed in 
narrative or embedded in it. In both cases, however, this expectation 
remams unfulfilled at present (cf Wendt 1999, in which no mention 
of narrative appears), perhaps because constructivists like Wendt are 
interested exclusively in explanation, not practical advice, and per- 
haps because their theoretical commitments concerning constitutive 
and causal science lead them to overlook or even reject narrative as 
a core phenomenon for study and a core constitutent of explanation 
itself. For example, Wendt’s (2001) most “pragmatic” article leaves 
an unarticulated opening for narrative inquiry, but no more. Lebow’s 
(2001) ingenious treatment of Thucydides as a constructivist analyzes 
the narrative nomothetically with due sensitivity to the form of the 
narrative itself (549) but without engaging in narrative analysis. As 
constructivists move from analyms to prescription, one may expect 
problems of narrative to press on them more explicitly. A renewed 
appreciation for narrative as a politcal phenomenon worthy of at- 
tention 1s apparent m the political studies of protest movements and 
other domestic phenomena, but that work 1s tangential to my con- 
cerns here (see Brysk 1995). 

3 For purposes of this account, I take Thucydidean realism to be, 
broadly speaking, what Doyle (1997, 41-92) takes it to be, and I take 
Doyle’s description, broadly speaking, to be accurate. 
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possession of practical moral knowledge concerning 
the relationship between a concrete particular and a 
possibly knowable whole, (neo-)realist moral skepti- 
cism seems to exclude such wisdom as a possible human 
virtue. Yet realists such as Machiavelli, Thucydides, and 
Hobbes—none of them “Platonists”—explicitly sought 
to teach us a kind of practical wisdom or prudence con- 
cerning practical matters* Without assuming Platonic 
categories, even this kind of wisdom implies an inkling 
of a knowable whole within which particular words 
and deeds of a person or a city must take place. At 
a minimum, a wise person must understand the wider 
consequences of particular words and deeds. For such 
realists as Thucydides and Machiavelli, then, excellent 
men are above all men of practical (not understood 
“Platonically”) wisdom. They know what to do in accor- 
dance with the requirements of the specific situation. 
‘Traditionally, wisdom has included above all the ability 
to distinguish between right and wrong, including the 
right (or wise) and wrong (or foolish) use of power 
(Strauss 1964, 7). But the moral skepticism of realism 
and neorealism forces us to distinguish between prac- 
tical wisdom concerning traditional moral matters— 
whose possibility the (neo-)realist may deny—and 
practical wisdom in “nonmoral,” but political matters— 
whose possibility the (neo-)realist affirms and hopes to 
teach. 

Second, this separation from traditional moral wis- 
dom seems to make realist and, especially, neorealist 
wisdom merely a kind of technique (Forde 1992, 373; 
Wiggins 1980, 221-40). Realism and neorealism do not 
have an account of a final definitive moral human end 
toward which we deliberate and act. Instead, human 
ends are comprehended entirely within the field of the 
material goods and power necessary for corporate and 
individual survival and material flourishing. If gaining, 
keeping, and maintaining power are the best we can do 
concerning questions of the good or the bad, “wisdom” 
may seem an odd term to employ here, especially since 
the means of communicating moral wisdom would not 
seem to be commensurate with the means of commu- 
nicating realist “know-how” of what to do. And yet, 
(neo-)realist deliberation about ends and means re- 
garding power could be described in a manner similar 
to the way in which Aristotle described deliberation 
about moral matters. For example, practical wisdom, 
deliberation, and choice are among the elements of 
right action. Speaking anachronistically, Thucydides is 
agreed with Aristotle that along with these elements 
education and natural endowment secure right action. 
And though human well-being is considered by the 
(neo-)realist not in terms of eudaimonia or happiness, 
but in terms of obtaining material goods, material se- 


4 Compare Hobbes 1968, 82-84; Machiavelli 1980, 1-4 (“Epistle 
Dedicatory”); Machiavelli 1996, 5-6 (“Preface”); 123-25 (“Pref- 
ace”); and Thucydides 1982, 1.22.4. 

5 Sorabji (1980) and Wiggins (1980) provide a full examination of 
these elements of Aristotle's ethics I take education to be an impor- 
tant thrust of Thucydides’ History (1.22 4); we also find it important 
in the writings of classical realists such as Machiavelli. (See the dedi- 
cations and mtroductions of both The Prince [1980] and the Dis- 
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curity, and the reputation (honor) that comes with the 
ability to do so,° deliberation about material well-being 
is in this way parallel to Aristotelian deliberation about 
eudaimonia (Wiggins 1980, 227). 

The second major objection to my argument has to 
do with the necessary bases for practical deliberation. 
Is narrative, in fact, one of them? Neorealists, repre- 
sented most articulately by Kenneth Waltz, seem to 
argue that the behavior of states, which are the only 
significant actors in world politics, is driven by the sys- 
temic imperatives defined by the distribution of capa- 
bilities or the perception of such distribution among 
the states of the world. To understand this principle 
is to know how the international system works: Fur- 
ther narration is superfluous.’ The most widely known 
contemporary version of realism, articulated by Hans 
Morgenthau, may give a similar impression that realist 
practical wisdom or prudence also extends no further 
than to short-hand formulae by which we may calculate 
the logic of interstate (or interpolis or international) 
affairs and then measure our actions by the necessity 
of this logic. Morgenthau’s theoretical framework looks 
something like this: (1) The political behavior of states 
(and individuals) is governed by objective laws that are 
grounded in human nature; (2) this behavior is first and 
foremost determined by interest defined in terms of 
power, (3) in their manifestations, neither interest nor 
power remains either perennially stable or eternally 
identical; (4) although political action may have moral 
significance, morality and the necessities of political 
effectiveness exist in tension at best; (5) the “moral 
aspirations” of a particular political community are 
not identical to universal moral laws, if such exist; and 
(6) the political sphere of human activity—in which the 
principle of interest defined in terms of power rules the 
day—is autonomous (Morgenthau 1954, 1-11). Theo- 
ries based on various or all of these premises—systems 
theories of international relations, reductionist theories 
of interstate behavior, various rational actor models, 
and the like—presume to give us a measure of pre- 
dictability concerning the outcome of interactor polit- 
ical behavior. 

Despite this famous list of realist principles, 
Morgenthau remained skeptical about the sufficiency 
of an axiomatic list for practical guidance (1946, esp. 
204-23). In contrast to the neorealist presentation of 
Waltz, Politics among Nations contains many historical 
examples. Thus, Waltz’s (1979) Theory of International 
Politics wants to steer us in the direction of logical ax- 
ioms (65—67, 116-17, 127, 186-87) and ignore narrative 
as a source of information altogether (Garst 2000, 78), 
while Morgenthau lists such axioms,® but then provides 


6 Of the realist theorists, Hobbes (1968) most explicitly related honor 
to power. “To pray for another, for ayde of any kind, 1s to Honor, 
because a signe we have an opinion he has power to help; and the 
more drfficult the ayde 1s, the more is the Honor. To obey 1s to Honor; 
because no man obeyes them, whom they think have no power to 
help, or hurt them”(152) See also Thucydides 1982, I11.39 5. 

7 The refinements within neorealism, as, for example, in the typology 
of Van Evera (1999, 7-11), do not materially affect the argument here 
5 The six items in Morgenthau’s list no doubt would have a different 
meaning for him than for Waltz. 
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us with an argument sprinkled throughout with anec- 
dotes. Nevertheless, Morgenthau does not integrate 
these stories into a narrative whole, and their narra- 
tive function as a way of delivering practical (real- 
ist) wisdom is understated and undertheorized, if it is 
considered at all. The stories and illustrations even in 
Morgenthau’s realist analysis of politics among 
pations—to say nothing of Waltz’s much more sparse 
neorealist argument—do not do the kind of work that 
I argue here they must do in order for realist practi- 
cal wisdom to become fully apparent and useable (cf. 
Johnson 1993, 62, 70-71). 

Statements by Thucydides and the characters of his 
History in support of several of the realist theses enu- 
merated above are readily available (1982, I.76-1.77; 
V.89, 105.2, 107). Machiavelli similarly provides us 
with a clear political calculus in the form of a num- 
ber of maxims of political action that are consis- 
tent with the aforesaid realist principles (Parel 1972, 
7-12, 29-32). Likewise, Thomas Hobbes writes a trea- 
tise that will teach his readers how one might “gov- 
ern a whole Nation,” which includes a view to for- 
eign powers (1968 [1651], 83, 227-28, 230-31, 257, 272, 
375). All three writers, moreover, believed that their 
works illustrated transhistorical principles based on a 
knowledge of a human nature and a nature of po- 
litical things that was not historically contingent, but 
fairly constant, and that was not governed by uni- 
versal moral laws, but by the autonomous workings 
of the logic of political power. Human beings would 
behave in similar ways in similar circumstances of 
power relations (Hobbes 1968, 81-83, chap. 46, 727-28; 
Machiavelli 1996, 1.39; Thucydides 1982, 1.22 4; cf. II. 
82.2). 

Yet while all three do provide general principles of 
political action and human nature, none simply writes 
out a list of maxims for us to follow. Thucydides writes a 
History, which is primarily an account of human words 
and deeds. It is the ultimate story that, better than any 
other, illustrates “the human thing” (1-22). Machiavelli 
delivers many maxims and proverbs to guide action, but 
his works are largely made up of historical examples— 
. stories, To put his maxims and proverbs into action, 
we require examples that we can follow. For Machi- 
avelli, just as the science of medicine is “the experi- 
ence of ancient physicians,” so statecraft requires for 
its right practice examples (stories) of antiquity to im- 
itate. The use of stories does not deny the possibility 
of scientific rules of action, but it illustrates “the actual 
context of [their] application” (Machiavelli 1996, 5—6, 
123-25; Parel 1972, 10). Although Hobbes (1975a, 18) 
writes careful philosophical treatises, he also translates 
Thucydides’ History, summing up the practical use of 
the History with the observation that “the narration 
itself doth secretly instruct the reader, and more effec- 
tually than can possibly be done by precept”. Even for 
Hobbes, realism must be contextually displayed: “For 
the principal and proper work of history being to in- 
struct and enable men, but the knowledge of actions 
past, to bear themselves prudently in the present and 
providently towards the future: there is not extant any 
other (merely human) that doth more naturally and 
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fully perform it, than this of my author [Thucydides].”* 
Stories give us practical wisdom in a way that is central 
to the realists’ distinct concerns. Just as neorealists miss 
the importance of the speeches that Thucydides records 
for an understanding of Thucydidean realism (Garst 
1989, 3-27), so too do they miss the role that narrative 
plays in delivering the practical wisdom of his (and their 
own) realism. The kind of narrative that Thucydides 
provides, richly reflecting the reality of international 
politics and human agency, accident, chance, and con- 
tingency within those politics, must augment any ax- 
iomatic or even anecdotal account of such politics.” 
The scope of this argument suggests an extended 
monograph, but I restrict myself to a “case study.” To 
demonstrate the argument, I examine for its illumina- 
tion of practical wisdom the story of the Spartan gen- 
eral Brasidas, who, despite his ultimate demise and po- 
litical shortcomings, Thucydides represents as among 
the most excellent of the Spartans. This approach it- 
self seems open at the outset to at least two objections. 
First, an extensive and growing literature that explores 
the literary techniques Thucydides used to answer the 
question “What happened?” finds in his mode of pre- 
sentation a series of patterns, reversals, symmetries, 
deliberate asymmetries, and disjunctions that together 
reveal his thought. Participants in this literature do not 
necessarily agree with each other in either the details, 
the larger patterns, or even the general “lessons” in 
them that Thucydides intends to deliver to his readers. 
Nevertheless, all could lend support to an argument 
(although not all intend to do so) that the lessons of 
Thucydides are not found in details of what, when, and 
how so-and-so did such-and-such, but in the broad pat- 
terns of behavior, action, and response among contend- 
ing polities that together weave the fabric of the War. 
And yet, regardless how we may interpret these larger 
patterns, they are made up not merely of broad and 
vaguely defined “forces” that affect human life, but of 
the speeches and actions of individuals. Cogan (1981), 
in particular, has shown Thucydides’ intense interest in 
the role individual words and deeds play in initiating 
larger movements of people and communities and in 
determining their fates (cf. Garst 1989). Telling stories 
of individual lives and sketching broader historical or 
behavioral patterns are not mutually exclusive. 
Second, and more specifically, the Thucydidean 
case I have chosen—Brasidas—seems particularly 


9 Hobbes 1975b, 6; cf. 17, “In sum, 1f the truth of history did ever ap- 
pear by the manner of relating, it doth so ın this history: so coherent, 
perspicuous and persuasive is the whole narration, and every part 
thereof” 


10 “Counsel, woven into the fabric of life as it is lived, ts wisdom” 
(Benjamin 1972, 442). My argument here implies a critique even of 
Machiavelli on this score; his anecdotal method (in which the his- 
torical accuracy and practical advice of specific anecdotes are clearly 
open to question) can give nse to Inghly intelligent interpretations 
that need to consider narrative nearly not at all. See, for example, the 
muschievous interpretation by Mary Dietz (1986), on the one hand, 
and the numerological aspects of the interpretation by Leo Strauss 
(1958), on the other. 

11 Connor (1984), Cornford (1907), Edmunds (1975a), Hunter 
(1973), and Rawlings (1981) are among the important contributors 
to this tradition. 
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vulnerable to an objection that this “minibiography” 
is only a vehicle for pointing to wider concerns. In this 
vein, Robert Connor argues that Thucydides is not es- 
pecially interested in Brasidas and that he intends “not 
to affirm Brasidas’ virtues, but to lead the reader from 
a recognition of these qualities to an appreciation of 
their long-run importance.” This argument is compat- 
ible with my purpose. It is, after all, within a larger 
context that the actions of the individual are rendered 
intelligible and useful for practical lessons of conduct. 
Connor (1984, 130-31) pushes the argument with the 
claim that Thucydides’ comments about Brasidas are 
neither in praise nor in blame of him, but to emphasize 
Brasidas’ importance in the greater trends of the war. 
Here, I think, Connor overstates his case. We can isolate 
the story of Brasidas, as I have done, and find there a 
roughly complete picture of the man. Thus, to reject 
his interest in Brasidas, we must assume a prion that 
Thucydides is concerned only with larger patterns and 
that his focus is essentially on the cities, without any 
real interest in the character of Brasidas. The larger 
perspective is undeniably present, but my interest is in 
how Brasidas interacts with that context, and it seems 
to me that Thucydides also had an interest in that di- 
rection, so that critical interest in Brasidas is not off the 
mark as Connor suggests. 

Brasidas was the direct opponent of Thucydides dur- 
ing the engagements around Amphipolis and Eion 
(IV. 104-IV.107). The Athenian demos mistakenly at- 
tributed the loss of Amphipolis directly to Thucy- 
dides, for which it exiled him for 20 years, thereby— 
ironically—giving him the leisure to write his History. 
This “personal” connection to Brasidas makes it even 
less likely that Thucydides had no specific interest in 
him unless we again make Connor’s a priori assump- 
tion, but it also raises a new complication: Is Thucy- 
dides’ account of Brasidas self-exculpatory? There is 
scant evidence in the narrative to support such a charge: 
Thucydides was careful to lay out pertinent details of 
the campaign that his readers could check indepen- 
dently. Furthermore, while a case study of the career 
of Brasidas reveals important lessons of the History, 
Thucydides does not elevate the role of Brasidas in the 
overall narrative either negatively or positively beyond 
what the actions and qualities of Brasidas, separate and 
apart from this “personal” connection, would seem to 
allow. 

Finally, Thucydides’ thematic treatment of Brasi- 
das parallels his treatment of another leading charac- 
ter in the History: the Athenian leader, Nicias. Both 
figures clearly display traits of the city—Athens or 
Sparta—against which they are fighting. These oppos- 
ing sets of traits are a major theme of the History. 
Brasidas displays his “Athenian” traits with success, 
whereas the “Spartan” traits—hesitation, caution, and 
superstition—of Nicias magnify the scope of the 
Athenian disaster in Sicily.“ Similarly, Brasidas is set in 
deliberate, ironic contrast with Cleon. Further (ironic) 





12 But see also Palmer 1992, 6-8. 
D While Thucydides uses somewhat different language to describe 
the career of the Athenian general Demosthenes and does not pro- 
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parallels with Demosthenes and Pericles are also pos- 
sible to discern.'4 These broader thematic contrasts, 
which space considerations force us to leave aside here, 
are additional grounds for considering the career of 
Brasidas as a source for practical wisdom. 

This essay is a reconsideration of the mode in which 
realist and neorealist wisdom in general must be pre- 
sented, taking its point of departure from a specific case 
that illustrates the complex contextual interactions of 
luck and excellence. This cannot be an objectionable 
procedure if we remain aware that traditional realists 
do not all write in the same way and that their visions 
of the world are not all identical. Indeed, as Steven 
Forde (1992) argues in his magisterial comparison of 
Thucydides and Machiavelli, it is precisely in the lit- 
erary presentation itself that one may discern crucial 
differences between various kinds of realism (381, 382- 
25; cf. Palmer 1989), 


BRASIDAS: CHANCE [TYCHE] AND 
EXCELLENCE IN COMPLEX CONTEXT 


What does the story of Brasidas tell us about the world 
that a set of axioms or maxims would not? Brasidas 
seems an unlikely character to choose as a paradigm 
of excellence in practical wisdom. Not only may his 
role in Thucydides’ larger narrative be open to some 
question, but also he dies without realizing his goals, 
which are themselves somewhat questionable. The ex- 
tent and nature of Brasidas’ “failure,” however, are less 
than clear, and this ambiguity is itself an aspect of the 
narrative that is meant to teach us something. 
Thucydides’ story of Brasidas is part of a larger one— 
the story of the greatest motion [kinesis] or of the great- 
est war ever. It was the war between the Athenians and 
the Spartans and their allies, whose underlying cause 
[prophasis] was the Spartan fear of Athenian imperial 
expansion (1.23.6, 1.88, 118.2). When we first encounter 
Brasidas in Thucydides’ History, it is the first year of the 
war (432/1 B.C.E.). The Thebans, allies of the Spartans, 
have made their initial, unsuccessful attempt to subdue 
Plataea in the first overt act of war. Both sides have 
begun all-out preparations for war, and approximately 
80 days after the first Theban attempt on Plataea, the 
Spartans invade Attica (1.7.1, 0.19.1). By invading and 
ravaging the Attic plain, the Spartans hope to entice the 
Athenians into an open, general armed conflict, which 
both the Athenian leader, Pericles, and the Spartan 
generals estimate that the Spartans would surely win 





vide his own estimation of the man as he does for Brandas, one 
might also lay these two careers alongside one another to good 
comparative—perhaps even tronic—effect. Tracing the career of De- 
mosthenes through the Aetolian campaign, the actions at Pylos, and 
the expedition to Sicily 1s mstructrve, both by itself and in comparison 
to other careers, as 1s ing those careers among each other. 
The Syracusans butcher Demosthenes along with Nicias after the 
final Athenian surrender in Sicily, despite the efforts of the Spartan 
general Gyllipus specifically to save them both (VII 86). 

14 His overstatement regarding Thucydides’ disinterest in Brasidas 
nothwithstanding, Connor (1984) ts convincing on the structural roles 
that Brasidas plays ın the narrative (126-31, 140). Such roles do not 
seem to me to negate other levels of authonal interest. 
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(I1.13.2-3, 11.12.6-8).> In response to the Spartan in- 
vasion (II.23.1), a sizable Athenian marine expedition 
makes landings along the Peloponnesian coast and rav- 
ages the proximate territory, thereby avenging the in- 
vasion and demonstrating to the Spartans the potential 
costliness to them of the war (I1.25.1; Kagan 1974, 58). 
Enter Brasidas. 

The reader’s first encounter is unique in at least two 
respects. First, Brasidas emerges suddenly and by hap- 
penstance in the narrative. Having “cruised about the 
coasts and ravaged the country,” the Athenians landed 
at Methone, and “it so happened” [etyche] that Brasi- 
das, son of Tellis, a Lacedaemonian, was in command 
of a guard for this district. Etyche—“he/it chanced”— 
is the first word concerning Brasidas. As with the hap- 
penstance of birth, Brasidas does not choose his own 
appearance into the deeds [erga] of the narrative, but 
“happens to be” in the right place at the right time, and 
he has an immediate military success. Hearing of the 
Athenian attack, he hurriedly and with little apparent 
planning throws himself into the threatened city, secur- 
ing it against the distracted Athenians. While his fame 
or glory may begin with the lucky occurrence of a mili- 
tary encounter, Brasidas possesses the necessary qual- 
ities for taking hold of happenstance and succeeding. 
His “daring exploit” wins him the “thanks of Sparta,” 
which is an honor both generally and specifically: He 
is “the first officer who obtained this notice during the 
war.” 

Second, the reader’s first encounter with Brasidas is 
set off in sharp contrast with Thucydides’ description 
of the Spartan general Archidamus only a few para- 
graphs earlier (11.18, I[.20). The verbs and adverbs tell 
the story. Archidamus with his large force wastes time 
delaying, loitering, proceeding slowly, procrastinating. 
He plans carefully (1.20), but his slowness earns him 
the “gravest censure” of the Spartans (II.18).’° 

Thucydides’ introductions of his characters often 
provide clues to what he thinks of them. Nicias, for 
example, is a central character in the History whose hes- 
itation in Sicily leads to the annihilation of the Athenian 
army; he is introduced to us in seven separate actions 
of a wide variety before we first hear him speak in a 
very cautious speech (Strauss 1964, 202). Thucydides 
makes sure to let ug know that Nicias is a man not of 
hesitation, but of action and moderation. His hesita- 
tion and moderate speech appear not to be a display of 
weak character but are prudential and circumstantial, 
and we know this, because his call for moderation is 
preceded by seven actions that show that his modera- 


15 For a general but precise discussion of the vanous strategies and 
issues involved, see Kagan (1974, 17-69). 

16 We find a parallel story in the high admiralship of Alcidas m the 
fifth year of the war (II 28-1133) Edmunds (1975a, 80) points out 
that Thucydides attributes “intelligence” [to zuneton] to Archidamus 
and Brasidas, among others, and that only Archidamus 1s mentioned 
as having both this quality and moderation in combination. In fact, 
Thucydides does not attribute these to Archidamus but states that 
he was “reputed” [dokon einai] to have them (I 79.2). Archidamus’ 
deeds then put this reputation to the question. Thucydides’ attitude 
toward reputed events and characteristics is demonstrated in 120 and 
VL5460. 


tion is neither cowardice nor laziness. Similarly, we first 
meet Cleon, another central character, as he is mak- 
ing a speech advocating the death of the Mitylenians 
for their revolt against Athens. Thucydides introduces 
him as “the most violent man at Athens.” His violence, 
Thucydides implies, is a violence of speech and not 
deeds. In contrast to men like Nicias and Brasidas, 
Cleon would prefer to leave deeds to others (TV.27- 
IV.28).!7 In this vein, Brasidas is a man of action, and 
he is so introduced. Even his speeches are not delivered 
in debate, but are exhortations or declarations with the 
express purpose of accomplishing an immediate, pre- 
determined end. The action of Brasidas, moreover, is 
closely linked to the luck of his appearance; his success 
is bound up with the audacity and swiftness with which 
he takes advantage of tyche. 

Brasidas’ success at Methone was not decisive, since 
the Athenians continued their raiding (1.25.5, 1.30.1), 
but it was, as Gomme (1954, 130-32) notes, a foreshad- 
owing of things to come. Not for the last time, Brasidas 
had surprised the Athenians and upset their plans. The 
incident at Methone is the first of five that make up the 
first half of Thucydides’ “biography” of Brasidas; the 
second half is a protracted campaign in Thrace. After 
Methone, Brasidas disappears from the narrative until 
the third year of the war. He reappears in the story of 
a defeat, brought on by (bad) luck. In the third nar- 
rative, Spartan lack of resolve and fear of risk bungle 
an opportunity to take the Piraeus, Athens’ own port 
(11.93-I.94.1). In the fourth story, the disposition of 
the Spartan high admiral, Alcidas, which was similar 
to that of Archidamus—hesitant (1.27.1), slow to act 
(111.29.1), overly cautious (III.31), and politically inept 
(111.32), so that Thucydides’ illustrations of his blun- 
dering caution let one nearly think he was a coward 
(1.31.2, I1.33.1, I.69.1}—makes Brasidas’ attempts 
at prudent counseling ineffective. In the fifth episode, 
Brasidas’ minor but noteworthy part in the crushing 
Spartan defeat at Sphacteria (IV.15) lets Thucydides 
claim that of all those who took part in the battle, Brasi- 
das most distinguished himself ([V.11.3-4). In these 
first five incidents, then (which we cannot recount fully 
here), Brasidas succeeds once, and on four occasions 
he or those he is with are defeated. Until his death, 
however, Brasidas suffers serious defeat only when he 
remains part of a larger Spartan contingent and cannot 
act independently. In every case, his energetic actions 
and speech are contrasted with the more common Spar- 
tan demeanor of caution and slowness. Two of these 
incidents expressly exhibit the role happenstance plays 
in bringing success or failure to excellence. In one case 
luck is on the side of Brasidas; in the other it is not. 

These first five episodes constitute, as it were, the 
first “act” of the drama of Brasidas. Thucydides’ only 
explicit statement of evaluation about Brasidas’ quali- 
ties in this first act comes in the fifth episode, where he 
points out Brasidas’ distinction among the Spartans at 
Sphacteria. But by drawing the contrasts that he does 
in the stories, and by relating the incidents to us in 


17 On Thucydides’ possibly unfair treatment of Cleon and its basis, 
see Kagan 1981, 323-24. 
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the way that he does, Thucydides seeks to teach us 
about excellence in warcraft and statecraft. Brasidas’ 
virtues include an ability to speak, initiative and speed, 
alertness and audacity, courage, strategic skill, and in- 
telligence. The narrative shows us how these qualities 
operate in the actual (“realist”) world of power politics, 
contingency, and luck. Together, these qualities may not 
yet make a thoroughly wise man, but they may make 
an excellent one, or, at the very least, a “resplendent” 
one (Orwin 1995, 79). We find the remaining qualities 
needed to make a man of practical wisdom in the sec- 
ond half of Thucydides’ biography of Brasidas, where 
the excellence of Brasidas manifests itself in full in the 
campaign in Thrace, 

Thucydides indicates a new beginning, or a second 
act, with the reintroduction of Brasidas in the eighth 
year of the war. The bulk of Thucydides’ narrative from 
this point in Book IV until Book V.20, which brings us 
to the tenth year of the war and the so-called Peace of 
Nicias, is concerned with Brasidas. The sentence that 
begins this second story of Brasidas contains striking 
common elements with the introductory sentence of 
our very first encounter. The situation, moreover, is 
similar to the first. The Athenians have hatched a plot 
to take Megara, and they have succeeded in capturing 
Nisaea, its port. “Brasidas, son of Tellis, a Lacedaemo- 
nian, at this time happened to be [etyche] in the neigh- 
borhood” (IV.70.1). Thucydides fully reintroduces 
Brasidas, who again “happens to be” in the right place 
at the right time. He is on other business, but success in 
this affair can only bolster his prestige as he continues 
with preparations for Thrace. With the haste and clever 
strategy of the first incident at Methone, Brasidas pro- 
ceeds against the Athenians. The Megarians, embroiled 
in a factional conflict that is part of the Athenian plot, 
refuse him entry into the city, so he moves to another 
strategy: He fortifies his position and waits. Brasidas 
wins the waiting game; the Athenians, being outnum- 
bered with the arrival of Brasidas’ forces and no longer 
holding the advantage of surprise, (once again) do not 
desire a pitched battle with the Spartans and, so, with- 
draw (IV.72.4; Kagan 1981, 276-78). The campaign in 
Thrace can now proceed. The story illustrates the prac- 
tical wisdom of Brasidas: His audacity and penchant 
for quick action do not overcome his better judgment, 
so when the situation calls for patient waiting, he waits. 

According to Thucydides, Brasidas was sent to 
Thrace because he wanted to go. This desire is a fur- 
ther indication of his daring and initiative. The unlikely 
and dangerous campaign served at least two impor- 
tant purposes for the Spartans: It relieved them of the 
threat of a Helot revolt (IV.80.5), and by shifting the 
focus of military activities to the north, it might relieve 
somewhat the pressure Athenian raiding from Pylos 
and Cythera was placing on the Peloponnese (IV.80.1). 
It appears to have achieved both of these objectives 
and two more: The Spartan-induced revolt of these 
strategically critical cities from the Athenian empire 
gave them something with which to bargain when they 
eventually concluded an armistice with the Athenians, 
and in the second installment of the war (the so-called 
Decelean War), “it was the virtue and intelligence [aréte 
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kai xynesis] that Brasidas had displayed at this time— 
qualities of which some had experience, while others 
knew of them by report—that did most to inspire in 
the allies of the Athenians a sentiment favorable to the 
[Spartans]” (IV.81.2).18 

Brasidas demonstrated all of his usual qualities dur- 
ing the campaign, and his “comprehension” or “intelli- 
gence” [xynesis] and his “goodness” [agathon] proved 
him to be a wise man as well. His intelligence is dis- 
played in his ability to speak well. On his way to Thrace, 
he had to pass through Thessalian territory, where the 
democratic faction was hostile to Spartan intentions. 
Stopped by a party of the democrats, he was able with a 
clever speech to persuade them to let him pass through 
(quickly) without bloodshed. Having reached Thrace, 
he was able, again without the use of force, to persuade 
a number of the cities there to rebel against Athens. 
Brasidas’ rhetorical skill leads Thucydides to observe— 
it is unclear, Kagan says, whether with “delicious irony 
or condescension”—that he was “not a bad speaker 
for a Spartan.”!° Thus, his “seductive, though untrue 
statements” in some instances (IV.108.5), his skillful 
combination of threats and promises on other occa- 
sions (IV.86.1—2, 87.1-7), and his seeming integrity and 
measuredness” ([V.81.2, 3) led Thucydides to conclude 
that Brasidas was successful because he “in other things 
showed himself measured, and in his declarations ev- 
erywhere made plain that he had been sent out for the 
liberation of Hellas” (IV.108.2). Thucydides contrasts 
the tactful intelligence of Brasidas with the impatience 
and violence of Polydamus, the Spartan commander 
at Mende,”! whose angry behavior is instrumental in 
creating a factional conflict within the city that results 
in its fall to the Athenians, who have come to recover it 
to their empire (IV.130; Kagan 1981, 313). The outcome 
of Brasidas’ actions earlier in the campaign had been 
much different: “The towns subject to the Athenians, 
hearing of the capture of Amphipolis and of the terms 
accorded to it, and of the gentleness of Brasidas, felt 
most strongly encouraged to change their condition, 
and sent secret messages to him, begging him to come to 
them, each wishing to be the first to revolt.” Finally, the 
successes of Brasidas in Thrace are clearly contrasted 
against the concurrent mood of the Spartans, who had 
been entirely unnerved by the string of Athenian suc- 
cesses that included above all the occupation of Pylos 
(IV.55.1-4). 

The crowning achievement of Brasidas’ campaign 
was the capture of Amphipolis. Bit by bit his victories 





18 For further details of the strategic considerations in this campaign, 

see Kagan 1981, 288-94 

19 Kagan 1981, 293n110, and Thucydides 1919, IV84.2, cf. IV.85- 

TV.88. For an analyus of Brasdas’ apparent audacity in at least one 
see Orwin 1995, 84. 

Metria may be translated as “moderation,” which is preferred by 
Crawley (Thucydides 1982) and Smith (Thucydides 1919). Palmer 
(1992, 38, 130031), however, makes a convincing case for translating 
metrios as “measured,” especially in the Thucydidean context, while 
using “moderation” to translate sophrosyne (also “temperance”), 
and for recognizing the importance of the distinchons between them. 
I follow his lead here. 

21 Rawlings (1981) argues that he was not a Spartan (241-42). 
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distracted, then annoyed, then seriously threatened the 
Athenians.” Amphipolis was a key city for the Athe- 
nians: Its capture could interfere with the grain supply 
from the north. To capture it, Brasidas relied on in- 
ternal treachery and surprise, which alone would have 
succeeded, according to Thucydides, but Brasidas for 
once delayed, “vainly waiting for a demonstration on 
the part of his friends within” (IV.104.2). When his par- 
tisans did not fulfill their part of the plan, he “offered 
measured terms” to the Amphipolitans, who, surprised 
at the fairness of the terms “in comparison with what 
their fear had suggested,” gave him the city (IV.106.1). 
Unable to complete the task with the usual sort of mil- 
itary strategy, Brasidas turned to the persuasiveness of 
measured (but deceptive) speech. 

Having briefly recounted the story of Brasidas, we 
may now fairly ask, What do we learn from it that a 
straightforward list of axioms would not tell us? Thucy- 
dides delivers to us a kind of catalog of virtues (ex- 
cellences) in his narrative of Brasidas’ life. We learn 
from his story that Brasidas is conspicuous among the 
Spartans for his ability to speak (not always truthfully) 
(IV.84.2, IV.88.1, [V.108.5, IV.121.1), his valor (IV.2.4), 
his energy (IV.81.1), his apparent justice (IV.81.2), his 
measured conduct (IV.81.2, IV.108.2), his goodness in 
all things (IV.81.3), his gentleness (IV.108.3), his ini- 
tiative (0.25, V.10.5), his superior strategic abilities 
(IV.128.1), and his good fortune (IV.117.2). Thucydides 
seems to share Aristotle’s understanding two genera- 
tions later that the best sort of citizen is the one who best 
serves his city or regime.” If fear, honor, and interest 
are the principal motivations of political action, and if 
they are premised on the primary necessity of securing 
power that is not self-defeating or self-corrupting for 
the city or the state (which is therefore the chief object 
of political action), then the best Spartan is the one 
who best accomplishes the political ends of Sparta. In 
Thucydides’ estimation, it appears that Brasidas was 
such a man. His excellences ranked him among the 
most excellent Spartans. His speeches and his displays 
of measured conduct, justice, and goodness also showed 
him to be wise. Such displays, however, are not the 
same as actually possessing these qualities, a problem 
to which we will return. 


(REALIST) WISDOM AND NARRATIVE 


The nature and role of narrative—a summary term for 
all of the “things” that are “going on” at “the same 
time” in a story—have not been entirely neglected, 
but certainly underrated among political theorists and 
political scientists alike. Recent studies in a variety of 
academic fields have reestablished the importance of 
narratives to our understanding of political phenomena 
from the perspective of practitioners and scholars.” 


2 YV.108.1. For an analysis of Thucydides’ manner of telling this story, 
see Gomme 1954, 136-37 

B Anstotle 1984, 1276b16-35. Arıstotle 1s also careful to distmguish 
this sort of “best” from the morally best (1276b 16-35). Compare 
Thucydides (1919, VL54, V114) 

M Examples include the many instances in the summary article by 
Brysk (1995), Fre: 1974, and Ricoeur 1991. 
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The reasons for underestimating the role of narra- 
tive are neither obscure nor mendacious. Both political 
science and political philosophy are largely analytical 
activities. Analytical categories and the techniques of 
dialectical theorization, not stories, are the modes of 
investigation and explanation. Yet underlying both En- 
lightenment and positivist science, on the one hand, 
and the analytical dialogues and treatises of philosophy 
beginning with Plato, on the other,” is a series of base 
stories or metanarratives without which these activities 
and their written products would be unintelligible. 
Stories are not entirely superseded even in our social 
sciences. 

But why narrative? Is it fair to argue that theoreti- 
cal guidance is inadequate, no matter how theoretically 
complex or valid such guidance might be? Since inter- 
pretation itself is contingent—as Thucydides himself 
admits—and since the wisdom that comes from such 
contingent interpretation seems to be highly variable, 
does narration really provide any useful guidance? Nar- 
rative guidance, moreover, means not merely making 
sense of a “given” past, but making sense of an as yet 
unknown future, which is to say, of events that are not 
yet the subject of a narrative history. Can narrative ac- 
complish this task? 

It would seem that we do not require a story for 
the kind of catalog of the virtues I have listed. The 
list and brief explanations of its contents are all that 
is required to tell us what we should do if we wish to 
achieve success in preserving our polity in a complex 
and anarchic world in which politics are circumscribed 
by the necessities of fear, honor, and interest and af- 
fected by the vagaries of seemingly random events. Not 
so: The story itself serves to render intelligible the func- 
tion or meaning of Brasidas’ virtues—whether actual or 
apparent—in ways that merely listing them would not. 
With regard to our concern for practical guidance, to 
know what a virtue is and to know how it functions 
are to know what one looks like in practice. It is only 
in a story of how a virtue manifests itself in the midst 
of the contingencies and complex, multilayered human 
encounters that we can learn anything practical, not 
to say intelligible, about it or from it (MacIntyre 1984, 
esp. 204-20). 

While the nomothetic theories of realism and neo- 
realism may be a good start, they are insufficient for 
giving statesmen practical advice, and they are too thin 
an account of the world of political action to give schol- 
ars sufficient purchase on the context and possibilities 
of such action. Were it not for the fact that nearly all 
realists and neorealists seem to think that their theories 
should have practical effects, this insufficiency would 
perhaps be inconsequential for (academic) realism and 
neorealism. The need for stories on this practical level 
emerges most immediately out of the practical need to 
marshal personal and material resources to engage the 
world successfully. This need becomes more evident 


3 The narrative character of Plato’s dialogues does not vitate the 
claim that an analytical mvestigation remains at their dialogical core. 
26 For expositions of such metanarratives, see, for example, Milbank 
1990, Mitchell 1993, 1-18, 46-97; and Waterman 1991. 
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when we notice that two overwhelming accompani- 
ments of the principles of action and motivation in the 
world of human beings are the presence of indetermi- 
nacy or contingency, and complexity. To overcome the 
workings of contingency, happenstance, or luck amidst 
a complex necessity requires the practical wisdom that 
points to success, which is constituted by more than the 
predictive axioms that we find, say, in the theories of 
Waltz.” Narrative provides the materials that theoreti- 
cal analysis dissects. It precedes such analysis, and only 
in the context of narrative can such an analysis retain 
its coherence. Because we are contingent beings who 
are temporally constituted, we need narrative to make 
sense of our existence. Indeed, our temporal existence 
means that our lives are at root narratively formed, 
so that the narratives we render to make sense of our 
existence are but reflections of the deeper narrative— 
elusive as that may be—that is our existence. 

This centrality of narrative leads to three more spe- 
cific claims. First, Thucydides, who remains one of the 
foremost thinkers of the so-called realist tradition in po- 
litical thought, confronted these questions and treated 
them in a way that deserves a kind of attention among 
modern scholars of international relations it has not 
received. Second, narrative is the specific way in which 
the problem of human excellence as a means to success 
is illustrated and explored for purposes of practical ad- 
vice in Thucydides, and it cannot be otherwise. For if 
the wisdom of realism is, in fact, a kind of practical 
wisdom, then we may assume that the principles of 
transmitting such wisdom and of deliberating wisely 
hold true for realist wisdom as they do for traditional 
practical moral wisdom. And here again, even Aris- 
totle, although his discussion of ethics is presented in 
the form of demonstrative, analytical treatises and not 
stories, suggests that the virtuous or excellent life begins 
with a model of the good man. To live a virtuous life is 
to live as a good man would (Aristotle 1962, 1105b5— 
9, 1140a24). Such models are only partially made ac- 
cessible to us by means of lists of axioms. Ultimately, 
we require either direct, sustained observation of “role 
models” or apprenticeships under such men, or, failing 
such opportunity, we require as a substitute at least 
such stories about them as realists like Hobbes praise 
and Thucydides and Machiavelli tell, albeit in radically 
different ways and for significantly different practical 
and even moral purposes (cÊ Kahn 1994, 545-56, 553- 
54; Palmer 1989). 

Third, and most importantly, Thucydides implies that 
the possibilities of human excellence and the success it 
brings are substantially attenuated by the radical con- 
tingency that is woven into the fabric of human ex- 
istence, and he proposes to tell his readers stories as 
one way of communicating the characteristics of human 
excellence, which is so fragile and yet so necessary for 
successfully accomplishing one’s ends in the world.” 
The importance of considering the Thucydidean pre- 


77 We recall again Morgenthau’s (1946) skepticism regarding the suf- 
ficiency of axioms for practical guidance (204-23). 

78 For a catalog of Thucydides’ uses of “arête,” see Palmer 1989, 
368-71. 
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sentation of realist wisdom is that the kind of excel- 
lence we think is possible circumscribes the kinds of 
human actions—including political and military—we 
think are possible and desirable. Accordingly, the ques- 
tion of excellence and how to illustrate it remains a 
question for students of international relations and 
political philosophy alike, but also for policy makers 
and political leaders. As Waltz (1988) points out, his 
neorealist theory is an effort to “explain how the or- 
ganization of a realm acts as a constraining and dis- 
posing force on the interacting units within it” (618). 
Structure almost always leaves some room for indi- 
vidual initiative and chance, and stories pick up this 
key point by displaying the ways in which individuals 
maneuver within structural constraints and within the 
vagaries of contingency. Good statesmanship, more- 
over, does not necessarily lead to political or mili- 
tary success: Thucydides, along with later realists like 
Machiavelli, was keenly interested in understanding 
why. 

This argument is not some entirely new reading or 
interpretation of Thucydides or realism and neore- 
alism in general; it shows, rather, that by examining 
his storytelling, we discover a necessary dimension of 
Prescriptive realism and neorealism that has remained 
under-appreciated in the secondary literature. Uncoy- 
ering this dimension adds a needed caveat to a realist 
or neorealist theory of international politics that would 
too readily disparage the “role of the accidental and the 
occurrence of the unexpected” so as not to “dampen 
[our] theoretical aspirations” (Waltz 1995, 71). I am not 
suggesting that the accidental should overtake theory, 
nor should stories exclude other ways of communicat- 
ing what we are calling practical wisdom or prudence in 
the light of the accidental; rather, taking the accidental 
into account is a vital part of realist wisdom, and narra- 
tive is a crucial mode of communicating such wisdom, 
even while nomothetic theories may continue to hold 
first place as the “necessarily slender explanatory con- 
struct[s]” with which we begin and end (Waltz 1995, 
75). Narratives, anecdotal compilations, theories, and 
the conclusions of theoretical analysis are all, for better 
or worse, either forms of knowledge or the media for 
specific kinds of knowledge. As Nietzsche argued so 
eloquently, every form or medium of knowledge is a 
way of both seeing and not seeing (c£ Nietzsche, 1983, 
S9fL; 1968, 470 [sect. 24]; 1966, 9-32). Stories, in partic- 
ular, can exhibit not only explanatory constructs, but 
also sensibilities, through which authors can teach their 
readers. 

The need for stories as transmitters of wisdom rests 
on four general characteristics of narrative. First, sto- 
ries display particularities in such a way as to shed light 
on the complex generalities that constitute the con- 
text of the particulars. They permit a kind of inductive 
reasoning. From a story we may derive one or more 
general, probabilistic principles of human conduct, but 
we also retain the complex context of such principles 
for action. The narrated events of a story are “re- 
vealed as possessing a structure, an order of meaning” 
by the narrative itself and which they do not possess 
as “real events” apart from some narrativized content 
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(White 1987, 4, 5). Stories become a guide for prudent 
deliberation about what to do in similar circumstances 
(Cogan 1981, 232-38). Whereas we may glean axioms, 
maxims, or abstract principles from a story, itis the story 
itself that provides the context that permits judgments 
about what axiom to follow in what way in a specific 
instance. 

Second, stories display the essentially contingent 
character of human life, but they weave the contingent 
occurrences of our experiences into a comprehensible 
whole. Stories “disclose a world in which its readers 
are invited to dwell, or a character in relation to whom 
the readers are asked to see themselves.” Thus, a story 
becomes “a context for reflection and action” (C. Wood 
1987, 12). In this way, a story is not first and foremost 
a work of abstract reason specifically, but of what we 
might call such intelligence generally. Whereas the no- 
tion of reason is a particular philosophical construct 
with clear implications about the structure of the ra- 
tional mind and what it perceives or can perceive, the 
general concept of intelligence is broader. Intelligence 
on this count is “a working endowment rather than a 
theory and can be active in the absence of a philosoph- 
ical theory about the rationality of the universe and the 
structure of mind that enables it to grasp the rationality 
inherent in the world.” We might call intelligence “a 
quality of brightness that enables all normal human 
beings to some extent and some to an extraordinary 
extent to grasp relations and implications in complex 
situations” (Wyschogrod 1983, 5). 

In terms of astory, then, it follows “[f]rom this intel- 
ligible character of the plot” that “the ability to fol- 
low a story constitutes a very sophisticated from of 
understanding” (Ricoeur 1991, 4). Such a notion of 
intelligence or understanding, which is embodied in 
stories, makes it the foremost constituent of wisdom. 
It gives us a wider sense of human knowledge and 
wisdom than analytical reason alone, especially with 
regard to the role of narrative in providing the insights 
of wisdom. Both in its accessibility to induction and 
in its display of contingencies within a larger whole, 
narrative provides a context within which actions and 
events can be understood. It may also provide a con- 
text that clarifies for us why some phenomena remain 
quizzical or even incomprehensible. One such mys- 
tery concerns the phenomenon of radical contin- 
gency within a complex context and successful action 
against it. 

Third, then, narrative is prior to analysis and the 
axioms it produces. Analysis is the activity, as its et- 
ymology indicates, of “loosening up” the narrative, 
which includes discerning its parts, testing them for 
conceptual coherence and consistency in usage, and 
unearthing the logic of the story. Out of such analysis, 
which is a critical reflection on the story being told, may 
come prescriptive axioms of behavior. Both the analysis 
and its axiomatic results, however, are “nonnarrative 
mode[s] of discourse” that depend on a prior narrative 
for their comprehensibility (Lash 1982, 76). Accord- 
ingly, we may note that neither Morgenthau nor even 
Waltz can either derive or present their theories with- 
out reference to examples and illustrations, e.g., stories 


(Morgenthau 1954; Waltz 1988, 620-28). Narrative is 
in this case the instrument for concretizing theory. 

Fourth, if stories are not theoretically (or analyti- 
cally) precise, and yet remain intelligent and intelligi- 
ble, then they illuminate in a particular way. One quality 
of wisdom is that it “cannot be said ... but only done 
or practiced.” This implies that while axioms may help 
us, we can ultimately make our wisdom visible to others 
only indirectly or reflectively (Strauss 1989, 91). Stories 
are such reflections of wisdom by indirection, because 
they form, as it were, a “Gestalt” of the world. Thucy- 
dides used illuminating stories, because the nature of 
that on which he sought to shed light is more accessible 
to the intelligence of narrative than the strictures of 
rational, categorical theorizing. In stories we hear the 
words and see the deeds that manifest wisdom or its 
absence. Wise proverbs are guiding maxims, the past 
practice and future reference of which are manifested 
in the words and deeds of our experiences or of a nar- 
rative. The meaning and substance of prudence and 
choice are made known to us in a narrative that in- 
tegrates the words and deeds that constitute prudence 
into a context that serves to make sense of the world for 
us: “Ultimately, history cannot make a complete break 
with narrative, because it cannot break with action, 
which itself implies agents, aims, circumstances, inter- 
actions, and results both intended and unintended. But 
the plot is the basic narrative unity that organizes these 
heterogeneous ingredients into an intelligible totality” 
(Ricoeur 1991, 5; cf. Hauerwas 1983, 38-44). 

In the context of their plot, stories contain a subject, 
which is sometimes a group but most often an indi- 
vidual. For our purposes, the storied activities of such 
groups or individuals satisfy the need of the decision- 
maker to be given illustrations of principles. Since sci- 
entific realism does not explicitly examine the role of 
group behavior or individual action, it seems to offer 
little for the individual decision-maker beyond a justifi- 
cation for resignation. Scientific realism and neorealism 
articulate the structural constraints of a system; they 
may leave room for action and for chance, but that is 
not their concern.” Morgenthau and Waltz, for exam- 
ple, tell leaders what is beyond their control—which is 
important. But what such leaders also want to know 
is what to do with that which is within their control. 
To that end, they must observe the prudent and excel- 
lent actor, the wise man, to become wise themselves. 
Machiavelli, Thucydides, and Hobbes, among others, 
all claimed to have observed, to have become wise, and 
to have passed on to us (in very different ways) the 
words and deeds they saw or about which they read, so 
that we, too, might become wise.*! 

In summary, then, a narrative is a particular way of 
answering the question, “What happened?” It is a way 





2 A more explicity “narrational” explication of realism may be 
found ın Carr 1964, esp. 22-94. 

D “To realize the possibility [of systems theory] requires concerving 
of an international system’s structure and showing how it works its 
effects” (Waltz 1979, 69). 

31 Hobbes 1968, 82-83; Machiavelli 1996, 5-6, 123-25; Th 

1919, 121.1, 122.34. In this way, as Euben (1977, 50-56, 1990, 198- 
99) argues, Thucydides 1s a political educator 
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of unifying speech and action into a comprehensible 
whole after they have occurred. At the same time, it 
may display disjunctions and disunities between the 
two (Euben 1977, 37-38). It is prior to theoretical anal- 
ysis insofar as it provides the material for such an anal- 
ysis. But here, it is also after theoretical analysis that 
Thucydides is trying to communicate in stories a notion 
of excellence. Told in stories, such a contextualized pic- 
ture of practical excellence communicates to leaders 
the wisdom that is their concern as practical actors. 


THE “MESSAGE(S)” OF THE STORY 


Practical guidance requires a context in which to make 
sense of action. That context is usually a complex one. 
In Thucydides’ account, its properties include irony, 
which is made evident in his praise of Brasidas and 
which is best expressed in a narrative. The import of 
this irony is to contextualize Brasidas’ virtues in such a 
way that we learn to be measured and moderate in our 
praise of them. We also learn, therefore, to be measured 
in our expectations of the “technical expertise” that the 
substitute apprenticeship of narrative or any set of ax- 
ioms derived therefrom can provide.* It is a common- 
place to observe that as Thucydides unfolds his story, 
he establishes a general contrast between the Spartan 
and the Athenian ways. He illustrates the differences 
in manner between the two chief antagonists of the 
war by means of their deeds, through the words of his 
characters, and sometimes in his own evaluations. The 
Spartans are slow and unwilling to dare, whereas the 
Athenians are quick and enterprising (VIIL96, I.70); 
the Spartans are moderate (sophrosyne), the Athenians 
immoderate (1.68, I.70, 1.83); in foreign affairs, the Spar- 
tans are accused of limited knowledge, the Athenians 
credited with extended knowledge (1.68, 1.71); the Spar- 
tans are characterized as conservative and hesitant, the 
Athenians as innovative, hasty, and tenacious (1.70, 
1.71, 1.102). These contrasting characteristics may be 
related to the differing bases of power of the two foes: 
Sparta is a continental power, exhibiting the calmer 
qualities of land, and Athens is a maritime power, ex- 
hibiting the restless qualities of the sea (1.83, VII.96). 
In brief, Archidamus in certain respects and Alcidas in 
general both display examples of the worst characteris- 
tics of the Spartans, whereas Brasidas frequently seems 
more an Athenian than a Spartan.” 

Against this background we observe several ironies 
in the story of Brasidas. The first emerges as we begin 
to notice with Edmunds that while Thucydides seems 
to praise Brasidas and is himself an Athenian, he also 
seems to affirm the Spartan virtues of provident de- 
lay, moderation, and planning, or cautious deliberation, 
while he associates the Athenian characteristics of au- 
dacity, impulsiveness, and quick action with the vices 
and injustices of the internecine strife of the Greek 





%2 Here agai, a comparison with the setbacks and successes in the 
career of Demosthenes may be instructive. 

© Strauss (1964) calls Brasidas the Athenian among the Spartans 
(213). 
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cities during the war. In the stasis of Corcyra, for ex- 
ample, virtues (usually associated with Sparta) become 
vices, and their opposites (often Athenian character- 
istics) virtues. This stasis, Edmunds argues, contains 
an inversion of what was for Thucydides the natural 
or normal order of political things (Edmunds 1975a, 
75, 76, 81, 89-91; Thucydides, 1982, IN.81.4-5, [.82). 
Accordingly, Thucydides would seem to favor aristo- 
cratic Spartan over democratic Athenian qualities. As 
the story of Brasidas unfolds within the context of the 
story of Athens and Sparta, however, we are led down 
a path only to find a jester at its end.* Spartan virtues 
seem superior to the Athenian, but the most success- 
ful Spartan in the first part of the war behaves like an 
Athenian. Ultimately, the very nature of irony implies 
that this problem of the Athenian/Spartan dichotomy 
in Brasidas’ career cannot be entirely resolved. What 
are we to make of this as a comment on excellence and, 
especially, practical wisdom? 

Let us recall that the great motion (kinesis) of which 
Thucydides provides a history is itself a kind of dis- 
order. The stasis of Corcyra and eventually of other 
Greek cities is possible only as a result and in the con- 
text of such a kinesis (I1I.82.1). In such a situation, the 
virtues of the Spartans are insufficient. In the Archi- 
damian war, at least, the characteristics of the Athe- 
nians seemed largely to give them the upper hand, as, 
for example, in the battle at Sphacteria. The Athenian 
qualities, however, are not stable. The Athenians (“with 
whom an enterprise unattempted was always looked 
upon as a success sacrificed” [IV.55.2; cf. 1.70.7]) want 
too much after their victory at Pylos/Sphacteria and, 
so, eventually lose much of what they have won there 
(IV.41.4, IV.21.2). They do not know when to stop, con- 
sider carefully, consolidate their gains, and perhaps rest, 
as the Spartans urge them to do at the peace conference 
in 425 B.C.E.” At the same time, the portrait of Athe- 
nian power and adventuring spirit, which is in large part 
responsible for initiating and sustaining the events that 
allow Thucydides to write the work that reveals “for all 
time” its abiding truths concerning human things, shows 
us that Spartan qualities alone are also insufficient. The 
“Athenian” virtues of Brasidas win wars, but only if 
they are tempered by “Spartan” virtues, and Spartan 





% Connor (1984) has illuminated the role of surprise and the unex- 
pected in Thucydides’ History generally (16-17, 53f£). 

Cogan’s (1981, 73-76) argument that the Spartan presentation to 
the Athenians is rhetoncally melegant us persuasive; however, it 18 
the Athenians, as he points out, not the Spartans, who make it so. 
One element of rhetorical effectiveness is the audience, which in this 
case 1s impossible to please. The Spartans may not have been good 
speakers, but it is difficult to imagine what could have persuaded the 
Athenians for peace at this point. This difficulty rests on what have by 
this time clearly become Athenian vices—ambition and greed. These 
vices, combined with general Athenian enthusiasm, have made the 
Athenians armless, In Cogan’s words, “They lacked a general and 
agreed conception of their aims that could tell them when they had 
won,” whereas Pericles’ strategy would have indicated victory at this 
pomt (76). In the Sicilian expedition, predicated upon Alcibiades’ as- 
sertion that endless activity and expansion are not vices, but necessary 
for the city’s survival (Thucydides, 1919, VI.18.6-7), these Atheman 
qualities end in a digaster that 1s not the end of the war only because 
Spartan hesitation does not take advantage of it (VIII 2.1, VIL24 5, 
VII.96.4-5). Compare Connor’s (1984) summary analyus (73ff.). 
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qualities alone do not win wars either. The conservative 
and self-serving behaviors of many Spartan leaders, for 
example, extended the war in its final stages (Kagan 
1987, 70, 330-31), and the Spartans were not above 
cruel severity (IV.80). When the Spartans did eventu- 
ally win the war, however, they did not deal with the 
Athenians (despite the urging of their allies) as these 
did with the Melians. In the latter stages of the war, 
the extreme impatience of the Athenian demos, on the 
one hand, and Spartan moderation combined with a 
newly developed but desultory adventuresomeness, on 
the other (as when, under Lycurgus, they built a large 
navy to match that of the Athenians [Kagan 1987, 325- 
53, 376-412]),** gave the Spartans the ultimate victory 
even while they permitted Athenian survival. 

Thucydides’ realist wisdom is correspondingly bifur- 
cated. In the greatest motion [kinesis] of all time, human 
nature leads to political corruption (III.82—II.83), yet 
some of the qualities needed to keep power for pre- 
serving one’s own city are precisely the ones that bring 
about the kinesis, which is in conflict with the aims of 
preservation. The tendency to kinesis is brought on by 
the desire for preservation (1.75.1-1.76.3, 1.88.1). If 
they are not checked, the qualities needed for preser- 
vation of the city will lead to a kinesis that may well 
destroy it. In the same way, then, those excellences of 
Brasidas that helped his city and permitted personal 
satisfaction for him may ultimately have contradicted 
the interests of his city. We see this contradiction in the 
conclusion to Brasidas’ story. 

In the spring of 423 B.C.E., the Athenians and Spar- 
tans agreed to a one-year truce. Both sides hoped to 
negotiate a more general peace. During the truce and 
contrary to the agreement, Brasidas continued military 
activity in Thrace. His actions understandably aroused 
Athenian ire (IV.122.4—5, IV.123.3) and threatened the 
prospects of a more lasting peace. Thucydides does not 
tell us if Brasidas was pursuing a military or political 
policy in his continuation of the operations in Thrace 
that were excluded by the truce. We are told only that 
Brasidas was the principal Spartan opponent of peace, 
because of the “success and honor”[eutychein te kai 
timasthai] the war brought him. Brasidas’ opposition 
to peace pairs him with Cleon, the greatest Athenian 
opponent to peace, for whom the war kept his crimes 
and slander from catching up with him. Apart from the 
possible damage to his own city’s interests, Brasidas 
was running considerable personal risk: If he failed, 
he could be exiled or even capitally punished (Kagan, 
1981, 309). 

At the same time, it was Brasidas’ successes that 
helped end the first war. The towns Brasidas captured 
either by persuasion or by treachery from within gave 
the Spartans something with which to bargain when 
negotiating the peace (IV.81.2, IV.117, V.14.1-2). In 
the view of the Spartans, however, the “good fortune” 





% On the “democratic” importance of the Athenian navy, which ran 
strongly contrary to Spartan inclinations, see Garst 2000, 71-72, and 
Thucydides VIIL.72-VII1.79 

37 This problem ıs analyzed more generally by Robert Jervis (1978) 
as the “security dilemma” in international affairs (167-214). 


[eutyche] of Brasidas would not last forever (IV.117). 
The success of Brasidas, moreover, was not entirely 
equivalent to the success of Sparta: The Spartans hoped 
to bargain their way out of what remained for them an 
adverse situation. Not until the battle of Mantinea in 
the third year of the Peace of Nicias did they regain 
their self-confidence and their reputation among the 
Greeks (V.75.3; Kagan 1981, 133f£; Strauss 1964, 222). 
Brasidas, on the other hand, was considerably more am- 
bitious. His enterprise in northern Greece, if success- 
ful, would establish a virtual empire there, for whose 
existence he would be almost singly responsible while 
commanding an army composed of Helots and other 
Greeks but few, if any, Spartans. In his final compaign, 
and the last military action before the peace, Brasidas 
surrendered to the darkness of chance (I.78.1-2, V.10). 
With his usual skill, swiftness, strategy, and good speak- 
ing, he successfully defended Amphipolis against an 
Athenian attempt to retake that vital city, but he fell in 
battle. Ironically, his death cleared the way for peace, 
but his ambition and successes made the negotiations 
and Spartan calls for moderation possible. Success— 
the achievement of one’s purpose—is not always what 
or as it appears to be. 

A second irony in Brasidas’ story, which foreshadows 
later episodes in the war and to which I have already 
alluded, concerns the question of his honesty. Brasidas 
was not always truthful (IV.85.7, IV.88.1, TV.108.5, 
1V.122.3, IV.122.6, IV.123.1, V.16.1).% The untruth- 
fulness of Brasidas’ speech is deeper, however, than 
mere misrepresentation or seductive oratory, so that 
his reputation for honesty, measured conduct, jus- 
tice, and goodness results in an irony that Machiavelli 
would have especially appreciated. Before Amphipolis, 
Brasidas had presented himself to the Acanthians as a 
liberator of Hellas, promising them autonomy, not a 
new subjection to Sparta (IV.85.1, IV.86.1). The Spar- 
tans intended, he said, to free Hellas, not to enslave it. 
The Athenians, moreover, would not attempt to retake 
the cities for their empire. If necessary, he would force 
the Acanthians to be free from the Athenian yoke. His 
character was such that force was not required: The 
cities gladly revolted. The measured conduct and ap- 
parent honesty with which Brasidas freed the Hellenic 
cities from Athenian hegemony were “what mainly 
created in the allies of Athens a friendly feeling for 
the Lacedaemonians.” Indeed, his reputation was so 
strong that the Hellenes held a “confident belief” that 
the other Lacedaemonians were like him (IV.81.2-3). 
This confidence would be tragically disappointed. In 
the “latter part of the war” (IV.81.2), Lysander would 
establish a harsh hegemony over the former Athenian 
allies: Thucydides foreshadows this Spartan rule at 
IV.132.3. The Spartans, contrary to Brasidas’ promises, 
send Spartan military governors to Amphipolis and 
Torone, and the Athenians do reply to the revolts in 





38 Hunter (1973) shows that the first untruthful speech of Brasidas is 
before his own men (IL87) as he tries to encourage them into battle 
against the previously victorious Athennian commander, Phormio. 


131 


Realism, Narrative, and Happenstance 


force.” As a careful reading of the latter part of Book 
IV makes clear, Brasidas himself is at least partially re- 
sponsible for the falsification of the claims in his 

at Amphipolis (Conner 1984, 130-38). In Machiavellian 
terms, the representative of the Spartans (Brasidas) 
made the Spartans seem good by his words and deeds 
when neither, especially the latter, was (cf. Machiavelli 
1980, chap. XVII). “Later on in the [Decelean] war” 
(IV.81.2), the reputation Brasidas gave to the Spartans 
would serve them well. 

Thus, Thucydides’ seeming preference for the “Spar- 
tan manner” is ambiguous. Sparta’s success rests in part 
on duplicity. Blithe estimations of Athenian reaction 
prove false, while the Spartans, too, will seek hege- 
mony over Greece, just as the Athenians had warned 
that they would even before war broke out. Thucy- 
dides praises the “excellence and comprehension” of 
Brasidas (IV.8.2) that secured these cities as bargaining 
chips, a purpose that Brasidas had originally denied, 
most likely because he did not know ultimate Spar- 
tan intentions. The “just and measured” Brasidas at 
the beginning of the campaign (IV.81.2) merely “shows 
himself’ measured in Thucydides’ second evaluation, 
and the cities “hear” of his “gentleness” and his as- 
surances, which helps incite them to revolt. Calling 
Brasidas’ statements “enticing but untrue” (IV.108.5), 
Thucydides can drop the attribution of justice, which 
the reader has discovered is a deceptive appearance 
whose seductive power will end either with Athenian 
revenge on some of the “liberated” cities or with Spar- 
tan tyranny. 

Third, the complications, ambiguities, and ironies of 
Thucydides’ account of Brasidas’ campaign in Thrace 
stand in sharp contrast to the simpler account of the 
distribution of capabilities that the structural theoret- 
ical framework of a neorealist paradigm might supply. 
As both Garst and Johnson argue in different ways, ne- 
orealism, in particular, pays insufficient attention to the 
human intentions, interpretations, and efforts of rhetor- 
ical manipulation and counter-manipulation that make 
up the substance of Thucydides’ story of this campaign 
and, indeed, of the war (Garst 2000, 75-84; Johnson 
1993, 208-29). Why do the cities of Thrace align as 
they do? Perhaps, following one part of neorealist rea- 
soning, they rallied to the side of Sparta to balance 
the growing power of Athens after her victory at Pylos 
(c£ Waltz 1979, 126; 1987, 148-61). On the other hand, 
some neorealists might predict (or even prescribe) that 
the Thracian states would “bandwagon” rather than 
“balance,” keeping and even strengthening their ties to 
Athens, given their closer geographic proximity to that 
increasingly powerful city (Van Evera 1999, 176; Walt 
1987, 172-80). As Garst persuasively argues, however, 





3 For a close analysis of this episode and of the Hellenic “hope” 
based on Brasidas’ character, see Rawlings 1981, 234-43. 

4 1761-2, VII.2 4. The apparent binary structure of the History 
adds weight to the ironic contrast between Brasidas and Lysander 
(see Rawlings 1981, esp 357). 

41 Van Evera (1984), for example, does not neglect “human factors,” 
but, in contrast to Thucydides, he does not elaborate beyond interna- 
tional structural or baselme domestic constraints on the calculations 
of statesmen. 
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the democratic character of these small Thracian cities 
made them natural allies with Athens, and Thucydides 
appeals to this internal characteristic as an “explana- 
tion” for their alliance behaviors (Garst 2000, 84). It 
is their decision to forgo this domestic link in favor 
of the persuasive arguments of the tactful, measured, 
and therefore smooth-tongued Brasidas, who repre- 
sents the oligarchic Spartans, rather than a calculation 
of the interpoleis balances of threat and power, which 
is of particular note to Thucydides. In contrast to the 
possibilities of balancing and bandwagoning, which can 
certainly constitute a simplified baseline of reckoning 
even for Thucydides (IV.108), his more complicated, 
ironic account may render no greater predictive cer- 
tainty than the hypotheses of neorealism. Indeed, the 
quest for certainty appears to be contradicted by Thucy- 
dides’ political story, but the prescriptive power of his 
narrative lies precisely in the ability of his more com- 
plex and even ironic narrative to give us practical wis- 
dom. 

The theme of “context” has appeared on multiple oc- 
casions in this essay, and we have seen that it is common 
for commentators on Thucydides’ History to refer in 
one way or another to the ways in which he establishes 
“context” in his work. Discerning the various threads 
in the woof and warp of the narrative tapestry that 
Thucydides weaves may well be the central problem for 
interpreting his History and thereby extracting from it 
the practical wisdom that he intends to bring before us 
for consideration. Because the texture of the History 
is so multilayered and complex, one may even ques- 
tion whether practical wisdom—if such wisdom, as I 
am arguing, requires some “context”—is possible at all 
for Thucydides. He believes it to be possible, but it is 
precisely the need to establish a context that is rarely, 
if ever, predetermined that gives practical wisdom its 
third ironic quality: No context can be absolutely de- 
termined, and contexts are likely to be multiple. 

The force of the ironies that surround the complex 
context of Brasidas’ life can only be delivered in a story 
(laying them out axiomatically would be like explaining 
the punchline of a joke), and they are important to a 
full (practical) understanding of the world Thucydides 
seeks to illuminate for us. A simple, axiomatic, struc- 
turalist, or hypothesis-driven description of the world, 
attenuated with a few provisos and caveats, is not in and 
of itself false, but insufficient.“ Thucydides implies that 
the world does not appear to us in that way, and neither 
can an account that purports to give us guidance in it. 
Furthermore, even though this guidance, permeated by 
the ironies of the situation, cannot be absolute, a simple 
list of “laws” as a tool for guiding practical action will 
not do: Its decontextualized character makes it insuffi- 
ciently intelligible to be practically useful. Such a list is 
also boring; despite an “absence of romance” in Thucy- 
dides’ history (1.22.3), political pedagogy means that 
the History’s readers are to be inspired by it to action 
(Euben 1990, 195). In this sense, Euben is right to note 
that Thucydides’ History contains an element of the 








42 Compare Benjamın 1972, 144-45. 
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“bardic” in its preservation of great deeds for all time. 
Such a poetic immortalization or preservation renders 
the deeds of the political actors in Thucydides’ History 
meaningful in a way that transcends their immediate 
occurrence and ific utility for those whom they 
first concerned. Political (and military) action is thereby 
preserved—in a story—as something worth doing, be- 
cause it brings a glory that does not pass away immedi- 
ately after its appearance and that displays a meaning 
that is greater than the act or the words themselves, 
at the same time that it may be beset by moderating 
ironies. Such a display is possible only in narrative. 
Even as we may be encouraged by Thucydides’ ac- 
count to be just, energetic, moderate, measured, coura- 
geous, clever, and audacious (if not for reasons of 
morality, then for reasons of successful action), the 
ironies of his history contextualize this list of human 
excellences in such a way that they remain undogmatic 
and not merely utilitarian. The accounts of excellence in 
Thucydides display the tragic truth, as Euben (1977) bas 
it, “not that truth is never useful, but that itis sometimes 
futile” (199). Brasidas died in battle—a risk in any mili- 
tary venture. More importantly, his political ambitions, 
while they could be sustained by his abilities in the field, 
could not overcome the lack of appreciation his city 
Sparta had for them. Whether his failure to recognize 
the lack of Spartan support constitutes a miscalculation 
on his part, or simple political stupidity, or deliberate 
(and perhaps foolish) disregard is unclear from Thucy- 
dides’ account alone. In any case, Brasidas’ excellences 
were imperfect, the success they wrought was partial, 
and their ultimate outcome ambiguous and ironic. And 
that is best grasped—for practical purposes—by telling 
a story about it. Thucydides’ view of excellence as nec- 
essary, yet assaulted by contingency and enmeshed in 
complexity, can be conveyed in an intelligible manner 
only by way of either a story or a list of axioms, but the 
story, or “narrating,” is much more powerful, “exactly 
because narration is the ‘science’ of the particular, is a 
more basic category than either explanation or under- 
standing” (Hauerwas 2002, 206; Milbank 1990, 264-67). 


SUCCESS, FAILURE, AND EXCELLENCE: 
REALIST INDETERMINISM 


The story of Brasidas is an illustration of excellence, 
yet also of ultimate failure, which is not, however, en- 
tirely a matter of bad luck. Brasidas committed a cru- 
cial political blunder by overestimating his ability to 
prevail against the Spartan government in his apparent 
ambition to expand the Spartan empire into northern 
Greece. He either misjudged or ignored the Spartan 
rulers, who seem to have been uninterested at that point 
in empire. They sought only the return of their soldiers 
(IV.117.2). His death, the ultimate “defeat,” may have 
foreshortened a possible success, but it is incidental to 
his political shortcomings. Why consider such failures? 

We may make five brief observations. First, one can 
learn as much about the requirements for excellence 





& On the political sigmficance of such stones, see Arendt 1958, 175- 
99. 


from an example of their absence—a story of failure— 
as from an example of their presence. Second, it is hard 
to find an unambiguously successful man in Thucy- 
dides’ History. Pericles and Nicias, perhaps the two 
most obvious candidates for excellence and success, 
are both eventual failures Their failures are partially 
of their own doing and partially circumstance. Pericles’ 
hold on the Athenian demos was never absolute (1.21.3; 
11.65.1-4), and he died too soon to see his policies 
through. Athens was left in the hands of increasingly 
self-seeking, unscrupulous despots, while the morals 
and even the instrumental rationality of the city itself 
seem to have declined (IV.65.3-4, I1.65.6-12; Edmunds 
1975b, 73-82; cf. Plato’s [1987, 515e-516d] criticism). 
One may argue, moreover, that many of the apparently 
tyrannical activities and porposals of Alcibiades were a 
consistent extension of the policies of Pericles (Palmer 
1992), which would further complicate any assessment 
of Pericles’ ultimate success or failure. In a difficult and 
perhaps ironic passage, Thucydides eulogizes Nicias as 
a man who gave “lifelong devotion to the practice of 
law-bred virtue [aré@te]” (VI.86.5).4 Yet Nicias was 
unable to persuade the adventuresome Athenians to 
moderation, and as the result of unfavorable political 
and military circumstances and his own mistakes, he 
ended his life as a traitor, butchered by ignoble en- 
emies (VI1.86.4; Strauss 1964, 198-99), and leading a 
large Athenian army with him into annihilation. The 
absence of unambiguous success even in virtue that 
is indicated in Thucydides’ qualified praise of Nicias 
seems itself to tell us something about human possibili- 
ties. Thucydides’ portrayal of the world may serve to at- 
tenuate striving for power and security, if not for moral 
reasons, then for practical ones. Thucydides’ story of 
Brasidas is therefore one example of an effort to teach 
us something practical about human excellence and its 
possibilities. 

Third, Thucydides’ narrative does not deliver to the 
reader an explicit or exhaustive catalog of the ex- 
cellences that he believes are required for success in 
any circumstances. A life is itself a circumstantial set 
of events, never to be repeated. At the same time, 
however, to recount it in a story illuminates for us some, 
even if not all, of the principles that attend our human 
existence, regardless of the outcome. No matter which 
figure in Thucydides’ History one were to choose, the 
catalog, by reason of the nature of narrative, would 
always be partial, requiring analysis of the stories that 
offer us the material to consider. Such partiality ob- 
tains even when a larger narrative (such as Thucydides’ 
History in toto) incorporates the many smaller stories 
into itself, While we might thereby achieve a kind of 
“impartiality” because of the multiperspectival, sum- 
mary viewpoint we are given and can share with one 





4 I am here following Palmer’s translation of this phulologicalty en- 
tangled passage On its importance to an interpretation of Thucy- 
dides’ understanding of virtue [aréte], see especially Palmer 1989, 
369-70, 380-81, 383n18, and Strauss 1964, 207-9; Thucydides’ qual- 
ified praise of Nicias 1s an important point for reading the History 
in general, even while its meaning is open to considerable dispute. 
Compare Connor 1984, 205-6, Edmunds 1975a, 141f£, and Gomme, 
Andrews, and Dover 1970, 461-64. 
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another, it is not possible to retreat to a synoptic, Archi- 
median point (cf. Arendt 1958, 11, 248, 251 vs. 175-212; 
and esp. Arendt 1954, 48-52). In this way—to repeat— 
narrative remains an essential complement to nomo- 
thetic approaches, a primary if not exclusive source of 
practical wisdom. Since I am articulating the story of 
Thucydides in a partially “nonnarrative” and certainly 
abbreviated form, this exercise seems circular, but it is 
not viciously so, because the order of our lives is indeed 
given only narratively, so that “we cannot set [theo- 
retical or axiomatic] explanation and [narrative] un- 
derstanding over against one another” (Milbank 1990, 
266). The two are complementary. They are two aspects 
of an analyzable but irreducible whole. Accordingly, 
as Robert Keohane (1998) has argued, “Each analyst 
of world politics has to locate herself or himself some- 
where along the dimension between . . . nomothetic and 
narrative epistemology” (194). (Nevertheless, the very 
form of the present argument illustrates the recurrent, 
ironic tensions between narrative and analytical expli- 
cation or exposition.) 

Fourth, one may proceed from this point to a larger 
mode of comparison. How, for example, do the narra- 
tive styles of other traditional realist-—say Machiavelli 
or Morgenthau—stack up against that of Thucydides? 
Among the many comparisons of differences between 
realists, Steven Forde has shown that close attention to 
their narratives is one way in which we discern these 
differences. We may find that the relationship between 
excellence and happenstance in different realist narra- 
tives points to a difference in the wisdom they pass on 
to us. 

Fifth, there is the matter of luck. Thucydides seems to 
show in the story of Brasidas and its context excellences 
that require each other for their full realization but 
that also tend to exclude each other from fulfillment. 
He also means to show the role of contingency in the 
complex of human affairs, an understanding of which 
remains for him a crucial part of wisdom.*? We may 
note that the gods of the Greek pantheon are largely 
absent from Thucydides’ history. This absence has led 
Bernard Williams to observe that Thucydides’ picture 
of the world is peculiarly modern—an observation that 
is confirmed by Nietzsche’s appraisal of Thucydides— 
namely, as a world in which there is nothing “beyond 
some things that human beings have themselves shaped 
... that is intrinsically shaped to human interests, in par- 
ticular to human beings’ ethical interests” (Nietzsche 
1968, 558-59; Williams 1993, 163). In such a world, 
luck or contingency plays a crucial but impersonal role. 

The foremost realist systematically to take into ac- 
count the role of contingency, which he referred to as 
“Fortuna,” is generally considered to be not Thucy- 
dides, but Machiavelli. In Machiavelli’s Italy, “person- 
alized” notions of contingency (known as Fortuna) and 





45 Indeed, Cornford (1907) clams that tyche ıs a central figure in 
Thucydides’ “drama.” 

Thucydides’ “modern” qualities may also be accentuated by his 
methodological relationship to the Greek students of medicme 
and to naturalistic philosophers ike Anaxagoras and Democmitus 
(cf Voegelin 1957, 351-58). 
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the human excellence required to overcome it were 
widespread and influential (Flanagan 1972, 130-35). 
Derived adjectivally from the Latin fors, which ulti- 
mately derives from the root of ferre (to bring), fors 
is that which is brought; Fortuna is the goddess who 
brings it. Tyche, similarly, derives from a Greek root 
meaning “to succeed,” or “to attain.” Thus, as Thomas 
Flanagan (1972) concludes, 


The basic meaning ın both cases is not what we moderns 
term ‘chance,’ that 1s, events which seem to occur randomly. 
Rather the connotation is that of success, which is brought 
about by an unseen person or power who works in ways 
inscrutable to us. Thus there is never a clear distinction 
between fortune and fate. Both conceptions refer to the 
order of the gods which can never be fully understood by 
men. If there 1s a difference, it is one of emphasis. Fate 
represents the divine will as something fixed and inflexible, 
while fortune represents it as elastic, unpredictable, and 
open to influence by human supplication. (130) 


Fortune was for Machiavelli, however, an entirely 
immanent phenomenon, which he personalized for pur- 
poses of prescription. The vagaries and vicissitudes of 
fortune in the realist world have no transcendent, di- 
vine source, nor do we have recourse against them in 
such a transcendent realm. Fortune is what times and 
circumstances bring us, to which we must respond with 
the immanent tools we have at hand (Flanagan 1972, 
142-56; Newell 1987, 628-29). Moreover, Machiavelli 
did not believe that men can entirely overcome Fortune 
with virtu, but that they can at times control “her.” As 
Flanagan has it, “Machiavelli in effect promises only 
that we can increase our chances against Fortune, not 
that we can eliminate her effects entirely.” Fortune is 
variable, changing with time and place.‘’ The man of 
virtu must accommodate himself to her caprice if he 
hopes to achieve his ends.® 

Thucydides, similarly, did not treat tyche as either 
a force or a goddess that we can control. The unex- 
plained, unforeseen events of tyche are not to be manip- 
ulated by us, nor are they the product of malevolent or 
benevolent forces. Like the later Machiavelli, however, 
Thucydides believed that tyche can be countered with 
preparation and experience. The Athenians, for exam- 
ple, take advantage of tyche at Sphacteria, and Brasidas’ 
generalship along with the discipline of his troops over- 
comes adverse tyche in his campaign in Thrace. Earlier, 
in a somewhat disingenuous speech before his troops 





47 One might usefully compare this notion of Fortuna with Carl von 
Clausewitz’s (1968) depictions of “friction,” chance, and incalcula- 
bility ın war (117, 140-42, 157, 162-67). 

“8 In the famous twenty-fifth chapter of The Prince, Machiavelli sug- 
gests that Fortune controls our doings about half the time, and ın the 
other half we can control her with the proper qualities of virtu. Space 
limitations deny a rehearsal of Machiavelli’s teaching on Fortuna, nor 
is ıt to the point here, but his statement that “I thmk it may be true 
that fortune is the ruler of half our actions, but that she allows the 
other half or thereabouts to be governed by us” (1980, chap. XXV) 
[“Nondumanco, perché ıl nostro libero arbrtrio non sla spento, iudico 
potere essere vero che la fortuna sta arbura della metà della azioni 
nostre, ma che etiam lei ne lasa governare l’altra metà, o presso’, a 
now” (Machiavelli 1995, 162-63)] does not mply, it seems to me (pace 
Strauss 1953, 178), that proper virtu will also permit us entirely to rule 
that half of our actions of which fortune 1s the ruler 
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prior to the naval engagement at Rhium, Brasidas en- 
courages his men with the questionable (and ultimately 
false) observation that previous defeats were the prod- 
uct of adverse tyche, now to be overcome by their own 
preparations. These words prove false largely because 
the Spartan preparation against the unforeseen is not 
as thorough as that of the Athenians (1.85.2, II.87- 
90). Tyche is the material happenstance to which the 
man of excellence is prepared to respond. Tyche is 
for Thucydides, too, a material and immanent, not a 
transcendent or divine element of incalculability in the 
affairs of men. Such incalculability can have various 
immanent sources, and it is not always possible to over- 
come it (Bury 1909, 128-30; Edmunds 1975a, esp. 7- 
88, 174-204; Woodhead 1970, 168-69), as the stories 
of Brasidas show. Most narrowly, the virtue that may 
overcome tyche is gnome, which consists principally of 
the twin aspects of reason and resolve (Edmunds 1975a, 
esp. 7-88; Rawlings 1981, 130-35). The other virtues 
that Brasidas exhibits supplement these and assure the 
success that begins with reason (or intelligence) and 
resolve. 

Tyche is the term Thucydides finds at hand to symbol- 
ize systematic unpredictability in the world. Since tyche 
is not systematic in its appearances, however, we cannot 
overcome it by mere calculation. The stories we tell are 
a way of rendering the incalculability of fortune or tyche 
intelligible and, thereby, of gaining wisdom concerning 
its place in human affairs By means of stories we may 
prepare ourselves to be excellent and thereby coun- 
teract bad luck or take advantage of good luck within 
the complex reality that is our existence. The story of 
Brasidas, which includes his political failures and his 
untimely demise, is one such tale. 


CONCLUSIONS 


Daniel Garst has persuasively argued that Thucydides’ 
History presents a picture of political power that not 
only is richer and more nuanced than the neorealist 
conception, but also points to a serious theoretical de- 
fect in it. Thucydides’ use of political speeches captures 
a distinction between the possession and the exercise 
of political power that neorealism misses. This confu- 
sion also leads neorealism to understand less nimbly 
than Thucydides did the workings of hegemony in in- 
ternational affairs (Garst 1989, 20-22, 22-24). On the 
literary front, we have noted that Steven Forde finds 
in Thucydides a presentation of the nature of power 
that perceives ironies and tragedies not evident in the 
writings of realists like Machiavelli (or of contemporary 
neorealists). I have added to these arguments concern- 
ing the richer depths of Thucydides’ realism an argu- 
ment that the narrative itself works as a pedagogical 
device to which modern realists and neorealists aspire, 





49 Hunter's (1973) analysis of this incident gives several reasons why, 
in Thucydides’ likely estimation, Brasidas’ speech is flawed (140-41, 
150-51, 157-58); cf. Allison 1989, esp. 28-65 Since the intention of 
this speech was to motivate troops to victory, “tactical” (but unsuc- 
cessful), rather than “flawed,” may be the correct adjective for its 
apparent lack of veracity 


but that they cannot emulate with nomothesis or even 
anecdotes alone. 

The Thucydidean realist world is a world of action 
and contingency, of human purposes and the vagaries 
of happenstance. Human beings are motivated to ac- 
tion primarily (but not necessarily exclusively) by fear, 
honor, and interest, which are linked either to acquiring 
or to maintaining political and material power. One 
overcomes contingency and luck not with obedient 
piety, but with preparation and skill, which are shaped 
by the qualities of one’s own character that one brings 
to bear in a particular situation. Moderation or even 
hesitation may, as with the Spartans, help us to see the 
limitations of our pursuit of power, but other qualities 
actually enable the pursuit. These other qualities are 
displayed in stories of how men pursue and keep power. 

Realist and neorealist reservations concerning 
morality notwithstanding, Thucydides’ story of the war 
may also have a moral purpose of sorts. Perhaps it in- 
tends to moderate our adventurism, because it shows us 
the ironies and ambiguities of pursuing power. Thucy- 
dides’ wisdom moderates us in a Spartan direction, be- 
cause we come to see that an unbridled response to fear 
and an unbridled pursuit of honor and interest defined 
in terms of power are ultimately self-contradictory or 
self-destructive, as was Brasidas’ strong-willed policy 
in Thrace and Athenian policy after the death of Per- 
icles. This self-contradiction may not always be appar- 
ent, as it was not to the avaricious Athenians, or to 
the Hellenic cities of the Athenian Empire, who, won 
over by the smooth words and seeming goodness of one 
general, exchanged a measured tyranny for a harsher 
one (which, ironically, was Spartan, despite that city’s 
many measured characteristics). Thucydides does not 
explicitly espouse a moral theory, and a reading of his 
History might make one wonder if that were possible, 
yet he leads us to see through his narrative that the pure 
pursuit of material advantage may require a conven- 
tional moderation to keep it from self-contradiction.® 
Perhaps this is one of the “secret instructions” of which 
Hobbes speaks"! 

Thucydides’ concern with human limitations along 
these registers is not a sentiment merely of frustration, 
but of a kind of tragedy that serves as an admonition. 
For Thucydides, pursuit of one’s own ends and the ends 
of one’s city, for example, may result in a tragic conflict, 





I See also Palmer 1989, 373, 381, and Forde 2000. In the larger struc- 
ture of the History, we might also compare the statements of the 
Athenian ambassadors at Sparta before the war with the growmg 
extremism of Athenian polices as the war continues. In the pre- 
war debate, the Athenians admit to “realist” motivations of “fear, 
honor, and interest” for retaining their empire (I 76, 1.77), but they 
also dam “measured conduct” in the treatment of their imperial 
subjects (I 77). That measured “tyranny,” which parallels the initially 
measured tyranny of Hippias and his predecessors at Athens (VI.54), 
stands m contrast to Athenian language and conduct at Melos, m 
Thrace, and m later episodes (c£. VL61). 

51 Srmularly, the Periclean appeals to glory (IL42-IL45) imply that 
longevity, stability, or even moderation is not necessarily politically 
ultimate. Attaming glory may render mdrvidual or collective survival 
itself secondary: Better a short, bright light than a dimmer and longer, 
but less worthy one. Since Hobbes ([1651] 1968) preferred survival, 
he endeavored to discount glory (130, 150-60). 
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as they did for Brasidas. Similarly, the virtues that pre- 
serve a city in war must be tempered by those that keep 
peace, yet these virtues may conflict. To survive, one not 
only must shore up and expand what one has, as some 
realists might suggest: One also must rest as the Spar- 
tans suggested to the Athenians? This, too, is part of 
“realist” wisdom. The need for moderation in the face 
of tragedy means that Thucydides cannot tell stories, 
except of certain military incidents, that are as simple as, 
say, Machiavelli’s terse examples of virtu in the Prince. 
Thucydides does not reduce every human activity to 
a kind of military—which is to say, technical—activity 
(Wood 1967, 170-71). Instead, the intentions of oth- 
ers, the ambiguities of objectives, and the tragedies of 
complex human interactions are a part of that which 
is permanent in the nature of things, exposed by the 
greatest kinesis of all time. Thucydides need not tell 
us so directly: His History can mean nothing less. This 
aspect of the History reminds us of Hayden White’s 
(1987) observation, taken from Hegel, of the seeming 
fact that “every historical narrative has as its latent or 
manifest purpose the desire to moralize the events of 
which it treats” (14). Thucydides seems strictly to avoid 
moralizing, and yet, to understand his reallist] world 
of human action and contingency, we must understand 
not only calculations of interest or fear or honor de- 
fined in terms of power, but also the other elements 
of human life. These include the conflicting intentions 
of our desires (justice and revenge, security and glory), 
the conflicting ends within a single city or even person 
and, most especially, the possibility of happenstance. 
Without being aware of these dimensions of human 
reality, we cannot be wise in Thucydides’ sense. 

While the focus of this essay has been on the decision- 
making of individual leaders, similar considerations 
are valid for decision-making “collectives,” such as the 
Athenian assembly. As we have seen, Thucydides iden- 
tifies specific “character” traits of the two central cities 
of his historical narrative, Sparta and Athens, a number 
of which he then transfers to a variety of individual ac- 
tors in that narrative, including Brasidas. The Athenian 
demos, moreover, is a decision-maker in its own right: 
its hastiness and impatience, subsequent regret, and 
false hopes, for example, lead to decisions of consid- 
erable import throughout the History (111.49, VI.93). 
Thucydides’ lessons concerning moderation and mea- 
sured conduct can apply alike to individuals and to 
groups that are articulated to action. While we should 
beware a one-to-one transference and while leadership 
within group processes is an important aspect of such 
processes (that it was for Thucydides is revealed, for 
example, in many of the speeches he records), groups, 


*2 To give an example of such disputes among realists, one might point 
to Machiavelli’s (1996) negative evaluation of Spartan moderation 
(21-23 [Duscourses, 1.6]; Mindle 1985, 219). His indictment seems 
nearly “Athenian,” at least if Alcibiades’ arguments for imperialism 
can be thought to be the attitude of the majority of the Athenians 
after the death of Pericles (Thucydides 1919, VI 18.4-7; cf. Forde 
1992, 384-87, 389); as noted above, Palmer (1992, esp. 41-42, 97,119- 
120) shows that the arguments of Alcibiades in favor of Athenian 
impenalism are consistent with the pnor arguments and policies of 
Pericles 
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like individuals, can be characterized by the qualities 
of their collective decision-making processes and out- 
comes (cf. Ober 1989, 68-95). 

The story of Brasidas reminds those who argue for 
the realist or neorealist schools in the study of political 
philosophy or of international relations of three princi- 
ples. First, axiomatic knowledge of material or systemic 
factors in international relations may be a good “first 
cut” (even though it is logically subsequent to narra- 
tive), but it is insufficient for practical purposes, since 
we must also take into account the roles of contingency, 
luck, and chance in a narratively contextualized way. 
Nomothesis, therefore, remains perhaps a good first 
glance but an insufficient lasting endowment for politi- 
cal practice. We require not only theoretical exposition, 
but narrative. That is, we require stories that integrate, 
contextualize, illustrate, and illuminate in those specific 
ways that theoretical analysis cannot. 

Second, luck and contingency are central to the suc- 
cess of human enterprises and they require human 
excellence—revealed in narrative—to deal with them, 
even under the conditions that realism holds to be true. 
The first point of wisdom in realism is that happen- 
stance is a real phenomenon of the world, requiring ex- 
cellence to overcome, and that this combination of luck 
and human response is an integral part of the world, 
which the practitioner (or student) of politics denies at 
his or her peril. Success, fame, or glory in international 
affairs requires not only the knowledge of the scholar 
or keen observer, but also the excellence of character to 
use that knowledge to one’s own purposes and, perhaps, 
the good fortune to be placed into the circumstances of 
power in which one can, in fact, employ such knowledge 
and character. 

Finally, as Daniel Garst (1989) also shows in a dif- 
ferently contextualized argument, the story of Brasidas 
points to the prudential requirements of political power 
(21-24). The incalculability in the complexity of human 
affairs that the story of Brasidas illustrates prevents 
politics from becoming simply a technique of mastery 
and demands that it be a practical science requiring 
prudence, wisdom, and, most likely, luck. The nature 
and qualities of this incalculability can be partially ar- 
ticulated in axiomatic generalizations, but, without ex- 
cluding lists of axioms, they are more ably and usefully 
demonstrated in that most potent substitute for direct 
experience—stories. 
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: | Wie view of institutions as coercion rather than as contracts dominates the comparative politics 
literature on both institutional creation and the politics of economic reform. The emergence of a 
collectively optimal tax code in Russia demonstrates the limitations of this emphasis on coercion. 

This new tax code was not imposed by a strong central leader, mandated by international institutions, or 
the result of state capture by powerful economic interest groups. Rather, it is the product of a mutually 
beneficial exchange between the Russian government and the Russian oil companies. Russia’s ability to 
negotiate an effective tax regime also suggests the conditions under which and the microcausal mechanism 
whereby exogenous shocks promote institutional change and economic reform. Owing to their mutual 
vulnerability and interdependence, the August 1998 financial crisis generated widely shared perceptions 
among these actors that the payoffs of cooperation had changed. Yet the economic reform institution that 
resulted required a series of incremental strategic moves that established common knowledge. 


of how institutions emerge—that is, whether 

they are the product of social contracts and 
strategic bargains, power asymmetries and coercion, or 
path dependence and evolutionary development. In- 
deed, although each subfield has favored a different 
perspective on institutional emergence, this is one of 
the central questions that unifies them. 

The view of institutions as contracts (or the contrac- 
tarian approach), which was born with Thomas Hobbes 
but reached its maturity in the public choice and ratio- 
nal choice literature, has dominated American politics 
(see, e.g., Kiewiet and McCubbins 1991, Shepsle 1986, 
and Weingast 1993) and the new economics of orga- 
nization (see, e.g., Moe 1984 and Williamson 1985) for 
almost two decades. According to this view, institutions 
are the product of individual exchange and competi- 
tive selection aimed at deriving a mutually beneficial 
outcome, such as efficiency or cooperation (see, e.g., 
Shepsle 1986). As comparative politics embraced the 
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new institutionalism, however, the contractarian ap- 
proach was largely rejected in favor of the view of insti- 
tutions as coercion (see, e.g., Bates 1988, Firmin-Sellers 
1995, and Knight 1992).! Despite their many differ- 
ences, this is true of both historical institutionalist 
accounts, which clearly eschew contracting (see, e.g., 
Thelen and Steinmo 1992), and rational choice institu- 
tionalist accounts, which have come to embrace the as- 
sumption that institutional outcomes are distributional 
(see, e.g., Knight 1998). 

The view of institutions as coercion rather than as 
contracts has also come to dominate the literature on 
the politics of economic reform. This is apparent in 
the prevalence of the argument that economic policy 
c e requires an authoritarian leader and/or a strong 
state.* Based on the divergent experiences of East Asia 
and Latin America, the view became widespread that 
necessary but difficult market reforms must be imposed 
from above—either to make effective investment de- 
cisions, to thwart organized opposition, or to over- 
come an inherent popular bias toward the status quo 
(see, e.g, Fernandez and Rodrik 1994, Haggard and 
Kaufman 1992, Stallings and Kaufman 1989, and Wade 
1990). Alongside this literature an analogous argument 
emerged regarding the need for international pressures 
to mandate or encourage market reforms (see, e.g., 
Kahler 1989 and Stallings 1992), ranging from com- 
pensatory aid packages and International Monetary 
Fund (IMF) conditionality (see, e.g., Sachs 1994 and 
Stone 2002) to the need to attract foreign capital (see, 
e.g., Simmons 1999).? The rejection of contractarianism 





1 The opposite trend occurred ın international relations, which 
moved from the dommation of the institutions as coercion approach 
(Neorealism) to the institutions as contracts approach (Neoliberal- 
ism). For an overview of this debate, see Baldwin 1993. 

2 In this literature, state strength is often equated with autonomy 
or insulation from popular demands and narrow economic mter- 
esta. For example, this ıs why Williamson (1994) advocates the use 
of autonomous technocrats (or technopols) to implement economic 
reform in developing countries. 

3 These two arguments come together most explicitly in Sachs 1994, 
which argues that successful economic reform requires a combination 
of strong (i-e., autonomous) leaders and international assistance 
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is also manifested in the predominant treatment of eco- 
nomic reform as benefiting a particular group at the ex- 
pense of the majority rather than expanding the pie to 
make everyone better off (see, e.g., Alesina and Drazen 
1991; Olson 1982, and Schamis 1999). 

We demonstrate that this emphasis on coercion hin- 
ders our understanding of the process by which insti- 
tutional creation and economic reform occurs through 
a detailed analysis of the emergence of a collectively 
optimal tax code in Russia from 1998 to 2002* based 
on extensive elite interviews. ° Following the Soviet 
Union’s retreat from Eastern Europe in the late 1980s 
and ultimate demise in 1991, scholars also endorsed 
the view of institutions as coercion in their attempts 
to understand institutional emergence and evaluate 
the prospects for market transition in the postcommu- 
nist world (see, e.g., Hellman 1998, Stone 2002, and 
Treisman 1998). We find, however, that this new tax 
code was neither imposed by a strong central leader, 
mandated by international institutions, nor the result 
of state capture by powerful economic interest groups, 
as both the general literature predicts and specialists 
on Russia commonly argue. Rather, it represents a 
negotiated settlement between the Russian govern- 
ment and the most powerful set of domestic eco- 
nomic actors—the Russian oil companies (ROCs}— 
in which each side incurred gains and losses from the 
exchange. 

At the same time, the emergence of an effective tax 
regime in Russia at the end of a decade marred by 
stagnant reform efforts offers an important corrective 
to the contractarian approach because it suggests an al- 
ternative mechanism whereby two equally powerful ac- 
tors can achieve mutual gains from cooperation. Owing 
to their mutual vulnerability and interdependence, the 
August 1998 financial crisis generated widely shared 
perceptions among the Russian government and the 
ROCs that the payoffs of cooperation had changed. The 
economic reform institution that resulted, however, 
was contingent upon a series of incremental strate- 
gic moves that established common knowledge. Thus, 
we also build on the literature that invokes exogenous 
shocks to explain economic reform by clearly specifying 
the conditions under which and the microcausal mecha- 
nism whereby exogenous shocks promote institutional 
creation and economic policy change. 


‘ The new Rusman tax code consists of two parts Part I was adopted 
in July 1998 and enacted in January 1999; ıt covers administrative and 
procedural matters, including the introduction of new taxes and the 
protection of taxpayers’ nghts. Part II was adopted ın August 2000, 
amended in November and December 2001, and enacted m 2001 and 
2002; ıt includes specifications on various taxes, including the WAT, 
corporate profits tax, personal mcomes tax, and the socal tax This 
article focuses primarily on explarming Part II, although Part I laid 
the groundwork for the approval and mplementation of Part IT. For 
an overview see, OECD 2001, 115-144. 

5 These mterviews were carried out systematically in Moscow from 
September 2001 to July 2002 with representatives from both Russian 
and foreign oil and gas companies, Russian government officials, 
and Russian and foreign financial and energy experts In most cases, 
names are withheld at the request of the mterviewee. 
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THE PUZZLE OF RUSSIA’S NEGOTIATED 
TAX REFORM 


Russia’s ability to negotiate a new tax code after almost 
a decade of abortive attempts presents us with a striking 
puzzle in light of both the dominance of the coercion 
approach in comparative politics and the widespread 
pessimism about Russia’s prospects for market reform. 
First, it directly contradicts the view of institutions as 
coercion whereby difficult economic reforms can only 
be imposed from above by a strong leader within or 
external pressures from outside the country. Second, 
both the recent literature on economic reform in post- 
communist states and the long-held conventional wis- 
dom on resource-rich states predict that Russia will 
fail to develop the necessary institutions to promote 
economic growth, particularly a stable and compre- 
hensive tax regime.” Finally, the mechanisms that have 
been posited to facilitate cooperation elsewhere—such 
as a precommitment strategy and social norms—were 
clearly absent in this case. 


From Informal Exchange to Formal 
Guarantees 


Tax policy in Russia for most of the 1990s can best 
be described as a classic collective action problem’ 
in which the two most powerful actors concerned— 
the Russian government and the Russian oil compa- 
nies (hereafter ROCs)—followed individually rational 
strategies that were collectively suboptimal. Owing to 
mutual suspicion and weak enforcement, the Russian 
government opted to maximize its revenue in the short 
term through setting arbitrarily high tax rates by de- 
cree and the ROCs opted to minimize their tax burden 
in the short term by developing increasingly elaborate 
schemes to evade these exorbitant tax rates. As a result, 
the government was able to collect approximately half 
ofits projected tax revenues and the ROCs could keepa 
substantial portion of their profits. Both sides, however, 
incurred heavy costs over the long term in the form of 
unreliable revenue streams, insecure property rights, 
and asymmetrical information that had a negative ef- 
fect on the Russian economy as a whole and could have 
been reduced or eliminated through cooperation. 
From the Russian government’s perspective, partic- 
ularly the Ministry of Finance, maintaining high tax 
rates was the best way to replenish the budget despite 
low compliance rates (see, e.g., RPI, March 1996 and 


6 Russia not only possesses the world’s largest natural gas reserves 
and the eighth largest oil reserves, but us the world’s largest ex- 
porter of natural gas and the second largest exporter of oil. See 
http//www.cia doe gov/emeu/cabs/russia.html. 

7 For a comprehensive discussion of collective action problems, see 
Hardin 1982 and Olson 1965. 

8 Although other actors (such as regional leaders) were also mfluen- 
tial m shaping taxation policy in the early 1990s, we focus our analysis 
on the ROCs and the Russian government because they were the 
most important actors involved ın the subsequent formulation of the 
new tax code—particularly Part I (authors’ personal communication 
with Alexander Ustinov, Economic Expert Group, June 2002). 
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Samoylenko 1998).? The official tax rates for industry 
and revenue sharing for regions that existed on paper 
were rarely, if ever, observed in practice, and tax col- 
lection rates were abysmally low as a result (Gustafson 
1999, chap. 9; Shleifer and Treisman 2000, chap. 6). Ac- 
cording to the IMF (2000, 69, 71), even as of January 1, 
2000, the value of unpaid taxes at the consolidated level 
(federal and local governments) was 8.3% of the GDP. 
Instead, economic elites presiding over the country’s in- 
dustrial enterprises as well as regional leaders engaged 
in ongoing negotiations with the government to de- 
termine their respective tax burdens (see, e.g., Easter, 
2002). Not surprisingly, the government purposefully 
targeted the highly lucrative and concentrated indus- 
tries in the energy sector for revenue extraction and 
its budget was highly dependent on this sector as a re- 
sult. In 1998, for example, the oil sector accounted for 
approximately one-fourth of all tax revenues (Russian 
Economic Trends 1998, 6). The Russian government 
was able to collect at least a portion of the revenue it 
sought from this particular sector by raising tax rates 
arbitrarily and threatening to employ different coercive 
means, such as blocking access to pipelines and export 
markets, if companies did not comply. Moreover, re- 
sponsibility for determining tax rates in the energy sec- 
tor was often shared by the Ministry of Fuel and Energy 
and the Ministry of Finance, which resulted in constant 
fluctuations in the tax rates, especially pertaining to ex- 
cise taxes, Despite the high transaction costs involved 
in revenue extraction, the Russian government, nev- 
ertheless, managed to collect between 33 and 35% of 
the ROCs’ revenues based on a statutory tax rate of 
53%.1° At the same time, the government could guar- 
antee itself a source of fuel by arbitrarily granting tax 
arrears to companies in exchange for providing energy 
to delinquent customers, which often included domes- 
tic industries as well as households (Jones Luong 2000). 

Yet this strategy only exacerbated the problem of 
noncompliance, particularly among the ROCs, by cre- 
ating a strong incentive for economic actors to hide 
their profits. It also failed to take into account that 
the ROCs’ tax burden was much higher than even the 
government projected because of the regional and local 
governments’ tendency to levy informal taxes on the oil 
companies operating in their regions by forcing them to 
provide social services and infrastructure investments 
(Gustafson 1999, 207). The ROCs responded to this 
untenable situation by developing a series of legal and 
semilegal schemes to hide their profits through which 
they effectively evaded heavy taxation."! Transfer pric- 
ing was the most common form. Because the corporate 





9 This was also confirmed in authors’ personal communication with 
Arkady Dvorkovich, Deputy Minster, Ministry of Economic Devel- 
ent and Trade, July 5, 2002 

10 Authors’ personal communication with Vitaly Yermakov, Re- 
search Associate, Cambridge Energy Research Associates (CERA), 
June 2002, and with Dvorkovich, op. at. 

11 Although the Russian government’s auditors and the Ministry of 
Finance could ascertain a close estimate of what the ROCs’ actual 
profits were through export quotas and yearly audits, the prevalence 
of these legal and semlegal schemes made it extremely costly to 
catch or to sanction the ROCs for tax evasion (authors’ personal 


income tax (or profits tax) was based on trade rather 
than production, parent companies could reduce their 
official income by creating trading subsidiaries (often 
located in a low tax zone within Russia) from which they 
purchased oil at below-market prices and then resold 
this oil at equally low prices to offshore Russian inter- 
mediaries (often located in a free-trade zone). By some 
estimates, the ROCs have been able to hide at least 25% 
of their export proceeds through transfer pricing.” As 
a result, the government only received 22% of the ap- 
proximately $30 billion in windfall rent from natural re- 
sources sales in 2000, while 78% remained in the hands 
of oil and gas exporters.! Actual (versus statutory) tax 
rates on oil not only were lower than they should be, but 
also differed markedly from company to company (see, 
e.g., Novaiia gazeta, August 7, 2000). Another form of 
tax evasion that detracted from profit-making activities 
included the development of intricate schemes to avoid 
payroll taxes. Here, parent companies would create 
offshore subsidiaries to pay their employees, arrange 
for insurance companies to pay their employees under 
the guise of large monthly payouts from life insurance 
policies, or pay higher corporate banking fees so that 
employees would earn higher interest rates than the 
market rate on their checking accounts.“ 

Thus, by the end of the 1990s the Russian govern- 
ment and the ROCs were locked in a vicious cycle 
in which exorbitant tax rates encouraged evasion and 
evasion encouraged even higher tax rates and threats 
of expropriation, leading to more elaborate and time- 
consuming schemes to hide profits. The results were 
disastrous for the Russian economy on the whole, which 
did not record positive investment rates or economic 
growth for most of the decade (IMF, 2000). Although 
the government had attempted to introduce new tax 
legislation since 1995, it was continuously stalled in 
the Duma due to the opposition of the “Left” (ie., 
the Communists and Yabloko) and of the oil and gas 
lobby, which the Yeltsin administration attempted to 
circumvent by issuing new decrees. Even as late as April 
1998, the Duma and the government could not reach 
an agreement concerning tax reform. Rather, the for- 
mer continued to impede reform by adopting numerous 
amendments, while the latter refused to compromise 
by addressing procedural issues related to taxpayers’ 
rights or lowering the tax burden (see, e.g., Samoylenko 
1998). Similarly, in addition to blocking tax reform in 
the Duma, the ROCs continued to hide their income, 
to exaggerate their losses in order to obtain tax breaks 
(RPI, April 1998, 10) and to pressure the government 
to reduce the energy sector’s tax burden (RFE/RL, 
July 23, 1998). 

This situation, however, changed dramatically by the 
end of 1998. Despite nearly a decade of failed tax re- 
form, from 1998 to 2002 the Russian government and 





communicaton with representatives of ROCs, September 2001 and 
June-July 2002, and with tax auditors, July 2002). 

12 Authors’ personal communication with Yermakoy, op. at. 

3 Authors’ personal communication with Yermakov, op at. 

14 Authors’ personal communication with Peter L Reinhardt, 
Pnnaple, Erst & Young, September 2001. 
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the ROCs were able to achieve mutual gains from co- 
operation in the form of a new tax code. In sum, this 
new tax code is designed to increase the reliability of 
revenue streams for both sides by creating incentives 
for broad-based tax compliance without a significant 
improvement in the government’s administrative ca- 
pacity and pegging the ROCs’ tax payments to fluctua- 
tions in the global price of oil. It also reduces informa- 
tion asymmetries by eliminating transfer pricing and 
encouraging the ROCs to disclose their actual profits. 
By setting fixed tax rates and reducing incentives for 
the government to resort to coercive tactics to meet 
its revenue needs, moreover, the new tax code has the 
potential to secure property rights for the ROCs and 
thereby enable them to both increase their investments 
at home and attract foreign partners to expand their 
operations abroad. 

Furthermore, it puts Russia decisively on the path 
toward market reform. By most accounts, Russia’s new 
tax code exceeds Western standards not only because 
it sets lower tax rates than the OECD recommends but 
also because it is much simpler and clearer than the pre- 
vious one. For example, it has introduced a 13% flat tax 
on personal income, capped corporate contributions to 
the social insurance fund, reduced the profits tax (a.k.a. 
corporate income tax) rate from 35 to 24%, abolished 
turnover taxes (as of 2003), tied export tariffs directly 
to the price of oil, and established new accounting pro- 
cedures that are on par with international accounting 
standards. In addition, a new mineral extraction tax was 
introduced in January 2002 (Chap. 26, Part II) as a flat 
rate pegged to the price of oil. It has also won the praise 
of foreign and domestic financial and political analysts 
for the inclusive nature of tax benefits and, thus, its po- 
tentially positive impact on the Russian economy as a 
whole (see, e.g., Andersen 2002 and Rabushka 2002).!5 
These assessments were confirmed by the initial re- 
sults, which indicate that tax collection rates have in- 
creased since the new code was put into effect (see, e.g., 
Kommersant, October 19, 2001; Pravda, October 18, 
2001; and Russia Journal, July 18, 2002). For example, 
the fact that income tax revenues jumped by 70% fol- 
lowing the introduction of the flat tax on personal in- 
come has been directly attributed to the abandonment 
of costly tax-evasion schemes (Aslund 2001, 22). 


The Fallure of Existing Accounts: Coercion, 
State Capture, and the “Resource Curse” 


How did the Russian government and the ROCs man- 
age to overcome their collective action problem and 
cooperate with one another to their mutual advantage? 
This outcome is especially puzzling because none of 
the existing accounts provide a satisfactory explana- 
tion. Neither the general predictions nor the particu- 
lar assessments of Russia that stem from the predom- 
inant view of institutions as coercion can explain the 
emergence of this new tax code as a product of intense 





15 This is the consensus among domestic and foreign financial an- 
alysts according to authors’ personal communications, September 
2001 and June-July 2002. 
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negotiation and compromise. Indeed, both the litera- 
ture on institutional emergence generally and that on 
economic reform in postcommunist states specifically 
predict that Russia should fail to establish a viable tax 
code—either through contracting or at all. Nor were 
the mechanisms often cited in contractanian explana- 
tions of institutional emergence through cooperation 
present in this case. 

That the view of institutions as coercion has dom- 
inated scholarly assessments of Russia’s transition is 
clearly demonstrated in the overwhelming pessimism 
concerning its prospects for economic reform in the ab- 
sence of either a strong leader within or external pres- 
sures from outside the country. Many have attributed 
market failure in Russia throughout most of the 1990s 
to the weakness of either President Boris Yeltsin or cen- 
tral state institutions vis-a-vis powerful economic inter- 
ests (see, e.g., Hellman 1998, McFaul 1995, Reddaway 
and Glinski 2001, and Robert and Sherlock 1999). 
Thus, not surprisingly, Russia’s hope for establishing 
the political order necessary to achieve market reform 
has often been placed in its newly elected president, 
and seemingly more authoritarian leader, Vladimir 
Putin, who succeeded Yeltsin in March 2000 (see, e.g., 
Nicholson 2001 and Shevtsova 2003).!® Because he 
not only enjoyed a popular mandate but also faced 
a Duma that was less polarized, and therefore more 
pliable, after the 1999 elections Putin seemed poised 
to unilaterally redefine Russia’s political climate and 
single-handedly push through his economic agenda 
(see, e.g., Remington 2001). Putin’s position as former 
head of the Soviet Union’s internal security service (i.e., 
the KGB) bolstered the presumption that he would 
be a strong leader, capable of using force to impose 

According to this perspective, the new tax code is 
best understood as yet another example of Putin’s unri- 
valed authority since his election. Almost immediately 
after ascending to the presidency, for example, Putin 
launched sweeping administrative changes aimed at re- 
ducing the power of the regions to influence national 
policy-making, including tax reform. The most signifi- 
cant of these was the reorganization of the Federation 
Council, through which the regional leaders had pre- 
viously obstructed the government’s attempts at tax 
reform and weakened the federal budget by demand- 
ing complex revenue-sharing arrangements (see, e.g., 
Treisman 1999). Putin also demonstrated to the ROCs 
that he was willing to use force through his decisive 
treatment of the notorious oligarchs Boris Berezovsky 
and Vladimir Gusinsky in the first few months of his 
presidency (see Lloyd 2000 for details). Yet, if this is 
what brought the ROCs to the bargaining table, it fails 
to explain why negotiations over a new tax regime be- 
gan in late 1998 under Yeltsin. 





16 Putin was appointed as prme minister in August 1999 and then 
elected to the presidency in March 2000 with 52.94% of the vote 
(RFE/RL Russian Election Report, 2000, 13) The Duma elected in 
Decamber 1999 contained a majority of his supporters, who appeared 
to form the first stable policy coalition in the Rusman parhament 
(the Unity bloc) For details, see Colton and McFaul 2000 and Smyth 
2002. 
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It also fails to explain why Putin did not simply 
impose a new tax regime that solely reflected the 
government’s preferences. With the elimination of the 
two biggest obstacles to tax reform in the 1990s—the 
polarized Duma and recalcitrant regional leaders— 
perhaps Putin could have passed the new tax code with- 
out even the tacit approval of the ROCs. He also could 
have continued Yeltsin’s arbitrary practice of changing 
tax rates by decree. Yet Putin chose to forfeit this partic- 
ular advantage vis-a-vis the ROCs and, instead, to con- 
tinue the negotiations and make the necessary compro- 
mises so that a comprehensive tax code could be passed 
through the Duma as a law that was endorsed by the 
country’s most important taxpayers—the ROCs.” As 
will be described in more detail below, to reassure the 
ROCs that their assets would not be expropriated after 
the 1998 crisis, he also publicly recognized the property 
tights of the oil industry and pledged his commitment 
not to renationalize. Thus, he seemed to be interested in 
making a truce with the remaining oligarchs rather than 
alienating them. Moreover, despite signs of economic 
growth and increased tax revenues in 1999, agreeing 
to formalize a tax code with lower fixed rates was not 
costless for Putin. If the ROCs chose to renege on their 
commitment to comply with the lower tax rates, the 
Russian government risked a budget deficit because it 
could no longer arbitrarily raise tax rates. 

Others have looked to the international community 
to provide either the financial incentive or direct pres- 
sure to propel market reform in Russia and other post- 
communist states (see, e.g., Aslund 2002, Sachs 1994, 
and Stone 2002). Indeed, the undeniable importance 
of international competitiveness and the increasingly 
direct role of transnational actors in domestic affairs in 
the context of globalization (Rodrik 1997) and a post- 
Cold War world (Weinthal 2002) lend support to this 
argument. Thus, an obvious alternative explanation for 
the new tax code is that the government pushed through 
this new tax regime in order to attract foreign invest- 
ment or appease its foreign creditors. Yet it has been 
clear since independence that, particularly in the en- 
ergy sector, Russia has very little interest in bringing in 
direct foreign investment (Jones Luong and Weinthal 
2001). In fact, until very recently, both the executive 
branch and the Duma have stalled the adoption of 
PSAs, which are universally viewed as the key to further 
foreign investment in the energy sector.!® As will be- 
come clear in the following section, the Russian govern- 
ment also did not adopt a new tax regime to attract for- 
eign direct investment but, rather, because the ROCs 
sought an additional guarantee for their property rights 
at home by attracting Western partners for joint ven- 
tures abroad. Nor is the new tax code simply a product 
of direct pressure from foreign financial consultants or 
the IMF compelling the Russian government to design 
a more suitable (i.e., Western-style) tax regime. In fact, 


17 According to the 1993 Russian Constitution, taxation is under the 
purview of the legislature; thus, a comprehensive tax code could only 
be adopted as a law. 

18 Authors’ personal communications with Vladimir Konovalov, 
Director, Petroleum Advisory Forum, September 14, 2001. 
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since 1997 the IMF has been encouraging the Ministry 
of Finance to raise, not lower, Russia’s tax rates. 

The contractarian approach has also been rejected 
in the form of two prominent and closely related ap- 
proaches to understanding the failure of economic re- 
form in postcommunist states. The first is the “winner 
take all” view, according to which a narrow set of actors 
captured the gains from early market reforms by virtue 
of their position and then effectively prevented sub- 
sequent reforms that might threaten these early gains 
(Hellman 1998). 

For Russia, in particular, this perspective is directly 
related to what many have deemed the evils of “insider 
privatization” (or privatization policies that deliber- 
ately favor those closest to the regime), because it either 
enriches a few agents who prefer to maintain insecure 
property rights so that they can exploit their privileged 
access to social assets (Sonin 1999) or creates “early 
winners” who block subsequent reforms that would 
threaten their ability to continue to reap the benefits of 
their ill-gotten gains (Alexeev 1999; Black, Kraakman, 
and Tarasova 2000). The most notorious “early win- 
ners” are the ROCs because of the way in which these 
well-positioned private actors acquired ownership of 
the energy sector—first, through a pure “asset grab” 
and, then, through a suspicious “loans for shares” deal 
(see Johnson 1997 and McFaul 1995 for details). By 
agreeing to the new Russian tax code, however, the 
ROCs do not “take all” but, rather, incur substantial 
economic losses. Despite reduced tax rates, the new 
tax code actually increases their tax burden because 
it eliminates the tax benefits of transfer pricing and 
capital expenditure deductions.” Even when oil prices 
began to recover, moreover, the ROCs chose to con- 
tinue negotiations with the Russian government and to 
bring their profits onshore rather then reverting back 
to tax evasion. 

More specifically, according to this perspective 
Russia’s inability to develop a viable tax regime thus 
far has also been attributed directly to this elite “re- 
distribution of power resources” following the Soviet 
collapse (Easter 2002). The new Russian tax code, how- 
ever, demonstrates not only that the ability of eco- 
nomic elites to derail the economic reform process is 
more limited than these pessimistic accounts suggest 
but also that, under certain conditions, these “insiders” 
or “early winners” can in fact serve as the engine of 
further reform. 

The second is the view of the Russian state as fully 
“captured” by key economic interests. The tax code, 
then, is merely the result of “state capture”—that is, 
the ROCs have used their excessive political influence 
to impose lower tax rates for their own benefit. Yet 
this is an unsatisfactory portrayal of the emergence 
of the new tax code for several reasons. First, if the 
ROCs were capable of simply dictating their preferred 
policies to government officials, it is not clear why they 
would prefer to establish a formal tax regime rather 


19 Authors’ personal communication with Dvorkovich, op. cat. 
2 Authors’ personal communication with Yermakoy, op cit.; see also 
Mazalov 2001, 2. 
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than continuing to benefit from the status quo in which 
they relied on informal networks and influence both to 
negotiate favorable tax rates and to evade unfavorable 
taxes. It also does not explain why the ROCs endorsed 
a tax code that inflicted short- and long-term costs on 
them by forcing them to fully disclose their profits, 
adopt accounting procedures, and accept an overall in- 
crease in their tax burden. Second, the literature on 
“state capture” predicts that those who benefit from 
a “capture economy” will reject institutions that dis- 
perse rents (Hellman, Jones, and Kaufman 2000) or 
improve the protection of property rights (Sonin 1999). 
Yet by protecting tax payers rights vis-a-vis state agen- 
cies, lowering rates for all taxpayers, and endeavoring 
to increase compliance across sectors and broaden the 
tax base, the new Russian tax code does both. It thus 
provides tax benefits that are inclusive in nature and is 
already having a positive impact on the Russian econ- 
omy as a whole. 

Beyond the literature on institutional emergence and 
the politics of economic reform, the long held con- 
ventional wisdom on resource-rich states also predicts 
that Russia will fail to develop the necessary institu- 
tions to promote economic growth, particularly a stable 
and comprehensive tax regime. In the vast literature 
on the “resource curse,” a weak (or nonexistent) tax 
regime is viewed as perhaps the most prevalent nega- 
tive outcome of resource wealth due to state leaders’ 
myopic thinking and heavy reliance on external (ie., 
rather than internal) sources of revenue (see, e.g., Karl 
1997, Mitra 1994, and Shafer 1994). The emergence of 
a viable tax code in energy-rich Russia, therefore, is a 
striking anomaly. 

Finally, Russia’s ability to achieve mutual gains from 
trade in the form of a new tax code is also puzzling 
because the causal mechanisms that are commonly in- 
voked in the literature to explain cooperation—such as 
a precommitment strategy?! or social norms (e.g., focal 
points)**—-were clearly absent in this case. 

Neither the Russian government nor the ROCs were 
unilaterally willing to make an extreme strategic move 
to signal their commitment to cooperation. Rather, 
each party continued to pursue independently ratio- 
nal strategies. For example, even when oil prices were 
declining in early 1998, rather than easing the tax bur- 
den on the ROCs, the government sought to squeeze 
the ROCs for additional taxes to cover the gaps in the 
government budget (RPI, June/July 1998). Likewise, 
the ROCs’ strategy to exaggerate their losses to obtain 
tax breaks only added to the level of mutual distrust 
between the ROCs and the Russian government (RPI, 
April 1998).7 

Moreover, the persistence of the behavioral norms 
and informal networks that guided elite behavior in the 


21 A precommitment strategy occurs when one player unilaterally 
makes and extreme strategic move to convince another player that 
he or she 1s willing to abandon his or her dominant strategy and 
change his or her behavior. On precommutment strategies, see, € g, 
Maoz and Felsenthal 1987. 

2 On focal points, see Schelling 1960. 

Z Authors’ personal communications with Dvorkovich, op. at., and 
Konovalov, op at. 
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Soviet period actually contributed to their incentive 
not to cooperate rather than serving as a focal point 
to overcome the collective action problem they faced. 
As noted earlier, the ROCs benefited from arbitrarily 
high tax rates under Yeltsin because they had privi- 
leged access to both formal and informal policy-making 
channels. Throughout the 1990s, they exerted political 
influence through two main forms of lobbying. The first 
and most common form was to influence deputies in the 
Duma to oppose or support proposed government leg- 
islation. The ROCs achieved this by either simply brib- 
ing deputies or supporting their own candidates (often 
former employees) for election to the single-mandate 
seats. The second form of lobbying took place through 
direct, personalized contact with members of the exec- 
utive branch—most importantly, the Ministry of Fuel 
and Energy.” Many oil companies used their close re- 
lations with key government ministries to block what 
they considered to be unfavorable legislation, includ- 
ing previous versions of the tax code.” Regardless of 
the method, however, the oil and gas lobby was highly 
effective and thus widely considered to be “one of the 
strongest and most effective lobbies” in Russia.” For 
the latter part of the 1990s, this lobby convinced a suffi- 
cient number of deputies and government officials 
to block tax reform and even to reverse unfavorable 
changes made by executive decree. In March 1997, for 
example, the ROCs persuaded both the Duma and the 
government to reverse a R15,000 increase in the oil 
excise tax (from R70,000 to R55,000 per ton of crude) 
that the Ministry of Finance and the State Tax Service 
pushed through several months before (RPI, March 
1997). 


EXPLAINING COOPERATION: EXOGENOUS 
SHOCKS AND INSTITUTIONAL CHANGE 


The emergence of a tax regime in Russia that is both 
collectively optimal and effective not only calls into 
question the dominance of the coercion approach to in- 
stitutions and economic reform in comparative politics 
but also presents us with an opportunity to refine the 
contractarian approach. The fact that Russia’s new tax 
code was the product of a strategic exchange between 
two sets of equally powerful actors—the Russian gov- 
ernment and the ROCs—aimed at mutual benefit sug- 
gests that the contractarian approach is more widely 
applicable beyond the United States and Western 
Europe than comparativists working on develop- 
ing countries may have thought. At the same time, 


4 Authors’ personal communication with Bons Makarenko, Deputy 
Director General, Center for Political Technologies, September 19, 
2001, Oleg Vyugin, Chief Economist, Troika Dialogue, September 
17, 2001; and representatives of ROCs, op. cit. 

> Authors’ personal communications with Vladimir Konovalov, op. 
at 

26 Authors’ personal communication with Mark Umov, Center for 
Political Technologies, September 19, 2001 

27 Authors’ personal communication with Vadim Eskin, CERA, 
September 12, 2001 This sentiment was shared by other experts 
whom the authors interviewed, including Makarenko, op cit., and 
Urnov, op cit. 
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however, the contractarian approach does not provide 
a complete explanation for why these two equally pow- 
erful actors were able to successfully negotiate a viable 
tax regime after almost a decade of failed attempts. As 
discussed in the preceding section, the mechanisms that 
are most often cited to explain cooperation are clearly 
absent in this case. We offer an alternative mechanism. 

In short, we argue that the shock of the August 
financial crisis provided the impetus for institutional 
creation and economic policy change in the form of a 
new tax code because it generated widely shared per- 
ceptions among both the Russian government and the 
ROCs concerning the benefits of cooperation and the 
need for formal guarantees. These shared perceptions, 
however, were contingent upon both sets of actors feel- 
ingequally vulnerable to the effects of the crisis and rec- 
ognizing that they depended on one another to recover 
from the crisis. Nor were shared perceptions alone suffi- 
cient to bring about institutional or economic policy 
change. Rather, the new tax code required that both 
sets of actors possessed common knowledge,” which 
was achieved through a series of incremental strate- 
gic moves. Thus, we also build on the literature that 
invokes exogenous shocks to explain change in either 
longstanding institutions or deleterious economic poli- 
ces (see, e.g, Drazen and Grilli 1993, Haggard and 
Kaufman 1997, and Krasner 1984) by clearly specifying 
the conditions under which and the microcausal mech- 
anisms whereby shocks induce institutional change and 
economic reform. 


Mutual Vulnerability and Interdependence 


Although its root causes were much deeper, the 
Russian government’s decision to devalue the ruble 
and place a moratorium on external debt payments trig- 
gered the August 1998 financial crisis that sent shock- 
waves throughout the Russian economy: Real GDP 
plummeted, inflation and unemployment soared, and 
commercial banks went bankrupt (see, e.g., OECD 
2000, 33-45). This crisis, of course, was also precipitated 
by the recession that spread across Asia in 1997 (or the 
“Asian flu”) and the steep decline in world oil prices 
in 1998. The immediate effects on Russia’s economy 
were devastating, yet the political effects (and the po- 
tential they offer for long-term economic growth) are 
even more striking. This exogenous shock enabled the 
Russian government and the ROCs to escape the vi- 
cious cycle of exorbitant tax rates and widespread eva- 
sion that characterized Russia’s tax regime in the 1990s. 
In short, both sets of actors emerged from the August 
1998 financial crisis with the realization that the costs 
of their past failure to cooperate were too high. 

Two conditions facilitated this change in perceptions: 
the mutual vulnerability and interdependence of the 





7% Common knowledge refers to shared information among players 
concerning the parameters and payoffs of the game In most game 
theoretic accounts, it eather is assumed to exist or can be achieved 
without direct communication. See Werlang 1989 


Russian government and the ROCs.” Mutual vulner- 
ability provides both actors with an incentive to come 
to the bargaining table because their own survival is 
threatened. While interdependence often gives rise to 
collective action problems, it also serves as the basis 
for cooperation when it creates a situation in which 
each side needs the other in order to recover from an 
economic crisis. Thus, our argument contrasts with the 
widespread depiction of institutional creation as bar- 
gains struck between kings and capitalists (or rulers 
and key economic actors) found in the contractarian 
literature in which it is sufficient for one side to feel 
threatened by the potential actions of the other (see, 
e.g, Levi 1988, North and Weingast 1989, and Tilly 
1990). It also diverges from the view that a crisis can 
induce economic reform by weakening the bargaining 
power of one set of actors—usually organized interests, 
such as labor, or the losers of reform—and strength- 
ening another’s—usually the state or incumbents (see, 
e.g., Remmer 1998) or when the state is sufficiently 
strong vis-à-vis societal actors to utilize the crisis as 
a window of opportunity to push through unpopular 
reforms (see, e.g., Krueger 1993). Likewise, it differs 
from the literature that argues economic reform can 
only come about after bargaining power shifts in a “war 
of attrition” such that one of the vulnerable socioeco- 
nomic groups concedes and bears most of the distribu- 
tional costs of the reform process (see, e.g., Alesina and 
Drazen 1991). Rather, both mutual vulnerability and 
interdependence are contingent upon the existence of 
socioeconomic actors who are powerful enough to chal- 
lenge the state. Although the existence of such actors 
is taken for granted in standard contractarian models, 
this is more often the exception rather than the rule in 
developing countries where the norm since the 1950s 
has been state-led development (see, e.g., Chaudhry 
1993). In Russia, this is the case only because the oil 
sector was privatized to domestic actors in the mid- 
1990s (for details, see Jones Luong and Weinthal 2001). 

The August 1998 financial crisis, which resulted in 
enormous losses in profits and tax revenue to the ROCs 
and Russian government, respectively, revealed the ex- 
tent to which these equally powerful actors were both 
vulnerable to global markets and, thus, the costliness 
of their previous failure to cooperate. The crisis was 
“sobering” for the ROCs because it “forced [them] to 
acknowledge the cumulative negative effects of their 
previous behavior.”*! Following the crash in August 





23 While Weyland (1998, 2002) also suggests that an economic shock 
can lead to a change ın perception, our argument differs in two ways 
First, we do not find that socioeconomic actors merely support the 
government’s reform efforts to recuperate their losses and avert fur- 
ther ones but, rather, that both sides have an incentive to bargain over 
the creation of economic reform institutions. Second, we take the 
argument that shocks cause a change in perception one step further 
by elucidating how this change in perception translates mto a change 
in behavior among equally powerful actors, resulting in these new 
economic policies. 

3 In fact, this may help explain why the contractarian approach 
has seemed less applicable outside the United States and Western 
Europe. 

31 Authors’ personal communication with ROC representative, 
June-July 2002 
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1998, many ROCs (e.g., YUKOS, Sibneft, and TNK) 
faced bankruptcy and lacked a cash flow to service 
their short-term secured bank debt (Tanguy 2002). 
Because many of them acquired substantial foreign 
debt, the Russian government’s decision to devalue the 
ruble made it even more expensive for the ROCs to re- 
pay these loans (RPI, September 1998, 7). Over the first 
nine months of 1999, for example, the indebtedness of 
5 major ROCs went up 54% (Kommersant, November 
23, 1999). As a result, they were unable to pay salaries 
and some of them (e.g., Sibneft) were forced either to 
shut down their operations for several months or to 
radically downsize their operations and decrease ex- 
penditures (RPI, October 1998, 17). Moreover, due to 
their reduced cash flow, combined with the lack of prior 
strategic planning and domestic investment to boost 
productive capacity, they could not immediately reap 
the benefits of the ruble’s devaluation by increasing 
production.* 

Similarly, the 1998 financial crisis exposed the gov- 
ernment’s susceptibility to fluctuations in the global 
marketplace because of its dependence on the oil sec- 
tor for budgetary revenue. Almost immediately, federal 
government revenues and expenditures plunged in re- 
sponse to the 1998 crisis. According to the IMF (2000, 
60, 63), cash revenues dropped in the third quarter of 
1998 to just 7% of the GDP, from 10.5% of the GDP 
in the first half of 1998, and cash spending fell in the 
third quarter to 11% of the GDP, from 16% of the 
GDP in the first half of 1998. As a result, according to 
Sergei Shatalov, First Deputy Minister of Finance, who 
had been a strong (but lonely) advocate of tax reform 
since the mid-1990s, “The 1998 financial crisis created 
the collective realization that we needed to change the 
[current] system of securing revenue.” 

The August 1998 financial crisis also made it 
painstakingly clear that they were dependent on one 
another not only for their economic recovery but also 
to insulate themselves from the effects of future crises, 
which required stable rules. For the ROCs, at a mini- 
mum “recovery” required investing in modernization, 
for which they also needed both greater predictability 
and more secure property rights. Prior to the 1998 crisis, 
the ROCs’ property rights were threatened by frequent 
tax inspections that resulted in profit losses through 
indiscriminate taxes and fees.» The 1998 crisis, further- 
more, exacerbated the fear among the ROCs that the 
government might renationalize their assets because 
many of them accumulated huge tax arrears and faced 
bankruptcy.’ 


X Authors’ personal communications with ROC representatives, op. 
at 
B Authors’ personal communications with ROC representatives, op 
cit. 


% Authors’ personal communications with ROC representatives, op 
cit. 

35 Anthors’ personal communication, July 2002. 

% Authors’ personal communication with Tom Adshead, Troika Di- 
alog, July 2002 

37 Authors’ personal communications with Stephen O'Sullivan, 
United Financial Group, July 2002, and with representatives of ROCs, 
op. cit. 
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At a maximum, the ROCs’ “recovery” required at- 
tracting Western partners in order to improve their 
assets at home and expand their operations abroad, 
which in turn required increasing transparency (e.g., 
disclosing information about profits and providing ac- 
curate information about profitability to shareholders) 
and establishing corporate governance. Specifically, 
the ROCs recognized that strategic partnerships with 
Western companies would give them access to foreign 
capital and technology, thereby enabling them to en- 
gage in the long-term strategic planning and domestic 
investment that were clearly so vital after the August 
1998 crisis.® At both ends of the spectrum, then, the 
ROCs could not recover without obtaining formal guar- 
antees from the government that it would not arbitrar- 
ily expropriate their assets or the proceeds from these 
assets through indiscriminate taxation.” Tax legisla- 
tion, moreover, would both enhance predictability and 
require increased transparency, thus providing them 
with the stability to invest in their assets at home and 
the credibility to attract Western partners to expand 
their operations abroad. 

“Recovery” for the Russian government required, 
at a minimum, budgetary stability. In order to regain 
their budgetary losses in the short term and to stabilize 
the flow of tax revenue from the oil sector, the Russian 
government needed to give the ROCs a greater incen- 
tive to pay their taxes voluntarily. As aforementioned, 
throughout the 1990s, the Russian government relied 
heavily on taxing the oil sector to fill its coffers. In 
addition to exorbitant corporate income tax rates, oil 
producers were subject to a fixed excise tax rate and 
stiff export tariffs. Yet, by August 1998, it became clear 
that this strategy was leading to the ROCs’ insolvency 
and creating an unreliable revenue stream. The fixed 
excise tax rate, for example, meant that oil producers 
paid approximately 55 rubles per ton (or about $9.00) 
regardless of the price of oil. Thus, when oil prices plum- 
meted in 1998, “many of the companies [were driven] to 
the verge of bankruptcy” (Samoylenko 1998, 3) At the 
same time, cumbersome bankruptcy procedures ham- 
pered the Russian government’s ability to seize private 
assets to induce tax compliance.” 

The government’s “recovery” at a maximum re- 
quired expanding the tax base and increasing compli- 
ance overall so as to reduce the budget’s dependence 
on the oil sector for revenue. Structural reform aimed 
at promoting real growth across sectors thus became 
a top priority for the government, and establishing 
a stable tax regime was a central component of this 


38 The desire for strategic partnerships does not mean that the ROCs 
sought foreign direct investment or that they were willing to share eq- 
uity in the Russian energy sector with Western companies but, rather, 
they were willing to make Western companies minonty shareholders 
in order to gain access to foreign markets, 
P Authors’ personal communications with representatives of ROCs, 
op- cit, and domestic and forezgn oil and gas analysts, June-July 2002. 
Russia has introduced several versions of insolvency laws since 
independence (i.e., 1992 insolvency law and the 1998 insolvency law), 
which has resulted in many ambiguities, making implementation an 
extremely difficult process (For details, sse OECD 1995). In 2002 
Putin vetoed the latest version (Pravda, August 7, 2002). 
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strategy (see, e.g., Vasiliev, February 2000).“ In par- 
ticular, Prime Minister Yevgeni Primakov sought to 
reduce the overall tax burden (e.g., lower the profit tax 
and VAT) and to introduce a package of basic laws in 
the Duma concerning fiscal reform (see, eg., RFE/RL, 
October 14, 1998, and Vasiliev 2000). His efforts were 
supported by the Ministry of Energy and Fuel in late 
1998 and later the Ministry of Economic Development 
and Trade in 2000, both of which viewed lowering the 
tax rate and tying export tariffs directly to the price 
of oil as promoting long-term recovery and growth in 
the oil sector while also serving to stabilize the govern- 
ment’s revenue stream, broaden the tax base, and im- 
prove the rate of compliance (RPI, September 1998).@ 


Incremental Strategic Moves, Common 
Knowledge, and Institutional Change 


Although the shock of the August financial crisis gen- 
erated widely shared perceptions among both the 
Russian government and the ROCs concerning the 
benefits of cooperation and the need for formal guar- 
antees, this was not sufficient to produce institutional 
change. To achieve a formal agreement, both sets of 
actors needed to demonstrate convincingly to one an- 
other that this mutual realization had actually occurred 
and would be sustained—that is, they needed to have 
common knowledge that all the relevant actors now 
preferred cooperation. In contrast to the standard ap- 
proach in international relations in which institutional 
bargains are struck after one side makes a single uni- 
lateral or preemptive strategic move (see, e.g., Jervis 
1978 and Maoz and Felsenthal 1987), we argue that this 
was achieved through a series of incremental strategic 
moves—both private and public—whereby each side 
assured the other that they were committed to tax re- 
form. 

In sum, the ROCs’ strategic moves included bringing 
their business activities back onshore through increas- 
ing domestic investment, adopting new international 
accounting standards, and embracing corporate gov- 
ernmance, while the government’s included pledging 
to honor private ownership of the oil sector and con- 
tinuing to work with the Duma to pass an acceptable 
tax code as a law rather than to change the tax system 
by decree. These assurances were effective not only 
because they were reciprocated but also because they 
were consistent—they continued even as the price of 
oil rebounded in late 1999” and the economy began 
to recover, resulting in an increase in both the govern- 
ment’s tax revenues (see, e.g., OECD 2000, 61) and the 
ROCs’ profits (see, e.g., RFE/RL, September 10, 1999). 

Shortly after the August 1998 crisis, the ROCs initi- 
ated an informal dialogue with the Russian government 

/ 





41 Authors’ personal communications with Dvorkovich, op. cit., and 
with Shatalov, op. at. 

© Authors’ personal communication with Dvorkovich, op. cit. 

4 In January 1998, Russan oil sold for $15.79 on the open market. By 
January of the following year, ıt dropped to $10 09, then rebounded 
to $23.36 by January 2000. See httpv/Awww.e1a.doe.gov/emew/cabs/ 
russia.html. 


to convince it to “soften” the tax system—that is, to 
become “less aggressive vis-a-vis the oil companies”— 
in order to aid their recovery by making investment in 
modernization possible.“ Specifically, the ROCs lob- 
bied for several features that are embodied in the new 
tax code—including the flat rate for corporate profits 
tax,“ the cap on contributions to the social fund, and 
export duties based on profits rather than production— 
because these changes were conducive to their new- 
found desire to invest more capital in domestic pro- 
duction and expansion.” In addition, the new tax code 
includes their demands for increased exemptions for 
investment in infrastructure, new technology, and new 
fields exploration, and compensates companies drilling 
in older fields. The ROCs also pressured the govern- 
ment to introduce a flat rate for the mineral tax, which 
would be tied directly to the world price of oil so that , 
when the price is low, the companies will be assured 
of a guaranteed amount of cash available for investing 
and long-term planning.” 

Several other more demonstrative strategic moves 
followed that served to reassure the Russian govern- 
ment that the ROCs were committed to reform. First, 
in June 1999 they showed good faith by agreeing to 
pay all their taxes in cash by December 1999 despite 
having recently downsized their operations (Moscow 
Times, June 23, 1999). This was a sharp departure from 
their behavior prior to the August 1998 crisis, when the 
ROCs would often run up huge tax arrears on their 
exports and then settle their tax liabilities in kind (e.g., 
barter deals), which made it extremely difficult for the 
Russian government to estimate how much revenue 
the oil sector would actually contribute to the bud- 
get on an annual basis (Aitken 2001). Second, at the 
end of 1999, the ROCs took an even bolder step by 
beginning to increase their level of domestic invest- 
ment and increasing their transparency. To ensure their 
immediate recovery, the ROCs began to invest heav- 
ily in increasing production either through fixing old 
wells (e.g., Yukos’s strategy) or through drilling new 
wells (e.g., Sibneft’s strategy). This trend continued un- 
abated, such that the following year (2000) the ROCs’ 
total investment jumped by 95% (see Figure 1) and 
shareholder confidence was actively being restored. 
In March 2001 at the Second Annual Russian Energy 
Summit, Lukoil’s chief executive Vagit Alekperov em- 
phasized the need for oil companies to enhance the 
transparency of their operations and adhere to other 
standards of corporate management and declared his 





M Anthors’ personal communication with ROC representatrve, July 
2002, and with Konstantin Reznikov, Alfa Bank, Senior Oil and 
Gas Analyst, September 2001. See also Fimansovye Izvestia 1998 and 
Rosstyskaya Gazeta 1998. 

45 The new corporate profits tax specifically targets the ROCs, be- 
cause the foreign oil companies still pay taxes abroad, usually at a 
higher rate (authors’ personal communication with Richard Lewis, 
Ernst and Young, June 2002). . 
4 Authors’ personal communications with representatives of ROCs, 


: oP cit., and with domestic and foreign financial analysts, op. cit. 


Authors’ personal communication with Steven Dashevaky, CFA, 
Aton Group, September 19, 2001. 
48 Anthors’ personal communication with O’Sullivan, op cit. 


147 


Contra Coercion 


February 2004 








FIGURE 1. Increase In Domestic 
Investment/Capltal Expenditures, 1998-2001 
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own company’s intention to continue to reinvest in 
its domestic projects. Finally, the ROCs agreed to re- 
veal their profits through their implementation of inter- 
national accounting methods (i.e., GAAP standards) 
and corporate governance codes (RPI, October 2001). 
Overall, these incremental strategic moves assured the 
Russian government that the ROCs were serious about 
bringing their profits onshore and no longer hiding 
them by investing offshore. 

The ROCs’ overtures did not go unrequited. Several 
key actors within the Russian government simultane- 
ously made incremental strategic moves that were in- 
tended to convey its own commitment to reform. First, 
the Ministry of Energy and Fuel introduced several tax 
proposals in late 1998 to eliminate the oil excise tax 
and to introduce instead a variable tax on incremen- 
tal revenue in order to reduce the tax burden on the 
ROCs and provide incentives for the ROCs to adopt 
Western technologies and to establish alliances with 
foreign partners (RPI, September 1998, 27). Although 
there were a few officials in the Ministry of Finance 
receptive to tax reform in the early to mid-1990s, such 
as Shatalov, this ministry was generally opposed to such 
cuts after the crisis for fear that it would lead to further 
losses to the federal budget (see, e.g., RPI, March 1998, 
39). Second, Yeltsin’s dismissal of Prime Minister 
Kiriyenko, who had initiated accelerated bankruptcy 
proceedings against the ROCs in mid-1998, and his 


® According to Shatalov, he sought to push through tax reform in the 
early 1990s, but he resigned in frustration because, prior to the 1998 
financial crisis, “the Yeltsin administration’s was not committed to 
real tax reform.” Only after the Putun administration demonstrated 
its commitment to tax reform did Shatalov agree to return to the 
government, at which time the Ministry of Finance also became 
more receptive to tax reform. Authors’ personal communication with 
Shatalov, op. cit. 
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replacement by Yevgeni Primakov in September con- 
veyed to the ROCs that the Russian government had 
abandoned its attempts at coercion and was willing to 
negotiate. Although initially he advocated a reversal of 
Russia’s neoliberal reforms, shortly after his appoint- 
ment Primakov met with representatives of the largest 
ROCs to begin seeking a compromise (Kommersant, 
September 30, 1998, 4). Primakov’s appointment also 
conveyed to the ROCs that the Yeltsin administration 
was committed to pursuing the formalization of the tax 
code by working with the Duma on tax reform rather 
than reverting to setting arbitrary tax rates by executive 
decree. While Kiriyenko and “the reformers” lacked a 
majority coalition in the Duma, Primakov was consid- 
ered to be acceptable to the large faction of Commu- 
nists in the Duma (Shleifer and Treisman 2000, 177). 
This was reassuring to the ROCs, both because it sig- 
naled the government’s willingness to compromise and 
because they universally considered a formal tax code, 
which required the approval of the Duma, to offer more 
stability than tax reform by decree.™ This commitment 
was reinforced in December 1998, when the Russian 
government proposed a package of eight laws to the 
Duma, which linked tax rates to the profitability of oil 
wells, and then again in January 1999, when Part I of the 
Russian Tax Code officially went into effect (Vasiliev 
2000). To further demonstrate its commitment to fis- 
cal reform and its willingness to compromise with both 
the ROCs and the Duma, the Primakov government 
introduced a reduction in the profits tax from 35% to 
30% (RFE/RL, November 17, 1998; Vasiliev 2000). Al- 
though the Duma had approved a similar reduction in 
July 1998 (RFE/RL, July 16, 1998), President Yeltsin 
vetoed the law (RFE/RL, July 20, 1998). 

The government’s next set of incremental strategic 
moves took place under Putin’s leadership. First, at 
the beginning of 2000 he established the Ministry of 
Economic Development and Trade to deal precisely 
with the task of promoting long-term economic growth. 
This new ministry immediately began to dominate the 
debates over tax reform inside the government and to 
represent the government’s position in private negoti- 
ations with the ROCs and public debates in the Duma. 
It thus replaced the Ministry of Finance, which had 
spearheaded the government’s previous approach to 
tax reform (see, e.g., Samoylenko 1998, 2) and which 
financial analysts and ROCs alike agree was fixated on 
fulfilling short-term budgetary requirements.*! 

Then, at the end of 2000, Putin supported the forma- 
tion of a special “working group” composed of govern- 
ment officials, Duma deputies, and the ROCs to discuss 
the remaining aspects of the tax code, especially those 
that were of special concern for the ROCs, such as the 
corporate profits tax and the mineral tax.*? The Min- 
istry of Economic Development and Trade in these dis- 
cussions advocated dramatic cuts in tax rates as a way to 
simultaneously stimulate investment in long-term and 


D Authors’ personal communication with Ustinov, op cit. 

51 Authors’ personal communications with representatives of ROCs, 

op. cit, and with domeste and foreign financial analysts, op cit. 
Authors’ personal communication with Shatalov, op. cit. 
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new oil projects” and increase tax compliance across 
sectors. By advocating a flat tax on corporate profits 
of 24%, for example, the government was committing 
to a much lower share of the oil companies’ profits in 
exchange for the full disclosure of their actual profits. 
In fact, the Russian Tax Ministry actually anticipated a 
decrease in tax collection in 2002 because of the new 
profits tax (Pravda, June 12, 2001). Morevoer, the Min- 
istry of Economic Development and Trade, unlike the 
Ministry of Finance, supported the flat tax on personal 
income and corporate profits tax because they believed 
that it would eventually expand the government’s tax 
base by encouraging investment in other economic sec- 
tors (i.e., beyond the oil sector) and reducing the size of 
the illegal economy.» They were also behind the more 
recent proposals to compensate companies “for losses 
resulting from adjustment of transfer prices for tax 
purposes”™ and to establish a flat mineral tax pegged to 
the world market price of oil. The latter, in particular, 
was deliberately designed to guarantee a more stable 
revenue flow to both the ROCs and the government 
over the long term.” The formation of this group thus 
conveyed to the ROCs not only that the government 
remained committed to tax reform, but also that it re- 
mained committed to working with the Duma to enact 
tax legislation. At this time, the Russian government 
also provided additional assurances to the ROCs—in 
both public and private forums—that it would not make 
significant changes to the tax code for at least three 
years after it was adopted in 2001, which is considered 
to be a fairly long period of time in Russia.” 

Finally, as each side became more secure with 
their reciprocal reassurances, their respective strate- 
gic moves became increasingly bold. Thus, in order 
to reassure the ROCs that the Russian government 
would cease to arbitrarily expropriate their profits and 
assets, Putin met with the Russian oligarchs in mid- 
2000 in Moscow, where he informed them privately 
and then announced publicly that “[he] would stay out 
of business, if [they] stayed out of politics” (see, e.g., 
Feifer 2000). The reaction of the ROCs was universal 
relief because they viewed this public announcement 
as a credible commitment to respecting their property 
rights because Putin had effectively tied his hands by 
raising the costs of renationalizing their assets. 





3 The previous tax system discouraged o1l companies from investing 
in long-term and large new projects (authors’ personal communi- 
cation with Yermakoy, op. cit, and with representatives of ROCs, 
June-July 2002). 

54 Authors’ personal communication with Dvorkovich, op. cit. 

35 Authors’ personal communication with Dvorkovich, op. cit., 
Reznikov, op. at, and Vyugin, op. at. 

56 Authors’ personal communication with Yermakov, op cit. 

57 Authors’ personal communication with Dashevaky, op cit. 

3% Authors’ personal communication with Ustinov, op. cit. Instead, 
the government pledged to introduce only minor amendments to 
“work out the bugs out of the code” (authors’ personal communica- 
tion with Ustinov, op cit.). 

9 Authors’ personal communication with O'Sullivan, op. cit, and 
with Adshead, op cit. 

6 Authors’ personal communications with representatives of ROCs, 
September 2001 and June 2002. Since this tıme, Putin has publicly 


CONTRA COERCION: ELITE BARGAINS 
AND INSTITUTIONAL CHANGE 


As the new institutionalism moved from American to 
comparative politics, the scholarly conception of in- 
stitutions as contractual agreements was transformed 
into a conception of institutions as coercive or im- 
posed outcomes. Russia’s ability to establish a viable 
tax code, however, demonstrates that this emphasis on 
coercion, particularly in discussions of economic re- 
form processes, is inappropriate. This new tax code was 
not imposed by a strong central leader, mandated by 
international institutions, or the result of state capture 
by powerful economic interest groups, as both the gen- 
eral literature predicts and specialists on Russia often 
presume. Rather, we argue that it is the product of a 
mutually beneficial exchange between the two most 
powerful actors in Russia—the Russian government 
and the Russian oil companies (ROCs)—following an 
acute economic crisis. 

The explanation we put forth provides an opportu- 
nity to refine both the contractarian approach and the 
literature on economic reform because it suggests both 
an alternative mechanism whereby two equally power- 
ful actors can achieve mutual gains from cooperation 
and a set of conditions under which and an alternative 
mechanism whereby exogenous shocks induce institu- 
tional and economic policy change. The August 1998 
financial crisis provided the impetus for change in the 
form of a new tax code because it generated widely 
shared perceptions among both the Russian govern- 
ment and the ROCs concerning the benefits of cooper- 
ation and the need for formal guarantees. These shared 
perceptions, however, were contingent upon both sets 
of actors feeling equally vulnerable to the effects of 
the crisis (i.e., mutual vulnerability) and recognizing 
that they depended on one another to recover from 
the crisis (i.e., interdependence). Shared perceptions 
alone were also insufficient to bring about institutional 
or economic policy change. Rather, the new tax code 
required that both sets of actors possessed common 
knowledge, which was achieved through a series of in- 
cremental strategic moves. 

Our findings also have several implications for the 
future study of institutional creation and economic pol- 
icy change. First, they suggest that institutional change 
is a gradual process with distributional consequences 
(gains as well as losses) for all the actors involved. In 
contrast with the contractarian approach to institutions 
found in both comparative politics and international 
relations, the recognition that there are mutual gains 
from cooperation does not immediately generate new 
institutions and economic policies. Furthermore, the in- 
stitutional change and economic policies that do come 
about require all parties to the bargain to make consid- 
erable trade-offs. While both sides benefited from reli- 
able revenue streams and stable rules of the game, they 
also had to incur heavy losses. The ROCs paid for these 





reaffirmed the government’s commitment not to renationalize assets 
that were privatized before 1998 on numerous occasions (see, ¢ 8, 
RFE/RL, 29 September 2003, RFE/RL, 22 September 2003) 
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benefits by agreeing to full disclosure and stricter gov- 
ernmental controls, which, despite the reduced profits 
tax, increased their overall tax burden. Similarly, by 
agreeing to the new tax code the Russian government 
forfeited its advantage vis-a-vis the ROCs—not only 
could it no longer raise tax rates arbitrarily, but it also 
risked a deficit in its budget revenue if the ROCs failed 
to comply with the new tax rates. 

Second, they demonstrate that neither a strong 

leader nor powerful economic interests can simply im- 
pose institutional creation or policy change. This chal- 
lenges not only the conventional wisdom concerning 
the politics of economic reform in developing countries 
but also the two dominant (and contradictory) portray- 
als of the Russian state as either fully “captured” by 
economic interests or steered from above by President 
Vladimir Putin since his election in March 2000. More- 
over, it suggests that the existence of two equally pow- 
erful actors enhances rather than hinders prospects for 
economic reform—a notion that has found empirical 
support beyond Russia (see, e.g., Arce 2003 and Kang 
2002). 
Finally, our findings belie the perception that Russia 
is “cursed” by its resource wealth (see, e.g., Rutland 
2001 and Starr 1998) and, instead, support the claim 
that private ownership of natural resources fosters the 
development of institutions conducive to long-term 
economic growth (Weinthal and Jones Luong 2002).%! 
If Russia’s oil sector had been state-owned in August 
1998, the financial crisis would not have had the same 
effect on actors’ perceptions and consequent behav- 
ior because these actors would not have felt the same 
degree of vulnerability and interdependence following 
the financial crisis. The Russian government, for ex- 
ample, would have had several other options, such as 
confiscating profits arbitrarily, increasing exports, and 
borrowing abroad against future revenue from their 
resource wealth.” The existence of private firms, how- 
ever, meant that seizing assets required complicated 
and costly legal procedures. Similarly, the ROCs could 
have turned to the government to subsidize their re- 
covery through foreign loans rather than seeking to 
finance it themselves. They would have neither been 
in a position to pressure the government to establish 
formal guarantees against arbitrary expropriation of 
their assets nor felt the need for such an agreement. 

Thus, Russia’s new tax code both demonstrates how 
transition economies can move beyond the supposed 
deadlock created by “early winners” and provides some 
confirming evidence for the hypothesis that privati- 
zation to domestic actors offers a potential way for 
resource-rich countries to escape the so-called “re- 
source curse” because it forces governments to negoti- 
ate with these actors for revenue. 





61 Goldman (1999) argues precisely the opposite— e. that privati- 
zation has hampered reform efforts. 

& These are the most common responses to “busts” among resource- 
rich countnes in which the resource in question 1s state-owned (see, 
e.g , Karl 1997 and Ross 2001) 
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for Agricultural Trade Liberalization 
CHRISTINA L. DAVIS Princeton University 


article explains how the institutional context of international negotiations influences their out- 

comes. I argue that issue linkage counteracts domestic obstacles to liberalization by broadening the 
negotiation stakes. Institutions bolster the credibility of the linkage to make it more effective. I test 

the argument in the agricultural sector, which has been among the most difficult sectors for governments to 
liberalize. Statistical analysis of U.S. negotiations with Japan and the EU from 1970 to 1999 indicates that an 
institutionalized linkage between agricultural and industrial issues encourages agricultural liberalization 
in both Japan and Europe. Through case studies of key negotiations, I first examine why countries choose 
to link issues, then show how the linkage changes interest group mobilization and shifts the policy process 


to promote liberalization. 


hy do some international economic negotia- 
We bring major policy changes while oth- 

ers end in deadlock? The difference between 
success and failure in these negotiations often amounts 
to billions of dollars and the seeds of economic disorder 
or cooperation. A successful negotiation can establish 
rules that open markets and promote coordination of 
policies. For example, the Bretton Woods conference of 
1944 established the framework for postwar economic 
cooperation that promoted greater interdependence. 
Fifty years later, the Uruguay Round Agreement re- 
duced agricultural and industrial trade barriers and ex- 
panded trade rules to regulate services and investment. 
On the other hand, failed negotiations often leave both 
sides worse off as relations between participants de- 
teriorate. One such setback was the World Economic 
Conference of 1933, which ended without agreement 
and was followed by retaliatory trade protectionism 
and competitive currency devaluations. Failures on a 
smaller scale can also have significant consequences. 
For example, inability to reach agreement on wheat 
support policies in the Tokyo Round led to a subsidy 
war between the United States and Europe during the 
1980s that drained their budgets and undercut the sales 
of developing country farmers. While the consequences 
of a negotiation may be far-reaching, the source of suc- 
cessful negotiation strategies lies in the details of the 
institutions that shape the negotiation process. 

To explain negotiation outcomes, one must look 
closer at how the agenda, rules, and procedures of a 
negotiation influence state choices. Power and interests 
alone fail to account for the variation across negotia- 
tions. Strong states sometimes are unable to persuade 
weaker states to open their markets, and influential 
lobby groups are not always able to prevent their gov- 
ernment from signing a liberalizing agreement. This 
article focuses on issue linkage, which is a common 
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negotiation strategy that involves combining multiple 
issues to change the balance of interests in favor of a 
negotiated agreement. Only when the institutional con- 
text supports a linkage strategy, however, will it appear 
credible. Once established, the institutionalized issue 
linkage applies greater pressure for liberalization than 
threats or domestic political and financial constraints. 
Moreover, issue linkage can bring liberalization even 
when it would be least expected in sensitive sectors. 
Using agricultural trade as a hard case that has been 
a frequent source of trade disputes, I present evidence 
that linking negotiations on agriculture and other sec- 
tors brings more agricultural liberalization than other 
strategies. Historically, agriculture stands out as a sector 
where countries stubbornly defend domestic programs. 
Farm lobbies represent the classic example of an influ- 
ential pressure group (Olson 1965). Indeed, nearly all 
industrialized countries raise the levels of protection 
on farming as the sector’s size in the economy shrinks 
(Hayami and Anderson 1986). Collective action incen- 
tives motivate farmers to organize, and both strong 
lobbies and electoral rules favoring rural districts guar- 
antee that farmers wield political strength beyond their 
numbers. As a result, while bound tariffs on industrial 
goods have fallen to an average rate of 5% for OECD 
countries, agricultural protection has remained high, 
with bound tariffs averaging 60% (OECD 2002a, 22). 
Nontariff barriers remained common in the agricultural 
sector long after they were eliminated for most indus- 
trial goods. Japan and Europe stand out among those 
giving the most protection to agriculture.’ 
Agricultural protection brings high costs in terms of 
financial expenditures, lost export opportunities, and 
increased trade friction. Agriculture exporters, which 
include the United States and the developing coun- 
tries, demand liberalization because protection closes 
off valuable markets. A study by the U.S. Department 
of Agriculture (2001) indicates that elimination of agri- 
cultural protection and support could increase global 


1 OECD producer subsidy estimates for 2001 show that 59% of the 
value of farm production resulted directly from government policies 
ın Japan, while the was 35% for the EU—both 
above the OECD average of 31% and the US. levels of 21% (OECD 
2002b, 160-61). 
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economic welfare by 56 billion dollars annually, which 
would be in addition to the direct budget savings. In 
Japan and Europe, on the other hand, where many 
producers are not competitive in world markets, liber- 
alization threatens the welfare of rural society. Politi- 
cization and high economic stakes make for an ex- 
plosive combination that threatens the stability of the 
trade system. Japan and Europe both have risked trade 
wars with the United States over food fights (Davis 
2003a). Agricultural issues have nearly blocked the 
conclusion of successive trade rounds and generated 
half of all GATT trade disputes over the period 1960- 
89 (Hudec 1993, 327). Agriculture continues to present 
a central challenge for the successful conclusion of the 
new WTO trade round launched in November 2001 
in Doha, Qatar. The large share of agriculture in the 
economies of developing countries makes further lib- 
eralization of agricultural trade essential if the Doha 
Round is to fulfill its development agenda. 

Although agriculture remains protected in compari- 
son to other sectors, liberalization has occurred. Over 
the past 30 years, even Japan and Europe have agreed 
to reduce many trade barriers, and the share of im- 
ports in total consumption has increased. According 
to the OECD measures of agricultural protection, the 
total support for agriculture as a share of GDP has 
declined from 2.4% for Japan and 2.6% for the EU 
in 1986 to 1.4% for both in 2001 (OECD 2002b, 187, 
196). In specific policy changes, market price support 
has been reduced, Japan dismantled its system of quota 
restrictions one by one, and the EU replaced its trade- 
distorting variable levy with a more transparent tariff 
system. Thus, negotiations on agricultural trade policy 
have included both dramatic negotiation failures and 
negotiations that brought substantial liberalization. 


FRAMEWORK FOR NEGOTIATION 
ANALYSIS 


In negotiations that bring liberalization, what has al- 
lowed governments to overcome domestic interests 
that will be harmed? The literature on trade politics 
highlights the importance of distributional stakes for 
interest groups (Grossman and Helpman 1995; Hiscox 
2002; Milner 1997). I emphasize how the institutional 
context of the international negotiation changes the 
aggregation of these interests. Negotiations that link 
issues across multiple sectors have a different impact on 
domestic politics than single sector negotiations, An in- 
stitutionalized linkage of negotiations on multiple sec- 
tors broadens interest group lobbying and bureaucratic 
jurisdiction to counter the domestic bias that favors 
protection. 

Putnam (1988) introduced the analogy of two-level 
games to characterize the observation that a leader ne- 
gotiates simultaneously over domestic goals and the 
international bargain. Since then, a growing literature 
has attempted to explain how interest groups, domes- 
tic political institutions, or the bargaining strategies of 
negotiators determine the range of possible negotia- 
tion agreements (Evans, Jacobson, and Putnam 1993; 
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Iida 1993; Odell 2000). While two-level game analy- 
sis has improved our understanding of how domestic 
politics affect outcomes, many studies treat the inter- 
national level as an undifferentiated bargaining arena. 
Milner (1997, 70), for example, writes, “The interna- 
tional game adopted does not have a well-defined in- 
stitutional structure; politics on that level are assumed 
to be anarchic, and international negotiations are gen- 
erally conducted without a constitutionally mandated 
sequence of moves.” This disregards the dense network 
of international institutions that shapes the conduct of 
any given negotiation. 

The institutions of the negotiation structure—the 
agenda, rules, and procedures that guide the interac- 
tion between states in a policy dispute—influence the 
negotiation outcome because they establish which al- 
ternatives are considered and how they are decided. 
As with other institutional constraints, such as interna- 
tional treaties or legislative committee rules, the insti- 
tutions of the negotiation structure favor certain actors 
and policy options (Keohane 1984; North 1990; Shepsle 
and Weingast 1995). In trade negotiations, institutions 
change both the mobilization by interest groups and 
the policy track for decision making. 

First, the negotiation agenda sets the negotiation 
stakes and policy scope. Publicly announcing the issues 
that are up for discussion informs domestic actors of 
the potential gains and losses. Interest groups are in- 
volved in the process of creating the agenda as they 
lobby for the inclusion or exclusion of their own issue. 
Few, however, lobby beyond their own issue. Govern- 
ments must aggregate diverse demands from domestic 
interests while also trying to accommodate other gov- 
ernments to produce a single agenda. The final agenda 
reveals the full array of issues that then become im- 
portant for all groups with a stake in any one issue. 
Likewise, the issues on the agenda determine the scope 
of bureaucratic and political committee jurisdictions 
that will address the negotiation. This matters given the 
importance of who initiates policy proposals and who 
makes the last decision (Garrett and Tsebelis 1996). 

Second, the negotiation procedures guide the se- 
quence of decisions, If there are multiple issues on the 
agenda, for example, the negotiation could culminate in 
a single decision on all the issues or separate decisions 
on each one. As shown by the literature on institutions 
in American politics, outcomes often differ according 
to which alternatives are presented to the legislature 
(Shepsle and Weingast 1995). 

Third, the nature of the rules determines the form 
of commitments that are reached in the negotiation. 
Specifically, the rule framework creates the expecta- 
tion for whether a negotiated agreement will represent 
a binding legal commitment with a monitoring mecha- 
nism. This raises the costs of later defection from agree- 
ments. Greater legalization adds the value of the rule 
system and future cooperation as new incentives. Taken 
altogether, the institutions of the negotiation structure 
have a direct impact on the distributional consequences 
of the negotiation. 

There are multiple venues for trade negotiations, 
and the institutional context influences the potential 
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for effective issue linkage. The General Agreement 
on Tariffs and Trade (GATT) and its successor, the 
World Trade Organization (WTO), form the core in- 
ternational institution for trade policy. Within the 
GATT/WTO framework, negotiations consist of com- 
prehensive trade rounds or legalistic dispute settlement 
procedures (DSP). The former bring together all mem- 
bers and are launched with an opening declaration that 
sets forth an agenda for discussion of liberalization 
across sectors. Rounds proceed as a mix of informal 
bargaining and consensus decisions that culminate in 
a multilateral agreement with binding commitments. 
Issue linkages are integral to producing agreement 
among the diverse economic interests of members. The 
Uruguay Round formalized more than any prior nego- 
tiation the explicit commitment to a package approach, 
which continues in the Doha Round. 

In contrast, linkages are more difficult to sustain in 
other institutional contexts. The DSP negotiations re- 
semble adjudication and begin with the filing of a legal 
complaint against a specific policy that leads to either 
plea bargaining or a negotiated settlement after a panel 
of judges provides a legal ruling. The narrow focus on 
the legal status of a trade barrier tends to exclude link- 
age among issues even while it raises normative pres- 
sure. Outside of the GATT/WTO framework, other 
types of trade negotiations include bilateral talks on 
either a single policy or a broad agenda of issues. In ad- 
dition, meetings of regional trade associations share the 
comprehensive character of trade rounds, but follow 
different procedures. Asia-Pacific Economic Coopera- 
tion (APEC), for example, emphasizes the voluntary 
nature of participation in nonbinding agreements. In 
bilateral and regional trade negotiations, linkages are 
possible, but not always present. 


Interests and the Cholce to Llink Issues 


I first consider the domestic and international politics 
that shape negotiation structures. Since states design 
institutions in order to address particular policy prob- 
lems, “institutions are both the objects of state choice 
and consequential” (Martin and Simmons 2001, 451). 
In addition to the anticipated functions performed by 
an institution, path dependency can make the initial 
choice of rules constrain policy choices even after it no 
longer serves those interests. For example, in an his- 
torical irony, the United States shaped GATT rules in 
1947 to create special exceptions to fit protection pro- 
grams for U.S. agriculture. Later, when U.S. agricultural 
interests had shifted to favor exports and other coun- 
tries had developed entrenched agricultural protection, 
the United States could not easily change these rules 
(Goldstein 1993).? 


2 The exception in Article X1:2(c) of GATT 1947 rules allowed agri- 
cultural quota restrictions when applied in conyunction with domes- 
tle supply constraints, and the exception in Article XVI:3 allowed 
the use of export subsidies for prumary goods. These exceptions did 
not exclude agriculture from GATT rules, and GATT panels have 
strictly interpreted their appHcability. Nevertheless, the regulation 
of agricultural trade has been problematic in part because of these 
exceptions See Hudec (1993, 326-36). 


The decision to establish a linkage in a negotia- 
tion agenda raises the possibility of a selection effect. 
Skeptics caution that international institutions cannot 
change state behavior on hard issues that raise distribu- 
tional concerns or strong domestic interest group oppo- 
sition (Downs and Rocke 1995; Mearsheimer 1994/95). 
From this perspective, states would only agree to link 
issues in a negotiation agenda when there is no strong 
opposition to any individual component of the agree- 
ment (Moravesik 1998). To address these concerns em- 
pirically, I investigate whether institutional linkages 
promote agreements even when they involve an issue 
where cooperation is unlikely on that issue alone— 
agricultural liberalization by Japan and Europe faces 
opposition by strong domestic lobby groups. In order 
to account for variation among the agricultural negotia- 
tions, I include measures for other characteristics, such 
as budget and economic conditions, that could make 
liberalization more or less likely for a given case. 

Several factors facilitate the acceptance of issue link- 
age even when there is strong opposition to agricultural 
liberalization. First, governments realize that a broad 
agenda encourages wider participation and greater po- 
tential gains from liberalization. Indeed, negotiations 
over the agendas that launched past trade rounds have 
consistently added more issues in order to gain the 
consent of all members (Steinberg 2002, 350-51). Sec- 
ond, protectionist interests face higher costs to mobilize 
early in the negotiation. The broad diplomatic coordi- 
nation required for the meetings that set the negotia- 
tion agenda privileges foreign policy elites and national 
leaders more than sectoral representatives. Although 
farm lobbies and agriculture ministries in Japan and 
Europe are likely to resist the inclusion of agriculture 
on the agenda, they will find it difficult to veto agenda 
items because the foreign policy decisions at this stage 
of the negotiation lie outside of their jurisdiction. More- 
over, the decisions that shape the structure of the nego- 
tiation occur amidst uncertainty about the timing and 
shape of the future agreement. This is particularly true 
for trade rounds, which bring together many countries 
and often last five to eight years. For politically sensi- 
tive cases, linkage in a multilateral setting will be more 
appealing than a bilateral negotiation or legal dispute. 
The longer time frame as well as the broader context 
avoids placing the spotlight on a single product, which 
makes it easier for protectionist interests and diplo- 
mats to reluctantly agree to talk about the issue (Davis 
2003b). Facing strong U.S. demands, the EU and Japan 
may prefer to negotiate their most vulnerable products 
in a linkage setting rather than in a bilateral or DSP 
negotiation.’ 





3 Some cases may simply never be raised in any kind of negotiation. 
For example, this could occur when there is strong resistance by the 
EU or Japan and the United States is willing to forgo 1ts demand for 
liberalization of the product The key point for this study, however, 18 
that when comparmg those products that are raised m negotiations, 
some of the toughest negotiation topics are likely to arise m negotia- 
tions that embrace an issue Imkage. Withm the sample of negotiated 
products, the selection bias, if anything, leads to underestimating the 
effect of linkage because harder cases may go to a negotiation forum 
with issue hnkage. 
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The Uruguay Round illustrates how these factors 
helped persuade governments to accept an agenda call- 
ing for liberalization across all sectors. When negotia- 
tors tried to formulate the agenda for a new trade round 
in 1986, developing countries such as Brazil and India 
were reluctant to discuss service sector liberalization 
and intellectual property rights as part of the Uruguay 
Round. Eventually, however, they agreed to participate 
because they expected to benefit from liberalization in 
other areas to be included on the agenda, namely, agri- 
cultural and textiles trade. For France and other Eu- 
ropean states, the incentives were the opposite; poten- 
tial gains from service sector liberalization persuaded 
governments to agree to a negotiation agenda includ- 
ing agriculture.‘ During EC decision making for the 
acceptance of the Uruguay Round agenda, the scope 
of jurisdiction favored foreign affairs officials over the 
representatives of specific sectoral interests. The Com- 
mission Directorate for External Relations produced 
initial proposals, and all of the important decisions 
were discussed in the trade committee and COREPER 
(the committee composed of heads of delegations) and 
then forwarded to the General Affairs Council for ap- 
proval (Vahl 1997, 80). Nevertheless, agriculture inter- 
ests were not shut out entirely. Their consent reflected 
that many in the Commission and national delegations 
believed that the final agreement would not require 
substantial changes of the Common Agricultural Policy 
(CAP) (Josling, Tangermann, and Warley 1996, 172). 
A Commission negotiator for the agriculture group 
negotiations of the Uruguay Round said, “We knew 
agriculture might be a problem, but this was not re- 
ally clear until the mid-term review in 1988. Nobody in 
Europe thought there could be a negotiation that left 
out agriculture, but it was hoped that there might not 
have to be major reforms—like in the Tokyo Round.”> 

Similar logic led Japanese government officials to 
plead with the U.S. government to address rice market 
access as an issue in the Uruguay Round rather than in 
a bilateral negotiation.® Yet at the same time, the Diet 
passed a unanimous resolution against any liberaliza- 
tion of the ban against rice imports. An official from an 
agricultural interest group explained in an interview 
that his organization did not oppose the government 
promise to discuss rice liberalization in the Uruguay 
Round because his group hoped to get support from 
France and other countries and thought the talks would 
go better than if Japan faced the United States alone 
in bilateral talks.” Yet in the end, the Uruguay Round 
did bring substantial reforms in the agricultural sector 
as well as in the industrial and service sectors, leading 
to both an overhaul of CAP and the partial opening 
of Japan’s rice market. In sum, economic interests, the 


4 For more on the mportance of linkage in getting the Uruguay 
Round started, see Croome 1995, 31, and Paemen and Bensch 1995, 
31-48. 

5 Official of the Commission, Directorate General for Agnculture. 
Interview by author. Brussels, 4 July 2000. 

6 Asahi Shimbun, 24 October 1986, 9 April 1987. 

7 Former official of Zenndrin Interview by author Tokyo, 26 
October 1998 
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costs of mobilization, and uncertainty about outcomes 
encourage acceptance of the issue linkage in the agenda 
despite resistance to liberalization of one component. 


Package Negotlations: Institutionallzed 
Cross-Sector LInkage 


Issue linkage has long served as a basic tool for politi- 
cal bargains and diplomatic deals. Sebenius (1983, 287) 
provides the definition that issues are linked “when 
they are simultaneously discussed for joint settlement.” 
This definition encompasses side-payments, log-rolling 
bargains, or a formal agenda on a broad range of issues. 
The focus of this article is on tactical linkages, which 
combine issues that do not substantively require joint 
settlement.® In such cases, multiple issues are included 
in the final settlement in an effort to create a balance 
where both sides gain enough to accept the costs. Trade 
liberalization, in particular, has relied upon negotiating 
across a range of products as countries exchange re- 
ciprocal concessions. Under what conditions will issue 
linkage promote agreement? 

One challenge for successful linkage is finding com- 
plementary issues. Sebenius (1983) points out that sim- 
ply adding issues does not necessarily promote agree- 
ment. Rather, adding a nonnegotiable issue to the 
agenda can cause the collapse of the entire negotia- 
tion. Much of the study of issue linkage focuses on 
combining issues so that all participants gain from the 
agreement (Mayer 1992; Tollison and Willett 1979). A 
second challenge is the difficulty of convincing all actors 
to believe that agreement on one issue is conditional 
on agreement on the other issue. Tactical linkages can 
be unstable when some participants resist the linkage. 
Several scholars voice skepticism about issue linkage 
because of this added credibility problem (Aggarwal 
1998; Eichengreen and Frieden 1993; Morrow 1992). 
Lohmann (1997) counters that it is possible for issue 
linkage to promote cooperation if actors care suffi- 
ciently about future interaction on one of the issue di- 
mensions for this “credibility surplus” to spill over and 
increase incentives for cooperation across issues, While 
issue linkage can promote cooperation in some cases, 
either the wrong combination of issues or an inability 
to credibly commit to the linkage may undermine the 
effectiveness of a linkage strategy. 

I examine how a particular kind of linkage, a pack- 
age negotiation structure, addresses these two problems. 
Package negotiations have a formal agenda that com- 
bines distinct issues for joint approval or rejection. This 
resembles the bargaining tactic Raiffa (1982) refers to 
as a “single negotiating text” that unifies discussion of 
different issues. I focus on the use of package nego- 
tiation structure to institutionalize cross-sector issue 
linkages. These negotiations address agriculture and 
industry sectoral issues along with other trade topics 
in a single negotiation. The cross-sector scope of the 
agenda combines issues to produce overall gains, while 


8 See Aggarwal 1998, 16-17, and Haas 1980 for discussion of different 
kinds of linkages 
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the institutional context promotes the credibility of the 
linkage. 


Cross-Sector Interests. Liberalization depends on 
overcoming the collective action problems and institu- 
tional biases at the domestic level that favor protection. 
For trade policies in general and agricultural issues in 
particular, those who demand protection have strong 
incentives and high levels of organization, while those 
who pay the costs are loosely organized taxpayers and 
consumers (Lindert 1991; Olson 1965). Protection poli- 
cies also persist because policy makers with a vested 
interest in the status quo retain control over decision 
making. The closed policy communities formed by the 
ties among farm groups, agriculture ministries, and po- 
litical committees in Japan, the EU, and France have 
been described as forming a corporatist relationship 
(Keeler 1996; Mulgan 2000; Sheingate 2001). Using is- 
sue linkage to mobilize industry groups and to broaden 
the policy jurisdiction helps to counter both problems. 

Cross-sector issue linkage offsets the influence of 
farmers by engaging interests important to other pow- 
erful lobby groups. Japan and the EU must offer conces- 
sions in agriculture, while both can gain much from lib- 
eralization in the industrial and service sectors. When 
there is a credible cross-sector linkage, industry lob- 
bies also advocate agricultural liberalization in order 
to achieve specific gains for industry from conclu- 
sion of an overall agreement. Finding domestic allies 
to support foreign demands has been a critical fac- 
tor in explaining variation in outcomes across differ- 
ent U.S.-Japan bilateral negotiations (Mulgan 1997; 
Schoppa 1997). The importance of the expansion of 
actors has also been widely commented on in stud- 
ies of European and American politics (Gilligan 1997; 
Lieber 1970). Schattschneider (1960, 40) argues that, 
by expanding the scope of conflict, losers try to “in- 
volve more and more people in the conflict until the 
balance of forces is changed.” In the case of agricul- 
tural liberalization, farmers represent the entrenched 
interest group, and issue expansion offers one route to 
dilute their influence by forcing competition with other 
interests. 

Not only does issue linkage lead to competition 
among interest groups, but also among actors across 
jurisdictional boundaries. The framing of issues in the 
negotiation shifts the policy discussion from one venue 
to another in the domestic arena. Studies of domestic 
institutions have long emphasized the importance of 
agenda setting given the substantive impact of differ- 
ences between domestic actors. Baumgartner and Jones 
(1993, 31) argue that “each institutional venue is home 
to a different image of the same question.” The view of 
the problem and preference for a solution will reflect 
the particular ministry’s own bias and interests—the 
agriculture ministry favors farmer interests, the trade 
ministry favors industry interests, and the foreign min- 
istry tries to balance national interests with concern 
for maintaining better foreign relations. Similar differ- 
ences occur across the boundaries of political commit- 
tees. To the extent that the negotiation lifts decision 
making outside of the corporatist ties in the agricul- 


tural policy setting, it will create new opportunities to 
promote agricultural liberalization. 

Evidence confirms that the international setting has 
an impact on the domestic policy jurisdiction. With re- 
gard to Japanese trade negotiations, for example, Fukui 
(1978) argues that the Foreign Ministry influence was 
greater in the Tokyo Round, while the domestic min- 
istries such as the Ministry of Agriculture had more 
influence in bilateral negotiations on narrow issues. 
Similarly, Japanese government decision making dur- 
ing the Uruguay Round brought top officials from five 
ministries together to coordinate policies. This enabled 
ministries such as the Ministry of International Trade 
and Industry (MITI) that typically have no voice on 
agricultural trade issues to play a role in agricultural 
policy decisions because these decisions also affected 
the progress of the entire negotiation. 

In the EU, the comparable question centers on which 
officials in the Council of Ministers shape the negotia- 
tion mandate. Although the agriculture ministers meet- 
ing in the Agriculture Council dominate decision mak- 
ing for issues directly related to CAP, broader trade 
policy issues related to negotiations are likely to be ad- 
dressed by the foreign and trade ministers meeting in 
the General Affairs Council. Member states opposing 
agricultural reform prefer to maintain control within 
the Agriculture Council, while those favoring reform 
try to push issues into the trade committee or the Gen- 
eral Affairs Council.!° The scope of issues in the ne- 
gotiation influences which council is likely to take the 
lead role and which procedures are followed under EU 
treaty provisions. 


Institutions and Credibility. Many have pointed to 
the role of international institutions in facilitating pos- 
itive linkages that promote cooperation (Haas 1980; 
Keohane 1984; Martin 1992). First, as Keohane (1984, 
91) writes, more quids make it easier to reach agree- 
ment in guid pro quo deal making. The institutional 
context of the GATI/WTO promotes the kind of cross- 
sector issue linkages discussed above. Equally impor- 
tant, the institutional context adds credibility to the 
decision to link issues because it makes the deci- 
sion represent a commitment in an ongoing process 
of repeated negotiations. Martin (1993, 129) explains, 
“Deals cut within an institution rather than outside one 
gain stability because members put increased value on 
their reputations for living up to agreements.” The costs 
of backing down from a commitment to link issues are 
greater because such action damages their “ability to 
reach mutually beneficial cross-issue deals” in the fu- 
ture. The institutional context also endows the linkage 
with greater legitimacy by providing a common set of 
procedures and norms that justify use of the linkage 
strategy. 

Publicizing the issue linkage as a formal agenda ac- 
complishes two purposes: First, it creates a focal point 


9 Ministry of Foreign Affairs Official Interview by author. 
Cambndge, MA, 18 November 1999; and MITI official. Interview 
by author. Tokyo, 3 June 1999. 

10 Official of the German Federal Ministry of Food, Agriculture, and 
Fishery. Interview by author Bonn, 20 July 1999 
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from which negotiators are reluctant to retreat; second, 
it signals to domestic groups that the success of any part 
of the negotiation will depend upon reaching an overall 
agreement. The formal agenda of the negotiation estab- 
lishes whether talks will address multiple sectors and 
whether agreement on the entire range of issues will 
form a single package. Considerable time and hard bar- 
gaining go behind the creation of the agenda, and this 
inhibits frequent renegotiation. Subsequent behavior 
by states reinforces the linkage. By obstructing discus- 
sions in one negotiating group to match the deadlock 
in another negotiating group, a state can force parallel 
progress on different issues. The combination of the 
formal agenda and the support for each agenda issue 
by some participating states creates a more credible 
cross-sector linkage. Information that liberalization for 
any sector is conditional on agreement on all issues 
provides an incentive for interest groups and officials 
to closely follow all parts of the negotiation rather than 
just the part related to their own sector. As a credible 
signal, the institutionalization of the issue linkage in the 
agenda and procedures of the negotiation strengthens 
the incentives for those who gain from free trade to 
lobby against protectionist interests. 


Linkage Hypothesis. The more institutionalized the 
linkage among multiple sectors in a negotiation, the 
more likely that the negotiation will liberalize agricul- 
tural trade barriers. 

Different levels of institutionalization form a weak or 
strong linkage among issues. The two key institutional 
features concern whether the agenda commits to lib- 
eralization of multiple sectors and whether procedures 
call for a single agreement with binding commitments 
on all of the agenda issues. An agenda that only calls 
for liberalization of one sector does not have any insti- 
tutionalized cross-sector issue linkage. An agenda that 
includes multiple issues but provides for flexibility to 
reach settlements on each issue separately forms a weak 
linkage. In contrast, a package negotiation structure es- 
tablishes a strong linkage by explicitly mandating that 
the negotiation will proceed in an all-or-nothing ap- 
proach that ties together deals on multiple sectors and 
issues to culminate in signing a single agreement. Com- 
parison of two GATT trade rounds and one APEC ne- 
gotiation illustrates these differences. The agendas for 
the two GATT negotiations, the Tokyo Round (1973- 
79) and the Uruguay Round (1986-94), as well as the 
Kuala Lumpur APEC ministerial meeting in 1998, gave 
a mandate for talks on a wide range of issues including 
both primary and industrial sectors and other rules re- 
lated to economic activities, such as investment regula- 
tions and product standards Nevertheless, the agenda 
and procedures of each negotiation present observable 
differences in the commitment to the cross-sector link- 
age. 

Weak institutionalization of the cross-sector link- 
age characterized the APEC “Early Voluntary Sec- 
toral Liberalization” agenda. APEC trade liberaliza- 
tion talks jointly address trade sectors ranging from 
agriculture to automobiles. The principles of volun- 
tarism and flexibility, however, explicitly allowed coun- 
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tries to set their own pace for the timing and scope 
of liberalization in any particular sector (Aggarwal 
and Morrison 1998, 13). Likewise, bilateral negotia- 
tions may also address multiple issues, but they often 
do not bind them together with a formal agenda. For 
both APEC and comprehensive bilateral negotiations, 
tradeoffs across issues in the negotiation are possible. 
The lack of a formal commitment to the linkage, how- 
ever, makes it more difficult to signal that concessions 
in one area are necessary for gains in another. 

There was a moderate linkage between agricultural 
and industrial issues in the Tokyo Round. The declara- 
tion that set the agenda for the Tokyo Round stated that 
the negotiation “shall cover . . . both industrial and agri- 
cultural products,” but also added flexibility by urging 
that the negotiations should “take account of the spe- 
cial characteristics and problems in this [agricultural] 
sector.”! It established several negotiation groups that 
produced agreements from which nations could pick 
and choose: “GATT à la carte” (Jackson 1997, 47). 
The agricultural group had a particularly weak link- 
age with the rest of the round. Subgroups for dairy, 
meat, and grains discussed separate commodity agree- 
ments rather than general principles for agricultural 
policy. Moreover, at the U.S. initiative, which wanted 
to include the Soviet Union (a major purchaser on 
grains markets that was nota GATT member), the talks 
on grains were primarily conducted outside the Tokyo 
Round in the context of the International Wheat Coun- 
cil (Winham 1986, 252). These talks ultimately failed to 
produce a new commodity agreement on grains, and 
states were able choose whether to join the dairy and 
meat commodity agreements regardless of their posi- 
tion on other issues in the round (GATT 1979, 26). In 
sum, the agenda statement and the procedures that al- 
lowed stand-alone agreements provided only moderate 
institutionalization of an issue linkage. 

In contrast, a strong cross-sector linkage character- 
ized the Uruguay Round. The Punta del Este Dec- 
laration that set the agenda for the Uruguay Round 
called for 15 groups to negotiate issues ranging from 
industrial goods to agricultural goods to reform of 
GATT trade rules. In the declaration, the term single 
undertaking referred to the commitment to decide 
jointly on all the parts of the negotiation and supported 
the refrain among negotiators that “nothing is agreed 
until everything is agreed” (Croome 1995, 34). Two 
later events further strengthened the linkage credibil- 
ity. First, some Latin American states walked out of the 
1990 Brussels meeting, declaring that they would not 
negotiate anything if the United States, EU, and Japan 
would not agree to a substantive agricultural liberal- 
ization package. Then, at the end of 1991, the GATT 
Director-General Arthur Dunkel independently pro- 
duced a draft agreement binding all parts of the negoti- 
ations into a single text.” A procedural step reinforced 





1! Text of the Tokyo Declaration issued at the ministerial meeting 
held ın Tokyo, 12-14 September 1973, in Winham 1986, 414. 

12 The “Draft Final Act Embodymg the Results of the Uruguay 
Round of Multilateral Trade Negotiations” specified terms for all ne- 
gotiation groups, including agriculture. Its provisions for agricultural 
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the concept of a single undertaking: The final agree- 
ment texts would form a single charter for a new trade 
organization such that accepting all agreements was 
a condition of membership (Hudec 1993, 193). Given 
this strong linkage, one would expect more liberaliza- 
tion in the Uruguay Round than in other negotiation 
fora. 

On the other hand, less liberalization is expected 
when there is no cross-sector linkage. The U.S.—Japan 
talks on beef and citrus in 1983 and the U.S.—EU talks on 
wine in 1991 are examples of single-sector negotiations 
that focused exclusively on agricultural products. This 
observable variation in the agenda, rules, and proce- 
dures across negotiations in terms of the sectoral scope 
and institutionalization of the commitment to a linkage 
approach facilitates testing the linkage hypothesis. 


ALTERNATIVE EXPLANATIONS 


Many factors influence why countries liberalize poli- 
cies. In order to evaluate the relative importance of a 
cross-sector issue linkage, I also consider the following 
alternative explanations. 


U.S. Influence: Threats and Appeals 


One way for the United States to exercise its power 
in economic negotiations is by threatening to close off 
access to U.S. markets through retaliation. It is a re- 
current pattern in U.S. trade policy for the administra- 
tion to pursue the goal of freer trade while threatening 
protectionism (Bayard and Elliott 1994; Gawande and 
Hansen 1999). United domestic support for the use 
of sanctions will increase the credibility of this strat- 
egy (Odell 2000; Zeng 2002). Public threats would be 
expected to increase the likelihood of liberalization 
because the prospect of retaliation triggers lobbying 
by targeted industries. Although implicit diplomatic 
threats and linkage with alliance relations are also im- 
portant sources of leverage, they are difficult to ob- 
serve in any given negotiation. Moreover, both Japan 
and most EU member states have been allied with the 
United States over the period of this study. Therefore, 
I focus on threats of economic retaliation. 

The bilateral trade balance will also influence the 
effectiveness of U.S. pressure. During times of large 
trade deficits, political demands in the United States 
urge protectionism at home and push for export ex- 
pansion. In response, industries in Japan and Europe 
dependent on exports to the U.S. market may urge their 
government to reduce trade friction with compromises 
on agricultural trade disputes. Thus a growing deficit 
could increase pressure for liberalization. In addition, 
from the perspective of balancing the U.S. trade deficit 
and promoting U.S. priority items, one might expect 
a pattern favoring liberalization of products that con- 
tribute most to U.S. agricultural export earnings. 


policy changes broadly outlined the final agreement that was ac- 
cepted, although some revisions were made (Croome 1995, 296). 
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Domestic Politics In Japan and the EU 


Bargaining models that incorporate domestic politics 
imply that Japan and the EU will be less likely to lib- 
eralize agricultural policies when the government is in 
a weaker position. A negotiator whose hands are tied 
by domestic resistance has a smaller range of possible 
agreement (Schelling 1960), and this situation may 
force the United States to accept less liberalization. 
When the government has a narrow majority it can cre- 
dibly argue that agricultural liberalization is politically 
impossible because politicians must appeal to the im- 
portant farm vote. Political strength at home, therefore, 
can translate into weakness at the bargaining table. 

Japan provides an interesting case for testing 
whether a stronger majority will make liberalization 
more likely even when the ruling party favors farm 
protection. Farmers have long formed one of the key 
support bases for the conservative Liberal Democratic 
Party (LDP). Although the LDP maintained ruling 
party status from its formation in 1955 until 1993, there 
has been intense competition over district seats and 
a continued majority has not always been a certainty. 
When the LDP majority is slim, the party leadership 
finds it difficult to persuade party members to consent 
to an agreement that would inflict pain on an important 
constituency like farmers (George 1990, 133). Conse- 
quently, liberalization will be more likely when the LDP 
has a stronger majority in the Japanese Diet. 

In the case of the EU, domestic politics and inter- 
governmental politics between member nations calls 
for playing a three-level game (Patterson 1997). For the 
sake of the aggregate analysis, however, I only exam- 
ine two levels by considering politics in the Council of 
Ministers as representing the domestic politics of the 
EU. National governments in the Council approve the 
mandate to give negotiating authority to the European 
Commission. Thereafter, the Commission acts as the 
sole negotiating authority. Since the final negotiation 
agreement must also gain Council approval, however, 
the Council continues to play a role setting limits on 
negotiation agreements (Meunier and Nicolaidis 1999). 
The rotating office of the president of the Council of 
Ministers offers a proxy for variation in Council poli- 
tics. Although any member can try to build a coalition 
against an agreement, the government that holds the 
Council presidency has a stronger role as agenda-setter 
(Sherrington 2000). When France, the strongest oppo- 
nent of agricultural liberalization, holds the influential 
post of Council president during the key initial or fi- 
nal years of a negotiation, Commission negotiators can 
more credibly argue that their hands are tied so that 
they cannot accept any liberalization. 


Budget Constraints 


Liberalization may occur because a government de- 
cides it no longer can afford the costs of protection in 
terms of budget expenditures and inefficient use of eco- 
nomic resources. Studies of EU and USS. agricultural 
reforms cite cost-cutting as a major motivation for low- 
ering agricultural trade barriers and subsidies (Moyer 
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and Josling 1990; Paarlberg 1997; Orden, Paarlberg, 
and Roe 1999). The EU devotes more than half of its 
total budget expenditures to agricultural programs, and 
European officials openly discuss the need to reform 
agricultural policies in order to reduce expenditures. 

If budget constraints are the force behind liberal- 
ization, then one would expect liberalization to occur 
during periods of belt-tightening when there is a reduc- 
tion in agricultural budget expenditures. Conversely, 
one could also argue that rising expenditures will make 
governments want to start cutting programs so they 
would be more likely to liberalize agricultural poli- 
cies when agricultural budgets grow rapidly. The role 
of commodity prices further complicates the budget 
dynamic because low commodity prices increase both 
agricultural budget expenditures and farmer demand 
for protection. This article provides one test of the role 
of agricultural budget growth, but further analysis is 
necessary to explain the connections between prices, 
spending, and trade protection. 


Number of Participants 


Another important characteristic of many interna- 
tional negotiations is multilateralism. The term multi- 
lateralism often refers to both multiple countries nego- 
tiating and the kind of institutions that are established 
to facilitate such negotiations. This tendency to conflate 
the institutional features of the negotiation structure 
with the number of participants can interfere with anal- 
ysis of the sources of cooperation. I evaluate whether 
it is the number of participants per se or the need for 
linkage that accompanies negotiations with many par- 
ticipants that provides the leverage of multilateralism. 

Studies about whether more is better have con- 
cluded that it depends on the issue and the nature 
of the cooperation problem (Kahler 1992; Martin 
1992; Sebenius 1983). On the one hand, more partic- 
ipants promote cooperation because there are more 
opportunities to find agreements with mutual gains. 
Multiple countries making the same demand as the 
United States could increase the pressure for liberaliza- 
tion. On the other hand, the problem of free-riding and 
the difficulty of bargaining increase with large numbers 
(Oye 1985). One could also argue that the lack of coun- 
terbalancing by a coalition would make it easier for the 
United States to maximize its leverage in a bilateral 
negotiation. Therefore, the number of participants rep- 
resents an important feature of the negotiation context. 


AGRICULTURAL TRADE NEGOTIATION 
DATA 


I analyze an original dataset of 267 cases of agricultural 
commodities that were the subject of U.S. negotiations 
with Japan or the EU during the period between 1970 
and 1999. The dataset includes both trade rounds and 


B The European Agricultural Guidance and Guarantee Fund ex- 
penditures on agricultural policies were 89% of the 1970 total EC 
budget and 51% of the 1999 budget (EC 1999) 
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TABLE 1. Types of Negotlatlon Contexts 
Included In the Dataset 
Percentage 

Japan EU 
Trade Arena (153 Casas) (114 Cases) 
Tokyo Round 18.95 11.40 
Uruguay Round 12.42 22.81 
Dispute adjudication 29.41 36.84 
Bilateral 34.64 28.95 
APEC 4.58 — 











dispute adjudication cases as well as negotiations out- 
side of the GATT/WTO institutional framework, such 
as bilateral talks and the 1998 Kuala Lumpur meet- 
ing of APEC (Table 1).* Some of the negotiations in 
the dataset addressed only agricultural trade barriers, 
while others involved multiple sectors. A negotiation 
on an agricultural trade barrier forms the unit of anal- 
ysis. A negotiation is defined as an “exchange between 
two or more countries that begins with a demand for 
policy reform and concludes when there is either an 
agreement or a decision to indefinitely halt talks on the 
issue” (Davis 2003, 85). The Appendix provides details 
on coding rules. 

Each case is a single commodity or commodity group 
among those being negotiated. Hence, in a large ne- 
gotiation like the Uruguay Round there are multiple 
cases, while another negotiation will focus exclusively 
on a single commodity. Within negotiations, there is 
considerable variation that can be hidden by aggrega- 
tion of products. During the Tokyo Round, for example, 
no liberalization occurred for grains, while both Japan 
and the EU agreed to partial changes for beef. Multi- 
ple negotiations on a single commodity are treated as 
separate cases when there is a clear initiation and end 
to one negotiation and a second initiation for the next 
negotiation. In order to consider the special nature of 
such repeated talks on similar issues, I include a count 
variable for the number of the particular negotiation in 
the sequence of negotiations on the same product. 

Negotiated policy liberalization is defined as the re- 
duction of explicit trade barriers through agreement in 
negotiation. Negotiation outcomes are categorized into 
three levels of liberalization: none, minor, and major 
policy change. The explanatory variables used to pre- 
dict liberalization are summarized in Table 2, and de- 
scriptive statistics and coding definitions are provided 
in the Appendix. I test my main argument using a link- 
age variable that is a four-level ordinal scale for the 
institutionalization of an issue linkage between agri- 
cultural and other sectoral issues. Cases negotiated in 
the Uruguay Round are coded four for the strongest 
linkage while Tokyo Round cases are coded three for 


14 In Table 1, a case ıs categorized as dispute adjudication 1f a for- 
mal complaint was filed under GATT or WTO dispute settlement 
procedures. Some cases ended after consultations, while others were 
concluded after a final panel ruling. See Busch and Reinhardt 2001 
and Davis 2003a for analyms of the effect that these rules have on 
trade negotiations 
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TABLE 2. Summary of Model for Pollcy Liberalization 














Result 
Varlable Predict Japan EU Operationallzation (Measure) _ 
Linkage + + + Strength of cross—sector linkage (1—4) 
Threat + + Spectfictty of threat (1—4) 
Trade balance — + Bilateral trade balance In goods (U.S. billlon $) 
Product value + Product share In total U.S. agr. exports (%) 
Pollttics + + NA LDP share of Lower House seats (%) 
- NA — French presidency of council (1—3) 
Budget — — Annual agriculture budget growth (%) 
Growth + + Annual growth (per capita GDP) (%) 
Mutti + Multiple-country demand (0—1) 
Export — NA — Case about export subsidy (0—1) 
Count + Case number In serles of repeated negotiations (1-6) 
Note. Vanables represent those used In the regression analysis. The second column grves the expected direction of the varlable's 
Influence on liberallzabon outcomes. The Result columns summartze the direction of Influence for statistically significant regression 
estimates (base models in Table 3), which are discussed in text. NA, not applicable 
——— 








a moderate linkage. Cases in multisector negotiations 
with a weak linkage in the agenda are coded two, and 
negotiations that only addressed agricultural topics are 
coded one. 

Additional variables represent factors that other the- 
ories have highlighted as important for trade negotia- 
tions. The coercive influence of U.S. threats is measured 
by a four-level ordinal scale, ranging from absence of 
threat to a specific threat or the initiation of retaliatory 
sanctions. Two variables, the bilateral trade balance 
and the value of the specific product in the profile of 
U.S. agricultural exports, control for possible sources 
of variation in the pressure from the U.S. demand for 
liberalization. 

The idea that domestic political constraints may add 
bargaining leverage is represented in the Japan model 
by the strength of the LDP majority. A proxy of tied 
hands in the EU analysis is the power of France over the 
negotiation, which is assumed to be relatively greater 
when it holds the presidency of the EU Council of Min- 
isters during the initial or concluding year of the nego- 
tiation. Domestic political constraints are expected to 
be higher when the LDP is weak or when France holds 
the EU presidency. 

A variable for agriculture budget growth tests the 
argument that the need to reduce spending motivates 
liberalization. Periods of budget growth apply less pres- 
sure for liberalization than when there are budget cuts. 
A variable for economic growth controls for other eco- 
nomic factors that may influence decisions about trade 
policy. 

Finally, I include an indicator variable for multiple 
countries being involved in the negotiation. This allows 
me to evaluate the effect of linkage, while controlling 
for the fact that most negotiations with linkage also 
involve many participants. 

For the EU, another variable is necessary to account 
for the special nature of negotiations dealing with pro- 
tection oriented toward export promotion. Since Japan 
has minimal agricultural exports and no export subsi- 
dies, this variable is not applicable to the Japan model. 

the EU policy context, however, export subsidies 
are a core policy of CAP. Moreover, they represent an 


area where GATT trade law has been weakened by 
exemptions and vague language (Hudec 1993, 147-54). 
The combination of stronger interests and weaker rules 
makes these among the toughest negotiations with 
the EU. 


EMPIRICAL ANALYSIS 


I evaluate the effect of issue linkage by analyzing data 
from U.S.—Japan negotiations and U.S.-EU negotia- 
tions. I choose not to combine the Japan and EU 
datasets since separate analysis allows me to examine 
whether the same variables have different effects in 
each political context. Given that the dependent vari- 
able measures policy liberalization with three ordered 
categories, I use the ordinal probit model. 

Table 3 presents the regression analysis. The results 
show that in Japan and Europe, cross-sector linkage 
significantly increases the likelihood of liberalization. 
There is also evidence that threats and LDP strength 
promote liberalization by Japan and that the role of 
France along with budget conditions influences liber- 
alization by the EU. The impact of these other factors, 
however, appears to be smaller compared with the im- 
pact of cross-sector linkage. 

The findings were robust to different model specifica- 
tions. The results were consistent when a probit model 
is estimated using a recoded liberalization variable that 
collapsed the top two categories of major and minor lib- 
eralization to form a simple binary outcome. Recoding 
linkage into a binary variable for whether the agenda 
included only agricultural issues or multiple sectors 
also produces a significant positive effect. Results were 
also consistent when I introduced a time period vari- 
able to account for interconnection between cases not 
captured by the other variables» As a test for omit- 
ted variables related to the different agricultural prod- 
ucts, I ran a fixed effects model including commodity 





15 The time penod variable divided the 30 years into five categories 
related to the two major trade rounds: 1969-72 (pre-Tokyo Round), 
1973-79 (Tokyo Round), 1980-85 (pre-Uruguay Round), 1986-94 
(Uruguay Round), and 1995-99 (WTO). 
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TABLE 3. Ordinal Probit Regression Models 








Variable 








Cross-sector IInkage 
Threat 

Trade balance 

Product value to the U.S. 
Budget growth 

Economic growth 

LDP strength 

French Council preskdency 


Export subsidy 
Muttcountry demand 


Count 


First threshold 
Second threshold 


log-ilkallhood 
T statistic 


N 


Note: The constant Is sat to zero. Robust standard errors are In parentheses. The fixed effects models Include 17 commodity categories 
for the Japan data and 13 commodity categories for the EU data, although these coefficients are not reported here * Significant at the 


5% level. * Significant at the 1% level 


indicator variables. This model attempts to capture 
product-specific effects related to the unobserved po- 
litical or economic characteristics of the commodity be- 
ing negotiated. For example, some products with more 
political clout in terms of the number of farmers or 
campaign contributions, etc., could be less likely to be li- 
beralized. Following the OECD Agricultural Accounts 
commodity categories, I classified the 153 Japanese 
cases into 17 commodity groups and the 114 EU cases 
into 13 commodity groups.!® The fixed effects models 
give results similar to the base models used for the anal- 
ysis below. Given the possibility for some correlation 
among negotiation cases, the estimates in Table 3 are 
based on Huber/White robust standard errors. 

It is possible that there is endogeneity in the agenda- 
setting process so that the level of domestic opposition 
to liberalization influences whether there is a strong 
linkage. This article has addressed this issue in part 





16 See the Appendix for a list of commodity categories In some cases, 
I made a deviation from the OECD categories, such as separating rice 
into a separate category for Japan while leaving it m the cereals group 
for the EU, where there is less reason to expect special treatment. 
Processed products are included in the closest raw matenal category, 
e.g , pasta 1s m the cereals group. Industrial crops is used as the base 
category. 
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through selecting the agricultural sector for analysis 
and including a commodity fixed effects model to allow 
for variation among agricultural products.” Examples 
such as the decision to negotiate Japan’s rice import ban 
in the Uruguay Round also indicate that cases with high 
domestic opposition can arise in linkage negotiations. 
This reduces concern that only cases that will be easy to 
liberalize have a strong linkage. Nevertheless, more sys- 
tematic study is necessary to understand the selection 
mechanism for the institutional setting of negotiations. 

Table 4 shows how a shift in the level of one variable 
will affect the predicted probability of major liberaliza- 
tion while all other variables are held constant.!8 Use 





17 An alternative approach would be to use a two-stage selection 
model where the choice of negotiation forum 1s modeled ın the first 
stage and the effect of Imkage 1s estimated in a second stage. The 
use of such selection models requires a valid instrument, a variable 
that affects the negotiation outcome only through the linkage choice. 
The recent hterature has found that without such an instrument, the 
model could yield misleading results (e.g. Heckman et al. 1998). 
In this case, it seems that any variable that is related to choice of 
negotiation agenda is also related to the other variables that influence 
liberalization outcomes 

18 Holding other variables constant mvolves setting variables to therr 
mean levels and multiplying these means by their coefficient esti- 
mate. In Table 4, I only summarize the change in the top category of 
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TABLE 4. Change In Predicted Probablllty of Major Liberalization 





Change In Predicted Major Liberalization 








Variable 

Change Japan EU 
Strong linkage 134 0.64 (0.33, 0.85) 0.50 (0.10, 0.80) 
Strong threat 144 0.43 (0.18, 0.64) 0.09 (—0.14, 0.35) 
LDP strength 49% > 59% 0.44 (0.25, 0.59) 
French presidency 13-2 —0.25 (—0.39, —0.11) 
Budget growth 0% — 25% —0.20 (—0 49, 0.11) —0.23 (—0.37, —0.09) 
Multicountry 0-1 —0.13 (—0.35, 0.10) —0.25 (—0.54, 0.08) 








Note The predicted probability of major liberalzabon when changing the varlable of Interest and holding all other 
variables constant at thelr mean. Results are based on 1,000 simulations using the parameter estimates from the 
base models in Table 3. The 95% confident interval ıs given in parentheses. 










of simulation techniques allows me to generate these 
predictions from the coefficient estimates while taking 
into account estimation uncertainty, which is shown in 
the confidence interval. 


Cross-Sector LInkages Promote 
Liberalization 


The data provide strong support for the linkage hy- 
pothesis. Table 4 shows that an increase in the linkage 
variable from its lowest to its highest value causes a 
0.64 increase (or 64 percentage point increase) in the 
predicted probability of major liberalization by Japan, 
all else being equal. In other words, the model predicts 
that a U.S. negotiation with Japan would end in major 
liberalization with an 0.18 probability for negotiations 
that address agricultural issues in the absence of a cross- 
sector linkage of any kind, while it predicts the same 
outcome with an 0.82 probability when there is a strong 
linkage. The evidence from the EU data corroborates 
this finding. Shifting the linkage variable from one to 
four results in a 0.50 increase in the probability of major 
liberalization by the EU. 

Figure 1 illustrates that a strong cross-sector linkage 
has a greater effect than leading alternative explana- 
tions, such as threats against Japan or EU budget con- 
straints. The four triangle plots display the condition- 
ally independent effects of the specified variables on 
predicted probabilities of liberalization. The vertices 
of the triangle correspond with the three categories 
of the dependent variable and provide coordinates for 
the points inside the triangle, each of which represents 
a simulated prediction for the negotiation outcome. 
Points closer to the upper vertex of the triangle, for 
example, indicate a high probability of major liberal- 
ization, while those closer to the lower left vertex in- 
dicate a high probability of no liberalization. Points in 
the center of the triangle indicate an equal probability 
for all three categories of liberalization. Each triangle 
displays the predictions from two scenarios, one with 
the specified variable set to a high level and the other 


liberalization, although predictions change across all three cate- 
gories, as illustrated in Figure 1. 

The simulation technique used here involves 1,000 random draws 
of coefficients and threshold parameters from a multivariate normal 
distribution I calculate 1,000 sets of predicted probabilities using the 
stmulated parameters (King, Tomz, and Wittenberg 2000). 


with the variable set to a low value, while all other vari- 
ables are held constant.” The clusters of points within 
a triangle represent the simulated predictions for each 
scenario—black dots represent the null category (no 
linkage, no threat, or zero budget growth) and gray 
plus signs represent the high category (strong linkage, 
strong threat, or 25% budget growth). 

The upper left triangle compares the predicted prob- 
abilities for a strong cross-sector linkage with those 
when there is no linkage, based on the U.S.—Japan data. 
The cluster in the upper vertex of the triangle indicates 
that a strong cross-sector linkage brings a high prob- 
ability of major liberalization by Japan with a small 
uncertainty for the predictions. The cluster centered in 
the lower half of the triangle shows that the predic- 
tions for negotiations with no linkage lie between none 
and minor liberalization. The large gap between the 
predictions for the two scenarios signifies the effect of 
changing the level of cross-sector linkage on the overall 
predicted probability of liberalization. In contrast, a 
threat has slightly less effect on Japan’s liberalization, 
as shown by the smaller gap between the no-threat and 
strong-threat clusters of predictions shown in the lower 
left triangle. 

The two triangles on the right report predictions from 
the U.S.-EU data. The spread of each cluster of simu- 
lated predictions indicates that there is greater uncer- 
tainty for the predicted effect of linkage on the EU 
relative to Japan. Nevertheless, the positive effect of 
cross-sector linkage on EU liberalization is apparent 
in the difference between the clustered predictions of 
the two linkage scenarios. This is a larger effect than 
observed in the lower right triangle, where the com- 
parison of budget scenarios shows that the two clusters 
of predictions are close to overlapping. The negative 
direction for the budget variable is shown by the place- 
ment of predictions for the 25% budget growth scenario 
below those for the null category of zero budget growth. 
The results confirm that issue linkage significantly pro- 
motes liberalization and that the effect is substantial 
compared with the leading alternative explanations for 
Japanese and EU agricultural liberalization. 





20 The parameters from the regression estimates shown in Table 3 
form the basis for smulating predictions of hberalization outcomes 
in each of the two scenarios. This kind of figure 1s called a “ternary 
plot” and has been suggested as a way to display results for a three- 
category dependent variable (King, Tomz, and Wittenberg 2000). 
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| FIGURE 1. Effects of Linkage on Liberalization 





Japan 


Major 














Qualitative Evidence for Cross-Sector 
Linkage 


A closer look at a few key negotiations supports the ag- 
gregate evidence that a strong cross-sector issue linkage 
promotes liberalization by broadening the mobilization 
of industry and expanding the policy jurisdiction. In the 
Uruguay Round negotiation, Japan tried to argue that 
the need for food security should justify price supports 
and import quotas for staple foods. The government 
adamantly refused to consider allowing rice imports, 
even when U.S. and GATT officials made a special plea 
in the fall of 1990 before a critical meeting of the trade 
round.” More than half of Japanese farmers grow rice, 
and it had been considered a political taboo to discuss 
market opening. After the failure of this meeting, how- 
ever, export industries began lobbying for agricultural 
trade liberalization as the credibility of the cross-sector 
linkage increased. In December 1991, when Dunkel put 
forward his draft agreement calling for joint conclusion 
of negotiations on all sectors, Japan's leading business 
association endorsed it in its entirety and urged the 
government to be more flexible on agricultural talks 
(Shiwaku 1994, 14).? Senior LDP politicians began to 





21 Nihon Nogyd Shimbun, 4 December, 8 December 1990. 


Z Interviews by author of a Japanese government (MAFF) nego- 
tiator, November 1998, and Keidanren official, October 1998. Also 
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Note: Each triangle shows the predictions for negotiation outcomes based on 200 slmulatons using the estimates from the base models 
In Table 3. The predictions compare two scenanos by changing the vanable of interest from Ite lowest to its highest level while holding 
all other varlablee constant at their mean. Each point represents one prediction of liberallzaton under the specified scenario 
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issue public statements that Japan would have to accept 
some kind of partial liberalization.” In government 
discussions, MITI became concerned about the agricul- 
tural negotiations blocking the round and urged con- 
cessions for the sake of the Uruguay Round. Faced 
with internal divisions over the gains promised by the 
round and the necessity to accept agricultural liberal- 
ization as part of the package, Japan made concessions 
on even the most important agricultural item—rice. 
The rice-opening agreement was the result of a 
compromise proposal submitted by the GATT official 
heading the agriculture negotiating group. Although 
rice would remain heavily protected, the government 
agreed to end the ban against imports and to guarantee 
the purchase of five percent of domestic consumption 
as imports, with provisions for a gradual increase and 
tariffication plan. Prior to making the decision to accept 
rice imports, Prime Minister Morihiro Hosokawa said 
to his staff, “Japan cannot become the criminal that 
wrecks the Uruguay Round” (Karube 1997, 104). In 
his public announcement, he called for Japan to endure 
sacrifices in difficult areas such as agriculture for the 
sake of the free trade system and successful conclusion 





see the Keidanren Report “A Call for Decasrve Action to Ensure the 
Success of the Uruguay Round” (March 1992) 

D Asahi Shimbun, 18 November 1991 

M Former MITI official. Interview by author. Tokyo, 3 June 1999. 
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to the Uruguay Round. Similar arguments were used 
in Diet debates as well. The following exchange during 
a Diet committee meeting is illustrative: A senior LDP 
politician, Kösuke Hori, argued that even partial lib- 
eralization would violate the Diet resolutions for com- 
plete self-sufficiency in rice. He urged the government 
to renegotiate the proposal with a tougher position. 
Hosokawa responded that it was necessary to evaluate 
the Uruguay Round negotiation as a whole and that, 
in a multilateral negotiation, it was unreasonable for 
Japan to insist that it could not import even a grain of 
Tice. 

The reforms achieved in the context of the Uruguay 
Round also demonstrate the influence of issue link- 
age to promote liberalization in Europe. In 1986, the 
EU only agreed to discuss agricultural liberalization 
because of its strong interest in the financial and service 
sector. Thereafter, the linkage of agricultural talks with 
service and industrial goods talks, which was reinforced 
by the long deadlock over agricultural issues, persuaded 
EU ministers that internal reform of CAP was neces- 
sary to conclude the round (Tangermann 1996). A veto 
of the agriculture agreement was narrowly avoided as 
the EU followed the package approach in its own de- 
cision making by not voting on the agricultural com- 
ponent separately from the Uruguay Round (Paemen 
and Bensch 1995, 237). France would not invoke a veto 
in the Council of Ministers without German support, 
which was not forthcoming. German industrial groups, 
along with the EU-level industry association UNICE 
(Union of Industrial and Employers’ Confederations 
of Europe), strongly advocated an agricultural agree- 
ment for the sake of successful conclusion of the trade 
round.” Moreover, French interests in the industrial 
and service sector agreements also favored the success- 
ful conclusion of the Uruguay Round, and employer 
groups began to pressure the French government to 
compromise (Vahl 1997, 253). The package approach 
helped gain EU acceptance of an agricultural agree- 
ment that had faced initial public rejection and threat 
of a veto by France. 

In contrast, a weakly institutionalized cross-sector 
linkage contributed to the inability of the United States 
to persuade Japan to make any concession for fish or 
forestry liberalization during the 1998 Kuala Lumpur 
meeting of APEC. Based on the principle of volun- 
tarism and lacking any kind of dispute mechanism, 
APEC is widely viewed as a negotiation forum with low 
levels of institutionalization (Kahler 2000). The agenda 
for the 1998 meeting included a cross-sector linkage 
calling for liberalization of nine priority sectors and a 
pledge to pursue liberalization of six additional sec- 
tors the following year.” The linkage was weak, how- 


2 128th Session of the Lower House of the Diet, Budget Committee 
Meeting Records, 8 December 1993. Author’s translation. 

26 Financial Tunes, 21 November 1992; UNICE, “Industry and Ser- 
vices Are Poor Relations m the Negotiations,” 3 December 1992 

27 The nine priority sectors were forestry, fisheries, environmental 
goods, toys, gems and jewelry, chemicals, medical equipment, en- 
ergy, and telecommunrications. The six sectors in the secondary tier 
for subsequent liberalization included oilseeds and products, food, 
rubber, fertilizer, autos, and avil aircraft. 


ever, because the package was labeled “early volun- 
tary sectoral liberalization,” and the agenda made ex- 
plicit reference to flexibility for the liberalization of any 
given sector with difficult circumstances. The United 
States along with several Southeast Asian nations in- 
sisted that Japan must contribute to liberalization on 
all nine sectors, including fisheries and forestry. Rising 
trade dependence made Japan especially vulnerable to 
US. pressure in 1998 as declining demand in recession- 
struck Asia and Japan left the U.S. market as a crucial 
outlet for Japanese exports. 

Resistance from the forestry and fisheries lobbies in 
Japan was strong, however, and the negotiation did 
not force tradeoffs with other interests. The Ministry 
of Foreign Affairs along with a top Cabinet official 
backed the Ministry of Agriculture’s opposition to any 
concession. Protests from business groups that had 
been heard during the Uruguay Round, when rice stood 
as an obstacle to agreement, did not materialize. Tak- 
ing a unified position, the common refrain by Japanese 
officials and politicians was that the appropriate forum 
for discussing tariffs was the WTO, where binding com- 
mitments could be made, not APEC.” The voluntary 
nature of commitments reduced expectations for the 
likely gains from the negotiation and any sense of ne- 
cessity for painful cuts. In the end, no agreement was 
reached. Although the agenda lacked an institutional- 
ized all-or-nothing approach, in fact, the choice came 
down to moving forward on the package as a whole 
or not at all. The U.S. government reported that Japan 
had blocked the agreement by refusing to participate 
in fishery and forestry liberalization, while officials of 
Japan and some of the other member governments em- 
phasized that the provisions for flexibility and volun- 
tarism in APEC meant that there was no obligation 
to participate.” The weakly institutionalized linkage 
allowed for different interpretations over which sectors 
had to be included, dooming any chance for liberaliza- 
tion of a sensitive sector. 


Other Pressures for Liberallzation 


While the institutionalization of linkages emerges as 
a major factor to explain the variation in negotiation 
outcomes, the evidence also supports some of the al- 
ternative explanations. As discussed above and shown 
in Table 3, threats add leverage to liberalization de- 
mands. The variable measures the strength of the threat 
in terms of whether it was a general threat of a possible 
rise in U.S. protection against exports or a more spe- 
cific threat backed by a deadline for retaliation. The 
significant and positive coefficient for the Japan model 
indicates that Japan is reactive to U.S. pressure, which 
has been widely discussed in studies of Japanese foreign 
policy (Mulgan 1997; Schoppa 1997). A strong threat 


28 Ministry of Foreign Affairs official. Interview by author, Tokyo, 
14 May 1999. 

2 Yomiun Shumbun, 16 November 1998. 

% Office of the U.S. Trade Representative, The 1999 National Trade 
Estimate Report on Foreign Trade Barriers, 219; New Straits Times, 
14 November 1998. 
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leads to an increase of 0.43 in the probability of major 
liberalization. On the other hand, threats have little 
influence against Europe. 

The variation in negotiation outcomes challenges the 
notion that budget constraints alone explain the pattern 
of liberalization. The rise and decline of budgetary con- 
ditions and economic growth have minimal impact on 
negotiation outcomes with Japan—the variable mea- 
suring growth in agricultural spending does not reach 
significance. Given that agricultural policies have long 
absorbed over half of EU spending, it is not surprising 
that there is a strong relationship between the budget 
and agricultural policies in the EU. The negative effect 
of agriculture budget growth in the EU model confirms 
that the EU is less likely to liberalize agricultural trade 
policies when spending is growing. Budget conditions, 
however, have only a modest impact on trade negoti- 
ation outcomes. For example, a hypothetical scenario 
of a 25% increase in the EU agriculture expenditures 
during the first year of the negotiation reduces the pre- 
dicted probability of major liberalization by 0.23, which 
is less than half the magnitude of change resulting from 
a strong linkage (Table 4). On the other hand, contin- 
uation of expensive protection policies contributes to 
the eventual need for cutbacks. A fuller understand- 
ing of the relationship between budget constraints and 
liberalization requires further study. 

The indicator variable for negotiations involving ex- 
port subsidies as the trade barrier is highly significant 
for the EU model, with a large, negative effect on liber- 
alization. The difficulty of negotiating liberalization of 
this particular kind of policy has several implications. 
Since France is the primary beneficiary of export subsi- 
dies for its productive grains sector, the negative effect 
of this variable highlights strong French influence over 
EU agricultural policy. At the same time, export subsi- 
dies distort world markets by lowering world prices and 
stealing third-country markets from other exporters. 
As such, their elimination has been a top U.S. priority 
(Paarlberg 1997, 432). The fact that the policy receiv- 
ing the greatest U.S. pressure has been so resistant to 
change shows the limits of U.S. bargaining leverage. 
Finally, export subsidies represent the most egregious 
kind of budget expenditure. Hence the negative effect 
of this variable suggests that neither U.S. policy prior- 
ities nor EU budget constraints determine the pattern 
of EU liberalization. 

As expected, when France holds the position of 
Council president, it blocks liberalization. In a hypo- 
thetical tough negotiation when France holds the pres- 
idency during either the initial or concluding year of 
the negotiation, the model predicts a 0.25 decrease in 
the probability of major liberalization compared with a 
negotiation in which France is simply one among many 
in the Council room. Anecdotal evidence offers addi- 
tional support. For example, the EU Trade Commis- 
sioner told a senior U.S. trade official that there would 
not be an end to the WTO banana dispute during the 
French presidency.*! Indeed, the EU did not resolve 


31 USTR official to the U.S. delegation to the EU. Interview by au- 
thor. Brussels, 10 July 2000 
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the banana case until April 2001, four months after 
the end of the French presidency. In contrast, when a 
government more favorable to agricultural reform is 
Council president, it may encourage greater involve- 
ment of trade ministers in Council decision making to 
prevent the Agriculture Council from acting as a veto 
player. This has been reported as an important factor 
that helped bring the EU to compromise on agriculture 
in the Uruguay Round (Woolcock and Hodges 1996, 
313-14). Such stories corroborate the influence of the 
Council presidency on trade negotiations, a factor that 
has been largely overlooked in previous studies. 

The analysis confirms that LDP strength makes lib- 
eralization by Japan more likely. The variable for the 
LDP share of the total seats in the Lower House of 
the Diet is positive and highly significant. Compar- 
ing the scenario of the LDP holding 49% of the seats 
in the Lower House with a scenario of the party holding 
59% of the seats yields a 0.44 increase in the predicted 
probability for major liberalization. This is a substantial 
change, but it is still smaller than the predicted increase 
in the probability of major liberalization that resulted 
from a strong linkage in the negotiation. 

Finally, the indicator variable for multiple countries 
making a demand for liberalization during the negoti- 
ation is not significant. Indeed, counter to the notion 
that there is power in numbers, the negative direc- 
tion means that multiple countries making demands 
may slightly reduce the likelihood of liberalization. 
The power of multilateralism is not because multiple 
countries making demands during a negotiation apply 
greater peer pressure or create more bargaining oppor- 
tunities. Rather, multilateralism is important through 
the institutions that often accompany negotiations in- 
volving many countries. In particular, the use of a pack- 
age negotiation structure is one of the most important 
of these institutional features. 


CONCLUSION 


Institutionalized cross-sector linkages promote liberal- 
ization in the face of strong opposition from perhaps the 
most powerful interest groups—Japanese and Euro- 
pean farmers. For both Japan and the EU, a strong link- 
age changes the predicted negotiation outcome from an 
expectation of minor or no policy change to an expecta- 
tion of major liberalization. Case study evidence shows 
that linking agricultural and industrial issues builds the 
prospects for liberalization by shifting the aggregation 
of domestic interests, Without a strong issue linkage, 
the united strength of the farm lobby and jurisdictional 
autonomy of the agriculture ministries reinforce the 
status quo protection policies. Presence of a linkage 
increases negotiation stakes, and this leads to more lob- 
bying by industrial export interests against agricultural 
protection as well as a greater role for bureaucrats and 
politicians outside of the agricultural policy community. 
Even while taking into account other factors such as 
threats and budget constraints, negotiation structure 
Temains one of the most important determinants of 
policy outcomes. 
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When narrow interests defend the status quo, broad- 
ening the scope of actors and interests can provide the 
impetus for change. A cross-sector issue linkage that 
is institutionalized as a package deal combines issues 
with a credible linkage to make it politically possible 
for leaders to choose liberalization over the protests of 
influential lobbies. This highlights the possibility for the 
structure of an international negotiation to compensate 
for the political market for protection in the domestic 
arena. 

The advanced industrial nations spend over $300 
billion a year subsidizing their farmers. Consumers in 
rich countries and farmers in poor countries are among 
the leading beneficiaries of liberalization, but they have 
been unable to bring change on their own. International 
negotiations represent a critical venue for adding pres- 
sure to reduce the subsidies and trade barriers. While 
it is unlikely that any negotiation will bring an end to 
agricultural protection, understanding what leverage is 
more effective will help to reform some of the most 
trade distortionary policies in the world economy. 


APPENDIX 


Dependent Variable: Policy Liberalization 


The dependent variable measures the degree of policy lib- 
eralization in the negotiation agreement (Tables 5 and 6). 
Outcomes are categorized into three levels of liberalization: 
no liberalization for status quo or nominal change; minor 
liberalization for partial change in the size or coverage of 
quota category, modification in the use of quarantine stan- 
dards, reduction in subsidy levels; and major liberalization for 
tariffication of quantitative restriction, change in the nature 
of health/quarantine measure, tariffication of variable levy 
when accompanied by substantial tariff or subsidy reductions. 

I attempt to include the full universe of negotiation cases 
in the public record. In negotiations with many commodities 
being negotiated, I only include those that were singled out 
for specific mention by either side. The primary sources for 
coding negotiation cases were the annual yearbook Ndrinsho 
Nenpo published by the Japanese Ministry of Agriculture, the 
Agriculture Situation in the Community Annual Report pub- 
lished by the European Commission, and the National Trade 
Estimate Report on Foreign Trade Barriers published by the 
Office of the U.S. Trade Representative. Important secondary 


sources included Bayard and Elliott (1994), Hudec (1993), 
and Mulgan (1997). Finally, I conducted a search of The New 
York Tunes, Financial Times, Asahi Shimbun, and the Nihon 
Nogyd Shimbun (The Japan Agricultural Newspaper). 


Explanatory Variables (Tables 5 and 6) 


Cross-Sector Linkage. A four-level ordinal scale mea- 
sures the strength of cross-sector linkage. Negotiations on 
agricultural issues alone are coded one. Multisector negoti- 
ations with weak institutionalization, including both APEC 


- talks and bilateral talks, are coded two. The Tokyo Round 


receives the next level of cross-sector linkage strength, coded 
as three. Cases negotiated in the Uruguay Round receive the 
highest cross-sector linkage value of four. 


Threat. A four-level ordinal scale measures the degree of 
specific and public commitment behind the threat of retali- 
ation. It ıs coded one when there is no public record of US. 
officials making a threat in connection with the negotiation. 
General threats are coded two and include cases in which the 
U.S. government initiates a section 301 investigation or a U.S. 
official claims that a concession on the issue is necessary to 
forestall rising protectionism in Congress. Specific threats in 
which the U.S. government issues a deadline and target list are 
coded three. Finally, cases with implementation of retaliatory 
sanctions are coded four. 


Trade Balance. This variable is the annual bilateral trade 
balance for trade in goods measured in billion U.S. dollars at 
1995 prices. The figures for the EU represent the aggregate 
of the 15 members’ trade with the United States. For the 
sake of consistency, I include this aggregated figure for the 
full period rather than adjusting for changes in EU member- 
ship. Source: IMF, Direction of Trade (Washington, DC: IMF, 
assorted years). 


Product Value to the United States. A proxy for the 
significance of a product to U.S. interests is given by its share 
in the total value of U.S. agricultural exports on a world trade 
basis. Source: USDA, Federal Agri Trade of the United 
States Statistical Reports (Washington, DC: USDA, assorted 
years). 


Budget Growth. I code the budget variable as the annual 
growth in agricultural budget expenditures for the initial year 
of the negotiation. For the EU, expenditures are EU expen- 
ditures rather than national budgets (EC 1999). For Japan, 
the expenditures are general account expenditures of the na- 
tional budget (MOF 1999). 





TABLE 5. Descriptive Statistics for U.S._Japan Data (Number of Observations, 153) 





























Japan 
Vanable Mean Minimum Median Maximum 
Liberallzation 2.10 0.84 1.00 2.00 3.00 
Cross-sector linkage 1.94 1.09 1.00 2.00 4.00 
Threat 1.71 1.02 1.00 1.00 4.00 
Trade balance —31.93 2.91 —83.38 —25.09 —0.38 
Product value to the U.S. 2.00 4.00 0.00 0.65 29.59 
Budget growth 12.31 15.57 —10.60 5.72 39.94 
Economk growth 2.81 2.19 —2.76 3.36 5.74 
LDP strength 51.83 5.80 43.64 48.92 59.26 
Multicountry demand 0.53 0.50 0.00 1.00 1.00 
Count 2.12 1.23 1.0 2.00 6.00 
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TABLE 6. Descriptive Statistics for U.S.—EU Data (Number of Observations, 114) © 























EU 
Varlable Mean SD Minimum Median Maximum 
Liberalization 2.04 0.80 1.00 2.00 3.00 
Cross-sector IInkage 2.02 1.28 1.00 1.00 4.00 
Threat 1.90 1.13 100 100 4.00 
Trade balance —8.81 13.47 —52.16 —11.90 10.56 
Product value to the U.S. 3.72 7.26 0.01 0.68 41.52 
Budget growth 25.32 24.41 —39.63 16.14 94 79 
Economic growth 2.51 0.99 —0.06 2.42 5.21 
French Councll presidency 1.29 0.53 1.00 1.00 3.00 
Export subsidy 0.38 0.48 0.00 0.00 1.00 
Multicountry demand 0.47 0.50 0.00 000 1.00 
Count 193 1.18 1.00 2.00 6.00 






















Economic Growth. This variable is the per capita GDP, 


growth for the final year of the negotiation at 1995 prices and 
exchange rates. For Europe, I use the larger aggregation of 
the 15 EU members rather than changing aggregation as new 
members entered. Source: OECD, Statistical Compendium 
CD-Rom (OECD: Paris 2001). 


LDP Strength. LDP strength is measured by a ratio for 
the share of LDP seats relative to all seats in the Lower House 
of parhament. 


French Council The variable is a count vari- 
able for the number of times that France held the presidency 
of the EU Council of Ministers during the initial year and the 
final year of the negotiation. The value ranges from one to 
three. 


Export Subsidy. This is an indicator variable for the EU 
model that is coded one when the trade measure in the nego- 
tiation is an export subsidy policy. 


Multicountry Demand. This is an indicator variable that 
represents whether the liberalization demand is made by the 
United States alone or multiple countries. 


Count. This variable counts the number of the case in a 
series of negotiations on the same product. 


Japan Fixed Effects Commodity Groups (17). These 
include fruit, cereals, citrus, nuts, fish, sugars, meat (e.g., offals, 
processed meat), beef, pulses, roots, pork, rice, vegetables, 
dairy, alcohol, wood, and industrial crops (e.g., Oilseeds and 
tobacco). A few miscellaneous processed products are aggre- 
gated with industrial crops. 

EU Fixed Effects Commodity Groups (13). These in- 
clude fruit, cereals (includes rice), citrus, sugars, meat (e.g., 
offals, processed meat), beef, poultry, roots, pork, vegeta- 
bles, dairy, alcohol, and industrial crops (e.g., oilseeds and 
tobacco). 
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ne of the most important developments over the past three decades has been the spread of liberal 

economic ideas and policies throughout the world. These policies have affected the lives of mil- 

lions of people, yet our most sophisticated political economy models do not adequately capture 
influences on these policy choices. Evidence suggests that the adoption of liberal economic practices 
is highly clustered both temporally and spatially. We hypothesize that this clustering might be due to 
processes of policy diffusion. We think of diffusion as resulting from one of two broad sets of forces: one 
in which mounting adoptions of a policy alter the benefits of adopting for others and another in which 
adoptions provide policy relevant information about the benefits of adopting. We develop arguments 
within these broad classes of mechanisms, construct appropriate measures of the relevant concepts, and 
test their effects on liberalization and restriction of the current account, the capital account, and the 
exchange rate regime. Our findings suggest that domestic models of foreign economic policy making 
are insufficient. The evidence shows that policy transitions are influenced by international economic 
competition as well as the policies of a country’s sociocultural peers. We interpret the latter influence as a 
form of channeled learning reflecting governments’ search for appropriate models for economic policy. 


ne of the most important developments over 

the past three decades has been the growing 

willingness of governments to open up the na- 
tional economy to global market forces. The wide- 
spread rollback of policies that block the free move- 
ment of goods and capital has affected the quality of life 
for millions of the world’s citizens. Economists reckon 
the gains to developing countries from a liberalized 
capital regime to be in the billions of dollars of added 
GDP growth (Dobson and Hufbauer 2001; Soto 2000). 
Some, however, acknowledge the instability and hu- 
man insecurity left in liberalization’s wake (Kaplinsky 
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2001; Prasad et al. 2003). These debates have not been 
resolved. Nevertheless, few policy choices are as fun- 
damental as those that determine how a national econ- 
omy should engage—or resist—the forces of economic 
globalization. 

Despite its centrality to the economic history of the 
last third of the twentieth century, we know little about 
the conditions that underlie the ebb and flow of liber- 
alization worldwide. The political economy literature 
has typically assumed that the most important politi- 
cal processes to model are largely internal to each na- 
tional polity. Scholars have built theory about the pref- 
erences of domestic actors for liberalization (Frieden 
1991; Rogowski 1989), explored the partisan sources of 
economic and financial policy (Epstein and Schor 1992; 
Simmons 1994), and linked the rent-seeking behav- 
ior of governments to resistance to opening the econ- 
omy (Alesina, Grilli, and Milesi-Ferretti 1994; Leblang 
1997). These benchmark works tend to play down or 
neglect altogether the role of international politics or 
broader external social relations. The risk is high that 
political economy models of economic liberalization 
have been under- or even misspecified. 

As we show, evidence indicates that transitions to 
economic liberalization cluster in time and space. The 
question is, What can account for these tides of foreign 
economic policy liberalization and restriction? A cru- 
cial explanation, we believe, lies in policy diffusion, in 
which the decision to liberalize (or restrict) by some 


1 There 1s a mich tradition of research on the geographic diffusion 
of a whole host of political, social, and economic phenomena. In 
political science, see the work of Walker (1969) and Gray (1973) on 
the diffusion of policy among the states of the United States, Collier 
and Messick (1975) on social security, and Tolbert and Zucker (1983) 
on civil service reform. On democratic diffusion see Huntington 1991, 
O’ Laughlin et al. 1998, and Starr 1991. A parallel set of studies exists 
in sociology with respect to institutional evolution (eg., Meyer and 
Rowan 1977, Powell and Dimaggio 1991, and Strang 1991). 
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governments influences the choices made by others.” 
We theorize two broad classes of diffusion mechanisms: 
one in which foreign policy adoptions alter the bene- 
fits of adoption for others and another in which these 
adoptions provide information about the costs or ben- 
efits of a particular policy innovation. In developing 
these arguments, we explicitly acknowledge the alter- 
natives. For example, liberalization patterns could be a 
response to commonly experienced phenomena (cur- 
rency crises, economic recession) rather than the result 
of interdependent state behavior. Similarly, economic 
liberalization may simply be the preference of liberal 
democracies; such preferences alone may lead govern- 
ments to respond similarly, but independently, to the 
conditions they face. Both of these processes could lead 
to highly clustered policy making, but we would not 
classify either of them as a diffusion process.3 For our 
purposes, they constitute null hypotheses against which 
accounts of interdependent decision making must com- 
pete. 

We focus on explaining changes within three foreign 
economic policy areas, each of which is primarily mon- 
etary or financial in nature but has a profound impact 
on the real economy. The first is liberalization of the 
current account, which includes foreign debt repay- 
ment and payment for goods, services, and invisibles 
(see Simmons 2000). The second is liberalization of 
the capital account, or the removal of taxes, quotas, 
or other rules that discourage the free movement of 
investment funds into and out of a country (Quinn and 
Inclan 1997). The third policy is the unification of the 
exchange rate, or eliminating multiple or tiered systems 
that can be used to discriminate against particular kinds 
of transactions or particular trading partners (Reinhart 
and Rogoff 2002). Together, these three policy areas 
constitute the principal aspects of international mon- 
etary and financial liberalization over the past three 
decades. We argue that these choices are influenced by 
the choices of other governments as much as they are 
by exogenously given domestic institutions or prefer- 
ences that can be traced back to domestic political or 
economic structures. Our task is to demonstrate how 
and why these policy choices diffuse internationally. 

One can observe a strong trend, with fits and starts, 
toward liberalization in these three areas over the past 
30 years. In 1967, 25 members of the International Mon- 
etary Fund (IMF; 24% of its membership) had capital 
accounts that were practically free of restrictions, 38 
(37%) had fully liberalized current accounts, and 75 
(73%) had unified exchange rate systems. By 1996, 54 
members (30%) had removed virtually all restrictions 


2 We use the term diffusion to refer to all processes ın which “prior 
adoption of a trait or practice ın a population alters the probability 
of adoption for remaining non-adopters” (Strang 1991). A host of 
related phenomena is subsumed under this definition (e.g., imitation, 
demonstration effects, mimicry, emulation, isomorphism, contagion, 
dissemination, transfer), which we will assume to be part of the more 
general phenomenon with which we are concerned. 

We chstinguish diffusion processes from diffusion outcomes, or sm- 
ple clustering in time and space. Processes of policy diffusion are one 
class of explanation for such clustering, but there are a number of 
alternative explanations, which we discuss below 
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on the capital account, while 79 (45%) had liberalized 
the current account and 158 (or 88% of the member- 
ship) had unified their exchange rate systems* But 
more than this trend, what concerns us is that tran- 
sitions to and from these policies tend to be highly 
concentrated in certain years and particular regions. 
For example, the bulk of the transitions to unified ex- 
change rates occurred in the mid-1970s and again in the 
mid-1990s. Similarly, the late 1960s and mid-1990s were 
times of high activity in current and capital account 
liberalization. Policy clusters can be confirmed statisti- 
cally: The distribution of transition counts (both liberal 
and restrictive transitions) fits a negative binomial dis- 
tribution (which assumes clustered data) better than 
it does the distribution from a random, nonclustered 
process such as the Poisson. 

Foreign economic policy transitions tend to cluster 
spatially as well. As the 1995 maps in Figure 1 demon- 
strate, the three economic policies have a distinctly 
regional cast. But why, exactly, should near neighbors 
choose similar policies? We suspect that geographical 
clustering is largely spurious and can be explained by 
a more precise set of relationships. We suggest two 
broad diffusion mechanisms for clustered policies: (1) 
foreign economic policy choices elsewhere can alter the 
payoffs associated with choosing or maintaining a par- 
ticular policy, and (2) foreign economic policy choice 
elsewhere can change the information set on which gov- 
ernments base their own policy decisions.® 


EXPLAINING CLUSTERED TRANSITIONS IN 
FOREIGN ECONOMIC POLICY MAKING 


Altered Payoffs 


In this class of mechanisms, the policy decisions of one 
government alter the costs and benefits of the policy for 
others. One can think of these decisions as producing 
externalities that subsequent adopters must factor into 
their decision calculus. One type of externality is highly 
material and works through direct economic competi- 
tion. Another is more ideational and works through the 
more subjective pressures of prevailing global norms. 


4 IMF: Annual Exchange Arrangements and Restrictions, analytical 
appendix, various issues. Smmuar trends can be observed with data 
presented by Prasad et al (2003). 

> The Poisson is a rare events distribution that assumes the absence of 
precisely the two sources of convergence that we purport to disentan- 
gle in this article. Specifically, the distnbution assumes that the mean 
equals the variance, which, in an event count, implies that events 
occur independently and that the susceptibility of a particular event 
is homogeneous across units, which in this case is years (King 1989). 
If there 1s overdispersion (variance greater than the mean)—an ef- 
fect of highly clumped data—then the Poisson will not fit the data 
well. The negative binomial, on the other hand, is less restrictive and 
treats the variance as a parameter to be estimated. For each set of 
policy reversions, we performed a chi-square test of the equivalence 
of distributions. In each case, a likelihood-ratio test reveals that the 
data on policy choice fit a negatrve binomial distribution significantly 
better than they do a Poisson 

6 Policy innovation elsewhere may affect both payoffs and informa- 
tion, but we view these mechanisms as analytically, 1f not always 
empirically, distinct. 
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FIGURE 1. Geographic Dispersion of Three 
Economic Pollctes (1995) 











Altered Material Payoffs. One of the important in- 
sights of economists who pioneered the early in- 
terdependence literature was that economic policies 
adopted in one country can have economic effects else- 
where, with profound consequences for policy making 
(Cooper 1968). These insights informed a generation 
of political economy work concentrated on issues of 
macroeconomic policy coordination among the major 
economies (Hamada 1985; Iida 1999). 

International markets for goods and especially for 
capital are the conduit for policy interdependence in 
these models. Here we focus on competition among 
policy makers to attract capital and international busi- 
ness generally as a means to enhance aggregate eco- 
nomic growth (Stockman and Hernandez 1988). Policy 
liberalization in country A may make it a relatively 
more attractive venue for investment or conducting 
commercial relations. Indeed, economists have stressed 
that capital and trade respond positively to the signal 
that policy liberalization sends (Bartolini and Drazen 
1997). When a country’s foreign competitors liberalize, 
traders and investors are drawn to locations where they 
can do business more freely and securely. Anticipating 
this outcome, country B may feel competitive pressures 
to match its rival’s liberal policy. This phenomenon sets 





up the possibility of competition among jurisdictions, 
at least on the margins, for international economic ac- 
tivity. 

In this model, governments act strategically in order 
to attract economic activity to their jurisdiction with 
the ultimate aim of boosting aggregate growth. Pluralist 
renditions emphasize the preferences of electorally sig- 
nificant firms or groups in clarifying to leaders the inter- 
ests they have in such policies (Encarnation and Mason 
1990; Goodman and Pauly 1993). In more statist ver- 
sions, decision makers take such actions regardless of 
the immediate preferences of domestic political groups 
(Krasner 1985); in the medium run, they are gambling 
on an aggregate growth payoff for which, presumably, 
they will be rewarded by continued political support. 
In each case, the government faces incentives to an- 
ticipate and match decisions made outside its jurisdic- 
tion, rather than waiting passively for these decisions 
to work their way through the international economy, 
the domestic economy, and the domestic electoral sys- 
tem. In an international environment that is assumed to 
be institutionally thin and nonhierarchical, the result is 
competitive pressure to implement capital- and trade- 
friendly policies when major competitors have done so. 

Note that this model does not predict universal con- 
vergence on liberalization. It predicts convergence to- 
ward either restrictive or liberal policies among com- 
petitors. We assume that a decision maker has good 
information about the identity of competitors, their pol- 
icy choices, and the material consequences of matching 
or failing to match their policies. The model leads to 
the following prediction: 


Governments’ liberalization policies will be in- 
fluenced by the policies of their most important 
foreign economic competitors. 


Altered Reputational Payoffs. Changes in prevailing 
global ideas and the practices they entail create exter- 
nalities for governments as well. One of the hallmarks 
of the current trend toward globalization is the ascen- 
dancy of theories that emphasize market mechanisms 
as engines of economic growth (Gore 2000, McNamara 
1998; Williamson 1993). The spread of liberalization 
both reflects and buttresses the power of a neoliberal 
ideational consensus. 

Ideational consensus is a potential externality be- 
cause it alters the reputational payoffs associated with 
policy choice. As growing numbers of important actors 
articulate theories and implement practices that reflect 
a normative consensus, the legitimacy of these ideas 
gathers steam. In the absence of ideational consensus, 
heterodox policies are difficult to distinguish and are 
readily tolerated. But theoretical consensus on an ap- 
propriate economic model raises the intangible costs 
of nonconformity. Perceived policy failures associated 
with “heterodoxy” will suffer greater public condem- 
nation than similar failures of conforming policy. Gov- 
ernments that resist ideational trends face reputational 
consequences that cast doubt on their approach to the 
economy and potentially the legitimacy of their gover- 
nance. 
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FIGURE 2. Threshold Model of Policy Adoption 
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The logic that links normative consensus to le- 
gitimacy externalities may be reflected in the “tip- 
ping,” or “threshold,” models that Schelling (1978) and 
Granovetter (1978) have described. The basic intuition 
in these models is that most governments are highly 
sensitive to the number, or proportion, of other coun- 
tries that have adopted a particular policy stance. The 
idea of “thresholds” or “critical mass points” is a useful 
(although not necessary) device for understanding the 
process. 

Figure 2 illustrates the classic effect of Schelling’s 
conception of thresholds on the probability of adopting 
a particular policy. Imagine a group of actors, each of 
whom will adopt a given practice only if a critical mass 
of others adopt. Assuming that the distribution of the 
various critical mass points is normal, we can add them 
and produce the s-curve in Figure 2 (which depicts the 
mean critical mass point around 50%). Points on the 
curve represent the proportion of actors who would 
adopt a policy given the proportion of the population 
that is expected to adopt. In this stylized scenario, one 
can see that a small set of actors (about 5%) would 
adopt the policy even if no one else is e ed to do so. 
Similarly, a small percentage of actors (about 15%) at 
the top of the curve will not adopt even if they expect ev- 
eryone else to adopt. The y =x line helps demonstrate 
the equilibria that result from this dynamic. Points on 
the curve under the identity line will resolve to the bot- 
tom of the curve (at its intersection with the identity 
line), because the critical proportion that actors require 
for their adoption is always higher than the number who 
would adopt at that level. One can see the opposite 
dynamic for points above the identity line, where the x 
and y variables also reinforce one another, but this time 
produce an equilibrium point at the upper intersection 
of the curve and the identity line. 

Tipping models capture the dynamics of global 
norms fairly well. For reputational reasons, the propor- 
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tion of others adopting may matter a great deal. Such 
reasoning implies a clear empirical expectation: 


The proportion of liberalizations in the sam- 
ple at large should influence a governments 
decision to liberalize. 


New Information 


A conceptually distinct motor for policy diffusion is 
informational. In contrast to the discussion of payoffs 
above, this approach assumes that governments often 
lack the crucial information they need to understand 
the consequences of economic policy innovation.’ In- 
novations elsewhere provide information on policy 
consequences that may be more or less relevant in a 
particular case. Governments are assumed here to use 
available information in a rational fashion to maximize 
the chances of their own policy success, Sometimes such 
policy learning involves deliberate attention to foreign 
models and their outcomes (see, e.g., Westney’s [1987] 
description of Japanese statecraft in the 1800s). For- 
eign models can encourage or expedite adoption by 
inserting a policy innovation on a legislature’s agenda. 
A foreign model may also offer a ready-made answer 
to ill-defined domestic pressure for “change” and “in- 
novation.” Or it may legitimate conclusions or pre- 
dispositions already held or add a decisive data point 
in the evaluation of alternatives (Bennett 1991). But 
what “lessons” do governments actually learn in the 
economic policy realm? We hypothesize that they may 
learn from “success,” via communication networks, and 
from cultural reference groups. 


7 A consistent theme in recent research has been the uncertainty sur- 
rounding the liberalzzation process. A recent study by staff of the IMF 
concludes that basic questions about the optimal pace and sequencing 
of financial integration are unresolved (Prasad et al. 2003, 5). 
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Learning from Success. The most obvious sources of 
lessons for economic policy making are those that seem 
to “work.” By mechanisms consistent with theories of 
Bayesian updating, governments are likely to follow 
the lead of those countries whose economic conse- 
quences appear to have been notably favorable. Thus 
the Japanese “miracle” provided an economic model 
for much of Asia and beyond in the 1970s and 1980s, 
though it has been a far less attractive model during 
the doldrums of the 1990s. Chile, too, is often cited as 
a relevant “success story” for liberalization of emerg- 
ing market economies, from Latin America to Asia to 
Eastern Europe (Edwards and Edwards 1992). 

Learning from success need not depend on the ability 
of the government to reason in a sophisticated manner 
about the links between a policy and its apparent out- 
come. Indeed, “learning” can be quite superficial, and 
it often involves linking a highly salient outcome with a 
policy innovation without complete information about 
the causal connections. The apparent success of others 
may in fact be a cognitive short-cut to assessing policy 
consequences; the relevant question in this process is, 
What policies are the high achievers pursuing? If this 
form of learning is important, we should expect govern- 
ments to be influenced by the policy innovations of the 
best-performing economies. 


Learning through Communication. Above we as- 
sumed that learning was channeled by salient facts. 
Another possibility is that it is primarily conditioned 
by informational networks themselves. The exchange 
of information among connected actors is the presumed 
motor behind diffusion in most sociological studies 
(Rogers 1995; see also Axelrod 1997). In these models, 
information is largely channeled along specific actor 
networks. The cognitive process is dominated by an 
availability heuristic, in which actors unable to retrieve 
a full sample of information base their decisions on only 
those instances that are available to them (Kahneman, 
Slovic, and Tversky 1982). The result is that the choice 
set of policy makers will be limited to policies of states 
that are immediately accessible to them. 

One can readily identify the kinds of network and 
communicative links that could contribute to learning 
with respect to monetary and financial liberalization. 
Direct contacts at the intergovernmental level may re- 
flect well-established channels of communication: Fre- 
quent intergovernmental meetings at multiple official 
levels can transmit information to policy makers about 
“what works” in other settings. It is well documented 
that the process of negotiating and maintaining insti- 
tutional affiliations may create opportunities to learn 
and persuade (Haas 1959). Conversely, where official 
contacts are infrequent, information is less likely to be 
transmitted and less likely to become salient to decision 
makers. 

Policy diffusion may follow communication channels 
mediated by private actors as well. Business people may 
transmit ideas about appropriate economic policy by 
looking to the experiences of the countries with which 
they have especially intense trading contacts. Lessons 
drawn from these contacts may inform the shape of the 


demands they make on their own governments, thus 
feeding into the demand side of the policy equation. 
This argument suggests that policy diffusion should be 
strongest among governments that are in especially 
close communication. We should expect a positive re- 
lationship between policies of governments with exten- 
sive opportunities to share information about the conse- 
quences of economic policy innovation. 


Learning from Cultural Reference Groups. Actors 
in uncertain and information poor environments ra- 
tionally seek information relevant to their own policy 
context. Learning takes place at least partially through 
analogy, and lessons are viewed as more relevant the ex- 
tent to which a foreign case is viewed as analogous. The 
IPE literature readily distinguishes between “advanced 
industrial,” “emerging,” and “developing” countries 
but completely overlooks a far more salient identity 
marker that may shape the emulation process: cultural 
similarity. 

Cultural propinquity is a nonobvious yet highly plau- 
sible explanation for policy emulation, even in as 
material an issue area as international finance. Cul- 
tural factors underlie economic and financial structures 
to a greater extent than is often realized. Granato, 
Inglehart, and Leblang (1996) have shown that cul- 
tural values are important to economic development. 
Cargill and Parker (2001, 2) note, in their study of fi- 
nancial liberalization in China, that China adopted the 
Japanese style of finance for “shared cultural and his- 
torical reasons.” The experience of Egypt with financial 
liberalization is typically cited as the appropriate source 
of lessons for “Arab banking systems” (Wahba and 
Mohieldin 1998). Indeed, a new generation of research 
explores the ways in which culture has reasserted itself 
despite the globalization of markets (Beng-Huat 1999; 
Chun 2000; Goff 2000). 

Unlike organizational sociologists, who have con- 
centrated on the apparently nonrational adoption of 
policy models reflecting “world culture” (Meyer and 
Rowan 1977; Powell and DiMaggio 1991; Scott and 
Meyer 1994), we argue that cultural emulation reflects 
reasonable efforts to learn from the most appropri- 
ate available examples of policy innovation. We are 
interested in testing a constructivist-inspired hypothe- 
sis: Ideas about appropriate models are likely to reflect 
deep identity concerns (Checkel 1993; Risse-Kappen 
1994; Ruggie 1975). Sociologists have long assumed 
that shared beliefs and values shape the channels along 
which ideas flow (Rogers 1995, 274). In fact, the volu- 
minous literature on diffusion and social influence has 
found that entities that share similar cultural attributes 
tend to adopt the same practices. This is true not only of 
individual behavior like teen smoking (Coleman 1960) 
and voting (Brady and Sniderman 1985; Lupia and 
McCubbins 1998) but also of collective behavior with 
respect to corporations (Davis and Greve 1997), non- 
profit organizations (Mizruchi 1989), states within fed- 
erations (Walker 1969; Rose 1993), and indeed nation- 
states (Deutsch 1953). The most plausible explanation 
of this finding is that actors negotiating a complex set 
of political choices regard the actions of actors with 
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perceived common values as a useful guide to their own 
behavior. 

But why should cultural groups be relevant reference 
groups? For one thing, cultural markers are highly vis- 
ible. If decision makers know one thing about another 
country, it is usually the language its citizens speak or 
the general tenor of the cultural traditions they practice. 
Moreover, common culture embodies subjective no- 
tions of identity contained in assumptions of commonly 
shared values and social purposes. The policies of cul- 
turally similar countries are perceived to (and in fact 
may) contain highly relevant information on the ap- 
propriateness of a particular policy in a specific context 
of shared values. This perceived similarity may provide 
a cognitive short-cut for an individual or a focal point 
to limit cycling over alternatives in a group decision- 
making context. Furthermore, models used by favored 
cultural groups may provide a persuasive rhetorical 
spin in the debates that accompany policy evaluation 
(on the importance of rhetoric and persuasion to the 
policy evaluation process see Majone 1989). Following 
the lead of cultural reference groups may even provide 
a way to mitigate the perceived threats to cultural iden- 
tity posed by globalization.® In each case, perceived cul- 
tural affinity assists in selecting the relevant models, or 
“reference groups,” that inform policy development.? 
Admittedly, economic policy making—a practice with 
very material ends and theoretical underpinnings that 
make no explicit concessions to culture—may be im- 
mune to culturally channeled learning. But given a high 
degree of uncertainty about the consequences of a par- 
ticular policy shift, governments may be influenced to 
follow the lead of a culturally or socially similar group 
of states. 


We expect that cultural similarity will be a posi- 
tive predictor of policy diffusion among states. 


DATA AND ANALYTICAL METHODS 


The empirical problem now is (1) to identify meaning- 
ful measures of the pressures from altered payoffs and 
channeled forms of learning and (2) to estimate their 
effects on the liberalization and restriction of foreign 
monetary and financial policy, while controlling for a 
reasonable battery of nondiffusion effects. 


Dependent Varlables 


Our dependent variables are transitions in each of 
three policy areas—capital account openness, current 
account openness, and exchange rate unification. (See 
Table 1 for descriptive statistics and sources.) Each of 
the variables is a binary measure of whether the gov- 
ernment has or has not imposed restrictions (or, in the 
exchange rate case, a tiered or multiple system) in the 


8 On the importance of perceived cultural threats in the Egyptian 
case, see Amin 1981. 

9 Rosenau (1990, 213) terms these reference groups “cathectic,” sug- 
gesting that decrsion makers have a strong cultural sense of whom 
their nation should look lke. 
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given year (with policy liberalization = 1). The data are 
from the annual IMF volumes on exchange restrictions 
and controls (Analytical Appendices; various issues). 
These measures have been criticized on a number of 
grounds, but for our purposes they are appropriate. 

One concern is that the dichotomous measure masks 
the complexity and intensity of restrictions. For at least 
one of the dependent variables (capital controls), data 
with finer gradations do exist, though for a smaller sam- 
ple (Quinn 1997). All things being equal, we would pre- 
fer to use a more finely grained measure. Nevertheless, 
our purpose here is to model major policy shifts glob- 
ally, and the cost in sensitivity with the IMF’s data is bal- 
anced by their comprehensive coverage across time and 
space. It is less important, for our purposes, to capture 
the nuance than it is to capture the foundational policy 
demarches for a wide range of countries around the 
world. Nonetheless, it is crucial that our use of dichoto- 
mous data does not misrepresent true trends. For this 
reason, we compared the IMF capital control measure 
with Dennis Quinn’s more nuanced measure for the 
countries these two datasets have in common (namely, 
27 OECD countries between 1967 and 1997). The over- 
all correlation between the dichotomous IMF data and 
the polychotomous Quinn data is 0.65, suggesting that 
the former do not suffer remarkably from unreliability. 
The Quinn data and the dichotomous IMF data diverge 
most in more recent years. These are years for which the 
OECD countries have already crossed the threshold to 
liberalization by our cruder measure. Quinn’s measure 
continues to pick up nuances in degrees of liberalization 
beyond this threshold, causing the correlation in later 
years to come somewhat unhinged. 

The existence of a superior but more limited dataset 
provides an opportunity to check the validity of the 
globally available dichotomous measure. The construct 
validity of a measure of capital controls may be judged 
by the comparative accuracy of its prediction of the 
volume of capital flows (see Collier and Adcock 2002 
for a useful clarification of validity issues). When we 
Tegress gross private capital flows as a percentage of 
GDP on the dichotomous measure and on Quinn’s 
polychotomous measure (for the 27-country overlap- 
ping sample), both measures appear highly statistically 
and substantively significant, after accounting for serial 
autocorrelation in the time series with a fixed effects 
model. The K? statistics for similar models employing 
the measures successively are slightly higher for the 
IMF measure than for the Quinn data for the same 
country sample (0.57 compared to 0.53), and the stan- 
dard error of the model (perhaps a better measure 
than R because of the difference in scale of the two 
measures) is lower for the IMF measure. And if we 
focus just on the 1990s—the period for which these 
measures diverge—the dichotomous measure predicts 
even more of the variance with respect to capital flows 
than does the more nuanced Quinn measure (R? of 
0.64 compared to 0.57 and standard error of roughly 68 
compared to 127). These tests suggest that the IMF’s 
dichotomous measures, although crude, are quite likely 
to be valid constructs for the phenomenon we have in 
mind. Finally, as we need global data to test arguments 
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Data 











Mechanism or 
Concept Explanatory Variable Mean SD Minimum Maximum Source(s) 
Dependent variables Capital account policy 0.22 0.42 0 1.0 (1) 
Exchange rate policy 0.73 0.44 0 1.0 (1) 
Current account policy 0.46 0.50 0 1.0 (1) 
Altered payoffs 
Material 
Competition Pollcles of capital 2.25 1.3 0 10 (1, 3) 
competitors 
Policies of trade 1.93 1.40 0 6.36 (1, 2) 
compettors 
Reputational 
Global norms Mean global policy 2.26 0.31 1.80 3.4 (1) 
New Information 
Learning from success Policies of high-growth 2.8 0.68 1.50 3.9 (1, 10) 
countries 
Communication networks Pollcles of trade partners 5.08 2.43 0 10 (1, 2) 
Policies of BIT partners 5.60 3.69 0 10 (1, 5) 
Policies of PTA partners 2.28 2.07 0 10 (1, 4) 
Cultural sImllartty Policies of religlon partners 2.22 1.14 0 8.6 (7-9) 
Pollcies of colonlal partners 1.69 1.68 0 10 (6) 
Pollcles of language 3.11 2.18 0 10 (6) 
partners 
Control mechanisms 
Economic conditions Current account/GDP (t—2) —3.83 10.99 -—240.52 70.21 (11) 
GDP growth 4.08 1427 —76.82 699.90 (11) 
QDP per capita 3.31 5.62 0.03 37.42 (10) 
(In thousands) 
Interest rates 6 60 2.45 3.02 14.08 (12) 
Currency crisis (t — 1) 0.14 0.35 0.00 1.00 (20) 
External political pressures Use of IMF credits 0.76 0 42 0 1 (17) 
Foreign ald (per captta) 55.88 157.28 -75.31 2,337.98 (17) 
Domestic political conditions Degree of openness 66.78 44.12 4.99 423.41 (10, 11) 
Democracy 3.78 4.34 0.00 10.00 (13) ` 
Nationalist executive 1.45 3.52 0.00 10.00 (14) 
Central bank Independence 0.14 0.35 0.00 1.00 (18) 
Common law legal tradition 0.34 0.47 0.00 1.00 (19) 
Geography Policies of border countries 2.38 351 0 10 (15, 16) 
Policies of neighbors 2.40 0.92 0 7.89 _(15, 16) 





Exchange 
Statstics, (3) Standard and Poors, 





Note. Summary statstics for the relational variables are calculated for the capita! account: relatonal varlables have been 

muttplled by ten to Improve presentation of the regression coefficients Data sources: (1) International Monetary Fund, Annual 

and Restrictons, analytical appendix, various issues, (2) International Monetary Fund, Direction of Trade 

Historical Sovereign Bond Ratings, (4) World Trade Organization, http-/Aveww.wto.org/english/ 

tratop_e/region_e/region.s.him; (5) World Bank, http//www.wordbank.o 
and Przeworski 1996, Pollbcal Economic Database Codebook, 


-{.htm, (6) Alvarez, Cheibub, Limongl, 
http /Awww.ssc.upenn.edu/~chelbub/data/ACLP - 


PDF; various country Web sites; (7) Countries of the World and Their Leaders Yearbook 2000; (8) Europa 1999, (9) Central Intelligence 
Agency 1999; (10) Penn World Tables, http/Avww.bized.ac.uk/dataserv/penndata/pennhome.htm; (11) World Bank, STARs Database; 
Financial 


(16) ESRI, Arc-View World Dataset; (17) World Bank, World Development Indicators, http //devdata.worldbank org/dataonline/; 


(18) Cukerman 1992; McNamara and Castro 2003; (19) 
Yu, World Bank; (20) Leblang 2003. 
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about global diffusion processes, and as we employ 
event history methods (discussed below), the dichoto- 
mous measure is most appropriate. 

Another concern is that the IMF measures do a bet- 
ter job of measuring “announced” policy than they 
do “actual” policy. The validity analysis of the IMF 
capital control measure above suggests that this dif- 
ference is negligible. Nevertheless, with respect to the 
exchange rate data, some authors have questioned 


the correspondence between these reports and ac- 
tual currency behavior. Reinhart and Rogoff (2002) 
have developed a measure of exchange rate unifica- 
tion based on actual rates rather than reported gov- 
ernment policies Because their data are thought to 
be the state of the art among economists, we coded 
the descriptions from their qualitative appendix to see 
to what extent they accord with the extant IMF data. 
Our dichotomous coding of the Reinhart/Rogoff data 
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with respect to exchange rate unification correlated 
quite highly with the IMF data (the datasets agree in 
82% of the 3449 country-years they had in common 
between 1967 and 1997). Disagreement between the 
Reinhart/Rogoff data and the IMF’s was strongest in 
the early 1970s, the period of breakdown in the Bretton 
Woods system of fixed rates. 

We checked to see whether the roughly 20% of cases 
of disagreement involved a systematic bias by the IMF 
but found that the “off-diagonals” in a cross tabulation 
were roughly equally distributed. There was a slight 
tendency for countries to report a unified exchange rate 
when the behavioral measure indicated otherwise, but 
these cases account for at most about 4% of the total 
number of cases. As above, we also tried to confirm the 
validity of the IMF data. Because multiple rate systems 
are sometimes used to discriminate against classes of 
imports or importers, we compared regressions of im- 
ports as a proportion of GDP on the two measures, 
taken individually and then together. In this case, the 
Reinhart/Rogoff measure did slightly better at predict- 
ing imports in a fixed effects model than did the IMF 
measure. For this reason, we have replicated all ex- 
change rate models with the Reinhart/Rogoff data.!° 
All of the main findings are robust to the use of either 
dataset (see subsequent footnotes for details). 


Diffusion Variables 


In order to assess the source and strength of policy dif- 
fusion paths, we must construct variables that plausibly 
indicate changing payoff structures (material or reputa- 
tional) and new sources of salient policy information. 
Under these two broad rubrics, we seek to identify a 
country’s various competitive, normative, communica- 
tive, and cultural influences and combine this informa- 
tion with the policy “cue” transmitted along that net- 
work. The growing field of spatial econometrics offers 
a useful set of methods to incorporate these kinds of 
variables (Anselin 1988).!! Spatial regression models 
handle spatial dependence in one of two ways. One is to 
specify the spatial dependence in the error term (spatial 
error models). This method is appropriate when spatial 
dependence is nothing more than a nuisance that biases 
the interpretation of the parameters of interest. When, 
as in our case, spatial dependence is itself the focus, 
researchers include spatial terms as repressors in the 
model (spatial lag models). 

Spatial lag models treat spatial dependence in the 
same way that time-series models treat serial corre- 
lation. Instead of lagging the value of the dependent 
variable one unit in time, one “lags” it one unit in 
space. The spatial lag is the weighted average of the 
dependent variable in the actor’s “neighborhood.” The 
neighborhood is mapped by an N x N spatial weights 
matrix conventionally labeled W. Thus the spatial lag 


10 Full results using the Reinhart/Rogoff exchange rate data are 
available from the authors on request. 

11 A parallel set of methods has developed within network analysis. 
See, for example, Marsden and Friedkin 1993. 
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for country i can be written 


Wy, = 5 Wa ¥ys 
N 


j=l, ., 


where W is the spatial weights matrix and yy is the 
dependent variable for country j. In matrix form we 
write the relationship Wy, where y is an N x 1 vector 
of observations on the dependent variable. These mea- 
sures vary by year as well. 

As with time-series models, the spatial dependence 
can be modeled as an autoregressive or as a moving 
average function, depending on our assumptions about 
the effect’s rate of decay. Because we expect spatial 
effects to reverberate throughout the network and not 
just from the closest actor, we adopt an autoregressive 
function. We can express such a model as 


Y= pWy+ AB +e, 


where p is a spatial autoregressive coefficient, W is the 
n x n spatial weights matrix, X is a vector of nondiffu- 
sion regressors with coefficients £, and e is a vector of 
error terms.” 

In geographic models, the spatial weights matrix, W, 
is often a matrix of geographic distances among units. In 
our case, we are interested in measuring influence along 
other channels controlling for geography: through com- 
petitors whose policies alter material payoffs, through 
norms that gain global adherents and thus alter reputa- 
tional payoffs, and through linkages that alter the infor- 
mation set that informs policy decisions. The elements 
of W differ according to the nature of the measure 
of “distance” between units. These measures come in 
two principal forms: (1) direct bilateral data that record 
a level of interaction between states (e.g., amount of 
trade, number of telephone calls) and (2) affiliation data 
that identify shared membership in various groups (e.g., 
regional trade groups and language communities). 


Indicators of Altered Payoffs. We have theorized 
that altered payoffs spur policy liberalizations and that 
these payoffs may be material or reputational. Com- 
petitive economic pressures are a clear example of the 
former. In order to measure the effect of competitive 
mechanisms, we developed indicators of “competitive 
distance” for two arenas of competition: the export 
market for goods and services and the capital market. 
For the export market we have created two measures. 
The first registers the degree to which nations compete 
in the same foreign markets.“ Because an importing 
country may find ways to reciprocate or reward policy 
liberalization in country A, country B has an incentive 
to liberalize to the extent that A and B compete for mar- 
ket share within that third market. Using the bilateral 


12 W, then, is an N x N x T matrix and yw an N x T matrix. 

1D In the models reported below, we measure the policies of the tenth 
of the sample closest to each country, by each measure. We also 
tested models that allowed for a more gradual rate of decay in the 
lag by weightmg the dependent variable by the distance to all other 
countries in the sample. The results were, on the whole, fairly similar 
for these different lag structures. 

14 For a similar approach see Finger and Kreinen 1979 and 
Wasserman and Faust 1994, 
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direction of trade data available from the IMF, we pro- 
duce ann x n x t matrix of correlations (between coun- 
tries) across each country’s total exports to each of the 
182 partner countries. The result is a matrix of yearly, 
dyadic measures of the degree to which nations possess 
the same trade relationships. We use these “distances” 
to identify the tenth of the sample most competitive 
with each country in each year. The mean of the depen- 
dent variable for this group is the spatial lag. Another 
way to measure export competition is to observe the 
similarity between countries according to their propor- 
tion of exports in various sectors. This measure was con- 
structed by calculating the correlations between coun- 
tries, by year, across a set of nine variables that scored 
the countries on the percentage of their exports in nine 
different sectors. We use these correlations, together 
with the dependent variables of other countries, to cal- 
culate the spatial lag in the way described above. We 
expect a positive association between both measures of 
policies of close trade competitors and the dependent 
variable. 

Competitors for investment capital are also likely to 
be salient models for policy makers. Because investors 
want the freedom to repatriate their assets, among oth- 
erwise similar investment venues they will favor coun- 
tries that allow for the liberal movement of capital and 
currency at nondiscriminatory exchange rates. Ideally, 
we seek an indicator that allows us to predict which 
countries will compete for the same pool of interna- 
tional capital. We begin by assuming that international 
investors’ decisions depend on their varying tastes for 
risk. Portfolio theory suggests that investors will want to 
create a portfolio with a share of low-risk, medium-risk, 
and high-risk investments (according to their tastes). 
Investors may decide, for example, that 10% of their 
portfolio will be reserved for high-risk, potentially high- 
return investments. Given this assumption, it is rea- 
sonable to posit that countries that pose similar risks 
are close substitutes from an investor’s point of view. 
(The United States, we assume, does not primarily com- 
pete with Argentina for foreign capital, whereas Brazil 
might.) Our measure groups countries by their yearly 
Standard and Poor’s sovereign bond rating and calcu- 
lates the mean policy score (for each policy area) for a 
country’s rating category for each year.’ If competition 
over the same “slice” of international capital provides 
incentives to liberalize, then we expect a positive coef- 
ficient. 

Using bond ratings in this way is unprecedented and 
deserves some exposition. In a model predicting eco- 
nomic policy, one may be concerned with the potential 
endogeneity of bond ratings. It is reasonable to think 
that analysts at Standard and Poor’s are accounting for 
a country’s economic policy in their assessment of the 
risk of default of the country’s bonds. But policies— 
much less the specific policies we analyze here—appear 
to play a minor role in the rating system. As Stan- 


15 Because it is possible that the policies themselves help determme 
the bond ratings, we experiment with models that lag the ratmgs by 
two and three years. We find that the results do not change signifi- 
cantly with this modification. 


dard and Poor’s describes their methodology, ratings 
are constructed by an informal (and subjective) combi- 
nation of one to five scores in eight areas, one’of which 
theoretically could include an assessment of fiscal pol- 
icy. Empirical analyses suggest that the ratings depend 
overwhelmingly on macroeconomic indicators of pub- 
lic debt and inflation (Cantor and Packer 1996; Haque, 
Mathieson, and Nelson 1997), and not the specific poli- 
cies of interest here. 

Nevertheless, we are sensitive to the possibility of 
such endogeneity and, accordingly, have developed an- 
other measure to identify capital competitors in order 
to corroborate our results. Assuming that potential for- 
eign direct investors are concerned with a country’s 
human assets as well as its technological and com- 
munications infrastructure, we reason that countries 
with similar educational and infrastructural profiles will 
compete for the same pool of capital. We compare 
such investment profiles by calculating correlations, by 
year, between countries across roughly 15 educational 
and infrastructural variables selected from the World 
Bank World Development Indicators. As we do for the 
export competition measure, we use these distances 
between countries to identify the tenth of the sample 
“most similar” to each country in investment profile. 
The spatial lag is the mean of the dependent variable 
for this group of countries. 

We have also argued that changes in less tangible 
payoffs such as legitimacy and prestige might be influ- 
enced by the prevalence of increasingly global norms, 
theories, or beliefs embraced by governments else- 
where. Heterodox policy failures are likely to earn a 
government more criticism than would policy failures 
that are consistent with a global consensus about what 
constitutes wise, sustainable economic policy. We mea- 
sure global norms with the yearly mean of the depen- 
dent variable across all countries in the sample. One 
may think of this indicator as a measure of unchanneled 
diffusion pressure, something diffusion scholars some- 
times call homogeneous, as opposed to heterogeneous, 
mixing (Strang 1991). 


Indicators of Informational Influences. Our second 
cluster of arguments concerns changes in the informa- 
tion set governments face. Our first set of information 
indicators taps learning from apparent success (rational 
Bayesian updating). Many complex measures of suc- 
cess could be devised, but we opt for a highly visible and 
well-publicized bottom line: growth rates. Our measure 
is the proportion of liberal (or restrictive) policies of 
the top growth decile. The higher this proportion, the 
clearer the message that liberalization “works.” 

Our next set of indicators concerns communication 
networks. At the official level, information about eco- 
nomic policy options can be transmitted through ne- 
gotiations and discussions among the members of eco- 
nomic agreements and groupings, such as the European 
Union or NAFTA.!6 Another plausible channel for 





16 Common membership in a PTA 1s likely to be endogenous (ex- 
plained, for example, by shared attitudes toward policy liberaliza- 
tion), but our formulation of the dependent variable (actual policies 
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communicating expectations and information about 
capital account openness is bilateral investment treaties 
(BITs). Again, for each country-year, we calculate av- 
erage policy scores weighted by PTA and by BIT part- 
nerships. These common memberships should predict 
channeled policy diffusion, based on the diffusion of 
policy-relevant information. 

Information can also flow between decision makers 
via private channels. Private actors who are exposed 
to liberal foreign regimes for the movement of goods 
and services may become convinced of the virtues of 
these arrangements and attempt to persuade their gov- 
ernments to liberalize. Learning may even take place 
(though less plausibly) at the mass level: Extensive pri- 
vate communications may persuade a relatively broad- 
based segment of the populace in the home country 
that liberalization is an appropriate policy. To allow for 
these possibilities, we have gathered data on business 
contacts (proxied here as direct bilateral trade links) 
and estimates of telephone traffic across pairs of coun- 
tries. Once again, we weight the policy in the foreign 
country by the intensity of these communication chan- 
nels. 

We have also argued that information is gleaned 
from appropriate analogies and that policy changes in 
countries sharing common cultural traits have greater 
information content than do others. Good measures of 
cultural reference groups are difficult to pin down, but 
ideally we seek measures that tap perceived similarity 
of values and shared identity across countries. Com- 
mon dominant language, common colonial heritage, 
and common dominant religion come close to capturing 
these shared orientations Dominant language may also 
reflect communication channels, and common colonial 
heritage may pick up a number of structural similari- 
ties that on balance may be more historical than cul- 
tural. Dominant religion, on the other hand, should be 
a fairly good measure of the identity and values held 
by a society and a sense of cultural connectedness with 
other nations with similar spiritual commitments!” For 
each language, colonial, or religious grouping we com- 
pute yearly means as described above. Note that these 
three variables, although quite similar, correlate only 
between —0.03 and 0.43 when combined with the de- 
pendent variable as described above (i.e., as Wy). 


Control Varlables 


Economic Shocks. It is certainly likely that policy 
transitions are influenced by conditions that have noth- 
ing to do with policy diffusion as we have defined it. 
The most likely alternative explanation is that govern- 
ments, especially those in close regional proximity, face 
similar economic conditions and, therefore, find it inde- 
pendently rational to respond in similar ways.Currency 
crises, for example, may be a reason to restrict cap- 





of PTA members) is not. Thus expressed, diffusion via communi- 
cation among PTA members can be distinguished from the orginal 
decision to join the PTA in the first place. 

17 On the link between cultural and, especially, religious values and 
economic liberalization, see, for example, Shalev 1997. 
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ital outflows or an impetus for reform. Variation in 
world interest rates could cause regionwide capital out- 
flows, capital and current account deterioration, and 
exchange rate pressure (Bartolini and Drazen 1976), 
with predictable pressures on policy. Similarly, robust 
growth rates or an improving balance of payments 
could increase policy makers’ confidence in liberaliza- 
tion (Goodman and Pauly 1993; but see Haggard and 
Maxfield 1996). 

In order to test these hypotheses, we enlist global and 
country-specific economic variables with a close asso- 
ciation with liberalization. We include a lagged mea- 
sure of currency crises/speculative attacks collected 
by David Leblang (2003) and based on Eichengreen, 
Rose, and Wyplosz (1995), which measures abnormally 
strong market pressures for currency depreciation. We 
control for world interest rates (using U.S. interest rates 
as a proxy), for each country’s current account balance 
as a proportion of GDP (lagged two periods to mini- 
mize problems of endogeneity), and for GDP per capita 
(a rough indicator of developmental level [Johnston 
and Tamirisa 1998]). We control for the business cycle, 
using a measure of change in GDP growth. If policy 
liberalization and restriction are simply an uncoordi- 
nated response to financial or economic conditions, we 
should see strong effects for this battery of variables. 
To the extent that these conditions cluster in time and 
space, these controls should differentiate our diffusion 
mechanisms from explanations for clustering based on 
commonly experienced shocks. 


External Political Pressure. A second sort of exoge- 
nous external shock for which we control is inspired 
by traditional theories of international relations, In- 
ternational politics often involve power relationships 
in which decisions made by weak or vulnerable states 
are the result of hegemonic pressure. One possibility 
is that the United States—the avatar of economic lib- 
eralism in the postwar period—has used its influence 
to prevail upon countries to announce policies that 
they would not have embraced otherwise. For exam- 
ple, the United States has reportedly pressed Chile 
and Singapore recently to liberalize further their cap- 
ital accounts as a condition of free trade negotiations 
(Economist 2003:15). We therefore control for the pro- 
portion of each country’s trade with the United States, 
as well as a partnership with the United States in a 
preferential trade arrangement or bilateral investment 
treaty. We even experiment with a dummy variable 
identifying years in which a Republican administration 
was in charge, on the theory that the United States’ 
influence might differ depending on the partisan ori- 
entation of the president. As it is also possible that 
creditors with an interest in liberalization work through 
dominant international institutions, we control for the 
use of IMF credits, as well as an overall measure of over- 
seas development assistance per capita (which includes 
actual multilateral and bilateral aid disbursed). As an 
alternative to the diffusion processes we have outlined 
here, governments might simply be responding to pres- 
sures by their creditors to liberalize their economies. 
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This effect should be especially strong when members 
need to draw upon negotiated external resources. 


Domestic Political Economy. Another possibility is 
that comparative political economy is sufficient to ex- 
plain policy transitions. Domestic preferences, capaci- 
ties, and institutions themselves may be correlated over 
time or spatially, producing a pattern that resembles 
policy diffusion. If the underlying political-economic 
conditions common to many states in a particular re- 
gion “coevolve” over time, we could mistakenly be fo- 
cused on policy diffusion when we should be looking 
at political, institutional, or developmental diffusion. 
Economic policy liberalization may simply be the re- 
sult of independent decision making in similar domestic 
political/institutional contexts. 

We therefore control for a set of measures that cap- 
tures both the nature of “demands” the polity might 
make and the institutional landscape that translates 
those demands into policy. The extent to which the 
polity is likely to demand more open policies is ap- 
proximated by the penetration of international trade, 
measured here in the traditional way (imports plus ex- 
ports as a proportion of GDP). Garrett, Guisinger, and 
Sorens (2000) find that democracy is associated with a 
lower probability that a developing country will liber- 
alize the capital account. This finding may well be due 
to the difficulty of reconciling democratic politics with 
popular demands in developing countries. We there- 
fore include a measure of democracy (see Elkins 2000). 
La Porta et al. (1997) have argued that the nature of 
the legal system has a good deal to do with the pro- 
tection of property rights. Common law systems, they 
claim, better protect investors and facilitate litigation- 
based rather than regulatory approaches to property 
rights. We hypothesize, therefore, that common law 
countries will be more likely to liberalize and less likely 
to restrict their financial markets in the ways exam- 
ined here. Furthermore, certain institutional magnets 
for international capital might encourage the process 
of liberalization. Sylvia Maxfield (1997) has argued 
that an independent central bank—often credited with 
keeping inflation in check—is likely to attract exter- 
nal capital. Central bank independence may therefore 
underpin a government’s confidence in liberalizing the 
flow of capital. We include data from McNamara and 
Castro (augmenting earlier work by Cukierman [1992]) 
that document major episodes of moves to make the 
central bank more independent. And by including a 
measure of the ruling party’s level of nationalism, we 
consider the possibility that nationalist-leaning govern- 
ments will be reluctant to initiate liberalization (see, 
however, Helleiner 2002). 


Geography. Geography alone may continue to exer- 
cise an independent influence on economic policy dif- 
fusion through mechanisms that we have not explic- 
itly considered. We experiment with two geographical 
variables—the logged distance between capitals and 
common borders. Our inclusion of spatial variables in a 
fully specified model allows us to isolate the effect that 
is due exclusively to changing material or reputational 


payoffs and informational influences that would other- 
wise be summarized in a geography term. This strategy 
helps to distinguish the diffusion mechanisms we are 
interested in from mere clustering. 

It is important to keep in perspective just what we 
are trying to accomplish by the fullest possible speci- 
fication of alternative hypotheses. The challenge is to 
parse out diffusion mechanisms from other plausible 
explanations for policy choice. The best way to demon- 
strate the likely causal impact of diffusion is to remove 
alternative explanations for the mere clustering of lib- 
eralization policies in time and space. If the weight 
of the evidence suggests that these controls outperform 
the channels of international diffusion theorized here, 
then we can be satisfied that traditional approaches to 
the political economy of liberalization are apt. 

Table 1 summarizes the mechanisms, concepts, and 
measures in the model and presents their summary 
statistics. 


Sampling and Estimation 


Our sample includes as many as 182 IMF-member 
states with yearly observations from 1967 to 1996.18 
We measure our dependent variables annually in bi- 
nary form (see above). To model policy transitions, 
we employ a semi-Markov model, which is commonly 
used for estimating transitions among mutually exclu- 
sive states of being. This approach allows us to consider 
transitions in both directions as well as vacillations be- 
tween policies (see Allison 1984). We run two hazard 
models for each dependent variable: one for transitions 
to liberal policies and one for transitions from liberal 
policies. We do not have especially strong distributional 
assumptions except that we expect that there will be 
some effect of time on the hazard rate. We use a Weibull 
survival model, but a Cox proportionate hazard model, 
on which Figures 3 and 4 are based, are substantively 
identical. 

Incorporating spatial variables introduces a number 
of statistical complications.’ One is that spatial lags, 
as weighted averages of the dependent variable in the 
“neighborhood,” often capture omitted variables that 
are highly correlated with membership in the group.” 
Specifying the model as completely as possible is impor- 
tant. We are reasonably confident that we have identi- 
fied some important predictors of liberalization, but no 
model is fully specified and caution is in order before 
inferring strong effects for diffusion. Multicollinearity 
is another potential concern with this type of anal- 
ysis. As noted above, networks of influence tend to 
overlap. For example, countries that are geographically 





18 Our data are both left and right censored: Roughly 100 govern- 
ments are under observation since the beginning of our analysis time 
(1967) and another 80 or so enter the analysis in the 1970s. Except in 
the case of state dissolution, all remain at nsk after 1997. 

19 See Anselin 1988 for a discussion of these issues with respect to 
spatial terms and Blalock 1984 or Przeworski 1974 on contextual 
variables more generally. 

20 For this reason, many spatal models of this kind use 2SLS or 
even 3SLS estimators, an approach that comes with its own set of 
complications, especially in event history models 
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clustered are also likely to be important trade partners, 
competitors, or cultural peers. Indeed, this is the case, 
but not to an alarming degree. The correlations across 
the diffusion variables range from —0.03 to 0.43. 


FINDINGS 


What conditions lead to policy transitions, whether 
toward liberalization or toward restrictions? Table 2 
reports the Weibull hazard ratios for each of the six 
equations?! 


Conditions That Affect Materlal and 
Reputational Payoffs: Economic 
Competition and Global Norms 


The most pronounced effect on policy transition comes 
from economic competition, most notably competition 
for global capital. Governments clearly tend to lib- 
eralize when their competitors do. The influence of 
policy change for countries with the same risk rat- 
ings is correctly signed in all six models and is signif- 
icant at least at the 90% confidence level in all cases 
(Table 2).~ Furthermore, when we substituted our in- 
dex of comparable foreign direct investment venues 
(based on education of the work force and development 
of infrastructure), the results were correctly signed in 
all six models and statistically significant in the case 
of capital control liberalization®—precisely where the 
theoretical case for competition is strongest. 

Trade competition, on the other hand, is a less con- 
vincing causal mechanism, theoretically and empiri- 
cally. Liberalization among countries that compete for 
the same export markets is quite likely to be important 
with respect to liberalization of the capital account, but 
the effects are ambiguous in the other cases. When we 
substituted the measure of sectoral competition, all re- 
sults were statistically insignificant. If these policies are 
linked to economic competition, it seems likely that the 
desire to attract capital is a far more powerful motive 
than trade competition. 

What are the sizes of these effects? The hazard ratios 
in Table 3 can be interpreted as the effect on the odds 
of transition associated with a one-unit move on the de- 
pendent variable. Remember that each of the diffusion 





21 Hazard ratios can be roughly understood as the change in the odds 
of tranntion associated with a one-unit change in the explanatory 
variable. Therefore, hazard ratios over one represent an increased 
probability of transition, of zero to one, a decreased probability of 
transition; and of one, zero effect. For the transitions to restrictive 
states, we reverse the scoring of the diffusion variables (originally 
scored as the weighted proportion of liberal policies) to be the 
weighted proportion of restrictive policies. Accordingly, the estimates 
of diffusion effects in Table 3 should be in the same direction for both 
transitiona, Not so for the control variablea 

Z This finding is robust (and nearly identical) to the use of an alter- 
native dataset for untficaton of exchange rates based on Reinhart 
and Rogoff 2002 Using those data, we estimate the hazard ratio for 
the effect of other countnes with a similar credit rating to be 1.756 
(p < 0.05) for unification and 1.419 (p < 0.10) for transition to a 
multiple rate system. 

D The desirability of venue index returned a hazard ratio of 1 414 
(p < 010). 
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variables has been rescaled to range from 0 to 10, so 
that each unit on these variables represents 10 percent- 
age points. Thus, if we consider the policies of capital 
competitors, the model suggests that on average a shift 
of 10 percentage points in the percentage of one’s com- 
petitors with a liberal policy will render a government 
anywhere from 1.8 (capital account) to 2.1 (exchange 
rates) times as likely to shift to a liberal policy. These 
are fairly strong effects, as one can illustrate by plotting 
survival curves for different conditions of competition. 
(In event history analysis, survival curves depict the 
probability of “survival,” that is, the probability of not 
having shifted to a new state or policy, at each time point 
in the analysis.) Figure 3 plots survival curves based on 
estimates of the effect of capital competition on the 
probability of shifting to a liberal capital account policy 
(that is, the first model in Table 3). The curves represent 
two conditions—one in which none of a governments 
competitors has a liberal policy and another in which 
one half of competitors have liberal policies. All other 
variables in the model are held at their means. The 
effects appear to be quite significant. Almost 30% of 
those with liberalized competitors would have shifted 
to liberal policies by the mid 1980s, while less than 5% 
of those without liberalized competitors would have 
shifted by that time. This effect appears especially large 
if we contrast it with that of a currency crisis (Figure 4). 

While the effect of competition appears significant, 
global norms—operationalized as the share of coun- 
tries globally assuming a particular policy stance—had 
few discernible effects, and the one statistically sig- 
nificant result was in the unexpected direction. This 
outcome suggests that it is far more likely that policy 
diffusion takes place via specific conduits carved out 
by economic competition, and not as a result of diffuse 
signals sent by the world at large. 


Diffusion Due to New Information: 
Conspicuous Success, Communication 
Channels, and Cultural Reference Groups 


The hypothesis that success attracts adherents is well 
supported by these results. Table 2 establishes a fairly 
clear pattern: Governments tend to implement the poli- 
cies chosen by other “successful” countries (with the ex- 
ception of capital account liberalization). In four cases, 
there is evidence of a strong tendency to follow the 
policies taken by the highest growth countries, mea- 
sured here as the fastest-growing decile. This is highly 
suggestive evidence that economic policy making of 
the most “successful” becomes data for updating policy 
beliefs—and, ultimately, actions—of governments else- 
where. If the proportion of countries in the top decile 
in growth with a liberal exchange rate policy changes 
from 25% to 75%, the probability of a transition among 
other countries increases by 36 percentage points. 
Communication networks add little to the unchan- 
neled Bayesian updating model. Individual measures 
of private communication via telephone generally 





24 This finding applies to the highest-growth countries. When the 
cutoff was made at the median, these effects were not significant. 
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TABLE 3. The Improvement In Model Fit Assoclated with Each Explanatory Mechanism— 
Likellhood-Ratlo Tests of the Joint Significance of Blocks of Variables: Universe, 
IMF Member Countries, 1966-96 














Transitions to Liberal Policies Transitions to Restrictive Policies 


Capital Exchange Current Capttal Exchange Current 
Account Rate Account Account Rate Account 
Log IIkellhood of full model 153.988 628.616 379.005 116.229 329.877 426.308 
Constralned models 
Attered 
Competition = 0 26.87 (2) 25.62 (2) 64.43 (2) 1.44 (2) 33.21 (2) 6.24 (2) 
p=0.00 p=000 p=0.00 p=0.48 p=0.00 p=0.04 


































New Information 
Learning from success = 0 0.96 (1) 2.82 (1) 6.51 (1) 3.00 (1) 0.75 (1) 3.52 (1) 
p=0.32 p=0.09 p=0.01 p=0.08 p=039 p=0.06 
Officlal communication = 0 12.58 (2) 3.50 (2) 6.03 (2) 0.15 (2) 0.68 (2) 1.23 (2) 
p=0.01 p=0.17 p=0.49 p=0.92 p=0.71 p=0.54 
Cultural similarity = 0 11.44 (3) 10.16 (3) 18.07 (3) 5.22 (3) 19.08 (3) 46.8 (3) 
p=001 p=0.01 p=0.00 p=0.15 p=000 p=0.00 











Control mechanisms 
Economlc determinants = 0 13.68 (3) 14.29 (3) 14.85 (3) 4.45 (3) 10.05 (3) 4.82 (3) 






External political pressures = 0 






Polttical determinants = 0 








Geography = 0 






FIGURE 3. The Probability of Maintaining a Restrictive Capital Account: The Effect of Policies of 
Capital Competitors 
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Crisis In the Preceding Year 


FIGURE 4. The Probability of Malntalning a Restrictive Capital Account: The Effect of a Currency 
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Note: Survival curve calculated from Cox proportional hazard estimates. All variables except currency crisis held at thair means. 





had no effects and were dropped from the models. 
Private business contacts (proxied here as major im- 
port and export partners) produce results that are in- 
consistently signed and yield no clear insights. Official 
contacts fared little better. Common membership in a 
PTA may predict transitions to unified exchange rate 
systems but otherwise have no discernible effects. The 
policies of countries with which a country has signed a 
bilateral investment treaty appear, if anything, to have 
a negative effect on policy (statistically significant in 
the cases of capital account liberalization and exchange 
rate unification). We believe that this result probably 
has to do with adverse selection effects for signing a BIT 
in the first place (Elkins, Guzman, and Simmons 2003). 
Overall, we find no compelling evidence that commu- 
nication ties alone affect the decision to liberalize or to 
restrict external monetary and financial policies. 
Consider next the effects of cultural reference 
groups. We have argued that governments may rea- 
sonably search out new information from cases per- 
ceived to have cultural relevance to their own situ- 
ation. The most conspicuous finding here is that the 
policies of countries with similar dominant religion are 
remarkably influential. The effects of the other two 
measures—common language and common colonial 
heritage—wash out in all but a few cases (note that 
policies among similar language and religious group- 
ings are positively but mildly correlated at 0.29). The 
effects of the policies of countries with shared religious 
values are significantly positive for each of the six policy 
transitions.~ Moreover, the size of this effect is impor- 


2 This finding is robust (and nearly identical) to the use of an alter- 
native dataset for unification of exchange rates based on Reinhart 


tant. In fact, a move of one standard deviation one way 
or another on the mean policy of countries with sim- 
ilar dominant religions is associated with a change of 
roughly 15 points in the probability of a country’s policy 
transition. Cultural lenses that influence acceptance of 
a particular economic policy model may provide far 
more purchase on actual policy choices than the recent 
political economy literature has allowed. 


Control Variables 


Finally, we consider the effects of the control variables. 
The hypothesis that governments are responding to var- 
ious economic shocks is not especially well supported. 
The directions of effects are generally as one would 
expect, however. High world interest rates may tend 
to decrease the probability of liberalization of the cur- 
rent account. GDP growth may encourage and help 
maintain current account liberalization—its only sta- 
tistically significant effect. Current account surpluses 
tend to be associated with policy inertia with respect to 
the unification of the exchange rate regime (there is a 
significant reduction in the hazard rate for both liberal- 
ization and restriction), whereas deficits tend to stim- 
ulate policy change in both directions. Currency crises 
seem to be associated with liberalization in the follow- 
ing period, a result that was statistically significant for 
both the capital account and exchange rate unification. 


and Rogoff 2002 Using those data, we estimate the hazard ratio for 
the effect of other countries of the same dominant religion to be 
1.91 (p < 0 10) for unification and 1.184 (p < 0 05) for transition to 


a multiple rate system. 
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Developing countries as measured by per capita GDP 
almost certainly find it more difficult to make lib- 
eral transitions (statistically significant in every case), 
but the hazard ratios for implementing restrictions, al- 
though notstatistically significant, indicate by their con- 
sistency that developing countries may find it harder to 
implement policy transitions at all. 

The results from the hegemonic variables are mixed. 
In none of the models did borrowing from the IMF 
increase the likelihood of a liberal policy shift. On the 
other hand, as one might expect, a contract with the 
IMF appears to be associated with a reluctance to re- 
strict the current account. The use of IMF credits is as- 
sociated with restrictions on the capital account, which 
is probably indicative of capital flight problems that 
precipitated extramarket borrowing in the first place. 
There is also some evidence that overseas development 
assistance is in some way related to policy shifts in these 
three areas. Dependence on such aid (which includes 
both multilateral and bilateral sources) is associated 
with liberalization of the current account and eschew- 
ing of multiple exchange rate regimes, but it is also 
associated with a reduction in the probability of lib- 
eralizing capital markets. In order to test hegemonic 
arguments further, we experimented with models that 
controlled for share of exports to the United States, 
share of imports from the United States, and joint mem- 
bership with the United States in a preferential trade 
arrangement or in a bilateral investment treaty. We also 
entered all of these variables for relations with the Eu- 
ropean Union. We then controlled for the party affilia- 
tion of the U.S. president. None of these specifications 
returned statistically significant results, and they were 
dropped from the model. Overall, it is difficult to sus- 
tain the argument that the waves of liberalization and 
restriction in these policy areas have been systemati- 
cally influenced by direct or organizationally mediated 
hegemonic pressure. 

The domestic political variables also behave unex- 
pectedly weakly. As the political economy literature 
has long held, there is evidence that people residing 
in open economies demand and probably get greater 
policy liberalization, at least in the case of the capital 
account, and revert to restrictions with less frequency. 
Nationalist governments are much less likely to liber- 
alize (especially on the capital account). Democracies 
tend to favor liberal transitions, although these results 
are statistically significant only for the capital account. 
The effects of legal heritage—hypothesized to reflect 
attitudes toward regulation and property rights—were 
inconsistent and, with one exception in the wrong direc- 
tion, insignificant. Improvements in central bank inde- 
pendence were similarly not associated with particular 
policy demarches. 

Once we control for other factors, it appears that 
geography per se is not a convincing explanation of 
policy diffusion. The apparently strong geographical 
effects in Figure 1 are obliterated by the functional and 
cultural relationships described above. With one minor 
exception in the wrong direction, neither geographical 
distance nor status as a bordering country had any in- 
dependent effect on policy stance. 
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Another way to look at the plausibility of the diffu- 
sion mechanisms we have proposed is to examine their 
effects as a block. This is useful because we want to 
make claims about broad mechanisms for which we em- 
ploy related, but disaggregated measures. In Table 3, 
we summarize the strength of these blocks by com- 
paring the full model with nested models in which the 
blocks of diffusion effects are constrained to zero. The 
likelihood-ratio test of such comparisons indicates the 
improvement in fit associated with the addition of each 
block of variables. 

Viewed in this way, the aggregate effects of clusters 
of measures are quite robust. Economic competition 
is an especially important and consistent part of the 
explanation for change in both directions for each of the 
three policy areas: In five of the six transition models, 
the block of measures of competition is almost certainly 
significant, with the only exception being transitions to 
restrictive policies in the current account. The variables 
that represent cultural similarity taken as a block are 
consistently important in explaining policy choice, with 
the possible exception of capital account liberalization. 
On the other hand, communication networks do not 
clearly add much explanatory power to policy choice. 
As we had only one indicator of learning from success, 
the results in Table 3 reflect the findings reported in 
Table 2. 

The blocks of control variables are much less con- 
vincingly associated with policy choice in these areas. 
The economic variables jointly contribute to an im- 
provement in model fit in only half of the cases. The 
domestic political and institutional variables do a rela- 
tively good job at improving the fit for liberal transitions 
but fail to add anything to our understanding of policy 
restrictions. Hegemonic pressure seems to help explain 
capital and current account liberalization, but our ear- 
lier models uncovered a counterintuitive direction to 
these influences in the former case. Geography alone 
never stands up to scrutiny. 


CONCLUSIONS 


There are good reasons to believe that governments 
are sensitive to external signals to liberalize and to 
restrict their monetary and financial policies. Tempo- 
ral and spatial clustering support the proposition that 
something systematic must be driving states’ policies in 
this way. Indeed, this characterization is easy to accept 
intuitively. Scholars and laypersons alike find it easy to 
grasp the competitive implications for Mexico of free 
trade between the United States and Canada (Gruber 
2000), as well as the socially emulative impulses of 
developing countries (Finnemore 1996). Much is at 
stake, theoretically and practically, in recognizing the 
importance of policy diffusion to the current state of 
globalization. The recent political economy literature 
has concentrated primarily on the domestic sources of 
foreign economic policy or, at most, economic policy 


26 The null hypothesis of these tests is that the joint effect of the block 
of variables is zero. A rejection, therefore, suggests that the vanables 
mmprove the fit of the model. 
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choice in response to price signals from the unmediated 
international economy. Purely economic explanations 
of policy coordination—theories of optimal currency 
areas, for example—have consistently failed to cap- 
ture government choices on the ground.” Meanwhile, 
international financial markets have been extolled or 
vilified by heavily ideological accounts that tend to 
hinder rather than assist in creative analysis. Further- 
more, some scholars and observers have attributed pol- 
icy liberalization to exogenous pressures from the IMF 
as the organizational embodiment of the “Washington 
Consensus” The analysis in this paper suggests that 
these approaches do not sufficiently explain why gov- 
ernments decide to open or restrict their economies. 

We have explored two broad sets of mechanisms that 
might explain patterns that appear to involve policy 
diffusion among countries: altered payoffs and new in- 
formation. As an example of changing payoffs, com- 
petition for international capital seems to be an es- 
pecially compelling explanation for the international 
diffusion of liberal economic policy. Across all policy 
areas, policy liberalization is highly correlated with the 
orientation of other governments that compete for the 
same slice of global capital. Could this simply be due to 
the fact that similarly rated countries are economically 
similar in a number of ways, and so have independently 
similar incentives to open and restrict their capital and 
current accounts? This is possible, though it is rendered 
much less likely by the inclusion of a battery of eco- 
nomic controls (growth, balance of payments, world 
interest rates, currency crises, developmental level) that 
should to some extent control for this problem. We 
also found that a completely different specification of 
capital competitors, aimed more directly at competi- 
tion for foreign direct investment and based on similar 
levels of education and infrastructure, yielded results in 
the correct direction, which were highly significant for 
liberalization of the capital account. The relationship 
between competition for capital and policy diffusion 
is so empirically strong and theoretically plausible in 
these tests that it should be a high priority for future 
research. 

Next we tested arguments that new information 
and learning contribute to economic policy diffusion. 
Empirical work on learning must be grounded in plau- 
sible, observable proxies for this essentially psycholog- 
ical process. Our strategy has been to look for the ob- 
servable implications of learning, which we believe in 
the first instance are influenced by examples of conspic- 
uous economic success. Indeed, policies of the highest 
growth countries did have a significant impact on four 
of the policy choices examined here. Interestingly, this 
finding did not hold for capital account liberalization. 
Given the recent acknowledgment of such proliber- 
alization institutions as the IMF and the Economist 
newspaper that capital account liberalization makes 


77 Of the many studies that fail to explain extant currency patterns 
based on OCA theory, see Ghosh and Wolf (1994), who find that 
neither Europe nor the United States forms an optimum currency 
area; for both regions the costs of adopting a single currency exceeds 
estimates of the transaction cost savings. 


sense only in specific macroeconomic and regulatory 
contexts, this finding is actually quite encouraging. 

Few political economists, (though of course many 
more sociologists) would have nominated broad cul- 
tural orientation as a central explanation for policy dif- 
fusion. The results here, however, are difficult to ignore. 
Our results show that governments tend to liberalize 
and to restrict the capital account, current account, and 
exchange rate regime along the lines of countries with 
which they share a religious identity, when we control 
for a wide range of other factors. A striking finding is 
that this relationship holds in all three policy areas, and 
symmetrically for both liberalization and restriction. 
The evidence supports neither the “world culture” so- 
ciologists, who emphasize the irrationality of absorbing 
global culture willy-nilly, nor the political economists, 
who remove culture from the calculus of policy choice 
altogether. We suggest that governments systematically 
consider the lessons their cultural peers have to offer 
when fashioning their own economic policy choices. Of 
course, religion should be thought of as just one indi- 
cator that taps the broader value orientation or cul- 
tural identity of a society. Note also that we are not 
arguing that religion speaks directly to the question of 
capital controls or exchange rate arrangements. These 
results do suggest, though, that values common to a par- 
ticular religious tradition may shape attitudes toward 
risk, individualism, equality, and materialism generally. 
Governments tend to take these shared attitudes into 
account when searching for appropriate models in the 
absence of perfect (or perfectly understood) informa- 
tion. Cultural values may be a fundamental source of 
identity for governments as much as for individuals, 
with consequences for highly material arenas of policy 
choice. 

One thing is clear: Economics and comparative po- 
litical economy can take us only so far in understanding 
the ebb and flow of foreign economic policy liberaliza- 
tion over the past three decades. The apparent diffusion 
of policy choice over this time demands explicitly inter- 
national or indeed transnational theory and testing. As 
we think in these directions, we may uncover under- 
emphasized sources of authority that structure compe- 
tition and channel the search for appropriate models of 
foreign economic policy. Research into the dynamics of 
globalization and its underlying governance structures 
should push us to understand how and why this takes 
place. 
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as political freedom and efficacy, that researchers define best with reference to examples; and 

what to do when respondents interpret identical questions in different ways. Scholars have 
long addressed these problems with approaches to reduce incomparability, such as writing more concrete 
questions—with uneven success. Our alternative is to measure directly response category incomparability 
and to correct for it. We measure incomparability via respondents’ assessments, on the same scale as the 
self-assessments to be corrected, of hypothetical individuals described in short vignettes. Because the 
actual (but not necessarily reported) levels of the vignettes are invariant over respondents, variability in 
vignette answers reveals incomparability. Our corrections require either simple recodes or a Statistical 
model designed to save survey administration costs. With analysis, simulations, and cross-national surveys, 
we show how response incomparability can drastically mislead survey researchers and how our approach 


W: address two long-standing survey research problems: measuring complicated concepts, such 


can alleviate this problem. 


ory, including a rough agreement on normative 

theories preferring freedom, democracy, and po- 
litical equality, among others, and the development of 
positive theories focused on understanding the causes 
and consequences of these variables. Empirical polit- 
ical science, in turn, is devoted in large part to mak- 
ing causal inferences about these same variables. Un- 


T: discipline of political science is built on the- 
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dergirding this superstructure of theory. and causality 
is measurement, the detailed mapping of the levels of 
these basic variables. Although it may not seem as ex- 
citing as causal inquiry, better measurement obviously 
has the potential to affect our understanding of the ex- 
tent of any problems that may need addressing and the 
estimates of any causal effects. Indeed, achieving the 
theoretical and causal goals of our field and all other 
empirical fields “would seem to be virtually impossi- 
ble unless its variables can be measured adequately” 
(Torgerson, 1958). 

We address two long-standing problems with mea- 
surement using sample surveys (a data collection device 
used in about a quarter of all articles and about half of 
all quantitative articles published in major political sci- 
ence journals [King et al. 2001, fn 1]). The first is how to 
measure concepts researchers know how to define most 
clearly only with reference to examples—freedom, po- 
litical efficacy, pornography, health, etc. The advice 
methodologists usually give when hearing “you know it 
when you see it” is to find a better, more precise theory 
and then measurement will be straightforward. This is 
the right advice, but it leads to a well-known problem in 
that highly concrete questions about big concepts like 
these often produce more reliable measurements but 
not more valid ones. 

The second problem we address occurs because “in- 
dividuals understand the ‘same’ question in vastly dif- 
ferent ways” (Brady 1985). For example, Sen (2002) 
writes that 

the state of Kerala has the highest levels of literacy ... and 


longevity . . . in India. But it also has, by a very wide margin, 
the highest rate of reported morbidity among all Indian 
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states.... At the other extreme, states with low longevity, 
with woeful medical and educational facilities, such as 
Bihar, have the lowest rates of reported morbidity in India. 
Indeed, the lowness of reported morbidity runs almost fully 
in the opposite direction to life expectancy, in interstate 
comparisons.... In disease by disease comparison, while 
Kerala has much higher reported morbidity rates than the 
rest of India, the United States has even higher rates for 
the same illnesses. If we insist on relying on self-reported 
morbidity as the measure, we would have to conclude that 
the United States ıs the least healthy in this comparison, 
followed by Kerala, with ill provided Bihar enjoying the 
highest level of health. In other words, the most common 
measure of the health of populations is negatively corre- 
lated with actual health. 


Studying why individuals have perceptions like these, 
so far out of line with empirical reality, “deserves atten- 
tion” but measuring reality only by asking for respon- 
dents’ perceptions in these situations can be “extremely 
misleading” (Sen 2002). 

The literature on this problem has focused on de- 
veloping ways of writing more concrete, objective, and 
standardized survey questions and developing methods 
to reduce incomparability. Despite a half-century of 
efforts, however, many important survey instruments 
are still not fully comparable (Suchman and Jordan 
1990). Indeed, even though political scientists have 
been aware of the devastating consequences of ignoring 
the problem for almost two decades (Brady 1985), the 
lack of tools to deal with it has meant that the compa- 
rability of most of our survey questions has not even 
been studied. 

We have designed a new approach to survey instru- 
mentation that seems to partially ameliorate both prob- 
lems, Our key idea, in addition to following the vener- 
able tradition of trying to write clearer questions that 
are more comparable, is a method of directly measuring 
the incomparability of responses to survey questions, 
and then correcting for it. We ask respondents for self- 
assessments of the concept being measured along with 
assessments, on the same scale, of each of several hy- 
pothetical individuals described by short vignettes. We 
create interpersonally comparable measurements by 
using answers to the vignette assessments, which have 
actual (but not reported) levels of the variables that 
are the same for every respondent, to adjust the self- 
assessments. Our adjustments can be made with sim- 
ple calculations (straightforward recode statements) or 
with a more sophisticated statistical model that has the 
advantage of lowering data collection costs. Easy-to- 
use software to implement our statistical methods, a 
library of examples of survey questions using our ap- 
proach, and other related materials can be found at 
http: //GKing.Harvard.edu/vign/. 


PREVIOUS APPROACHES 

The most widely used modern terminology for inter- 
personal incomparability is differential item function- 
ing (DIF), which originated in the educational testing 
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literature.’ The search for methods of detecting or con- 
quering DIF usually centers on the identification of 
common anchors that can be used to attach the answers 
of different individuals to the same standard scale. 

The earliest and still the most common anchors in- 
volve giving the endpoints of the (or all) survey re- 
sponse categories concrete labels—“strongly disagree,” 
“hawk,” etc. This undoubtedly helps, but is often insuf- 
ficient. An early and still used alternative is the “self- 
anchoring scale,” where researchers ask respondents 
to identify the top- and bottommost extreme examples 
they can think of (e.g., the name of the respondent’s 
most liberal friend and most conservative friend) and 
then to place themselves on the scale with endpoints de- 
fined by their own self-defined anchors (Cantril 1965). 
This approach is still used but, depending as it does on 
extremal statistics, it often lowers reliability, and it will 
not eliminate DIF if respondents possess different lev- 
els of knowledge about examples at the extreme values 
of the variable in question. 

Researchers sometimes compare a survey response 
at issue to “designated anchors,” which are ques- 
tions that tap the same concept that experts believe 
have no DIF (Przeworski and Teune 1966-67; Thissen, 
Steinberg, and Wainer 1993). This is an important ap- 
proach, but as the authors recognize, it begs the ques- 
tion of where knowledge of the anchors come from 
in the first place. Sometimes researchers evaluate each 
survey question in turn by comparing it with an av- 
erage, or factor analyzed weighted average, of all the 
others that measure the same concept. As is also widely 
recognized, however, the assumption that all the other 
questions do not have DIF on average, as each question 
moves in and out of the “gold standard” comparison 
group, is internally inconsistent. 

Although not widely known outside our field, the 
most satisfactory approaches to correcting for DIF in 
any field have been in the context of application-specific 
models built by political scientists. The first such model 
was Aldrich and McKelvey (1977), which estimated the 
positions of candidates and voters in a common issue 
space. The actual positions of candidates were assumed 
the same for all respondents and, so, could be used 
as anchors to adjust both candidate and voter issue 
positions. Since these actual positions are unobserved, 
Aldrich and McKelvey assume that voters have unbi- 
ased perceptions of candidate positions but that the 
reported positions are linearly distorted in an unknown, 
but estimable, way. Because of the constrained compu- 
tational resources available at the time, they recognized 








1 In the educational testing literature, a test question is said to have 
DIF if equally able individuals have unequal probabilities of answer- 
ing the question correctly The analyms of reasons for the varying 
test performance of students in different racial groups has provided 
considerable umpetus for the study of DIF. Indeed, the term DIF 
was chosen to replace the older “item bias” term as an effort to 
sidestep some of the politically charged issues involved (see Holland 
and Wainer 1993 for a review of the literature). Paradoxically, the 
method we introduce here would seem applicable to all flelds where 
DIF is an issue except for educational testing. 
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but did not model several other features of the problem, 
such as the ordinal nature of the response categories” 

Using a similar logic, Groseclose, Levitt, and Snyder 
(1999) adjust interest group scores across time and 
houses of congress by using scores on the same leg- 
islator at different times (when serving in the same or 
different chambers) ‘as anchors. Their model thus as- 
sumes that members have constant expected, but not 
measured, interest group scores. Poole and Rosenthal’s 
(1991) widely used D-Nominate scores for scaling legis- 
lators and roll calls applies analogous ideas for anchors 
(see also Heckman and Snyder 1997 and Poole and 
Daniels 1985). Londregan (2000) uses similar anchor- 
ing in a model more amenable to small samples and 
resolves several identification problems by simultane- 
ously modeling the agenda, while Clinton, Jackman, 
and Rivers (2002) present a fully Bayesian approach. 
Baum (1988) adjusts the scaling of the liberalness of 
Supreme Court decisions by assuming the stability of 
individual justices over time, and anchoring the court 
decisions to justices that serve in more than one “nat- 
ural” court. See also Lewis 2001 for a similar approach 
to scaling voting behavior and for his review of other 
work in this area. 

The anchors used in most political science applica- 
tions are far better than the unadjusted values (and 
better than most anchors available in other fields), but 
as is fully recognized by the authors, the strategies em- 
ployed by political actors mean that the anchors are not 
completely free of DIF. For example, a reasonable char- 
acterization of much of the partisan process of writing 
legislation is to create DIF—to make the choice harder 
for opposition legislators than members of one’s own 
party. Similarly, if candidates succeed in being even in 
part “all things to all people,” the use of voter percep- 
tions of candidate positions as anchors could be biased. 

Most current efforts at dealing with DIF in other 
fields try to identify questions with DIF and delete 
them or collapse categories to avoid the problem 
(Holland and Wainer 1993). Some model DIF in unidi- 
mensional scales as additional unobserved dimensions 
(Caroll and Chang 1970; Shealy and Stout 1993). Others 
use Rasch models, a special case of item response 
theory, which come with a variety of statistical tests 
and graphical diagnostics (see Piquero and Macintosh 
2002). The multidimensional scaling literature has also 
paid considerable attention to DIF, which they call 
“interpersonal incomparability” (Brady 1989) or “in- 
dividual differences scaling” (Alt, Sarlvik, and Crewe 
1976; Clarkson 2000; Mead 1992). Others parse DIF 
into components like “acquiescence response set,” the 
differential propensity of respondents to agree with 
any question, no matter how posed; “extreme response 
set,” the differential propensity of respondents to use 
extreme choices offered, independent of the question; 


2 Palfrey and Poole (1987) show that the Aldrich and McKelvey pro- 
cedure recovers candidate locations well, even if errors (contrary to 
the model) are heteroskedastic over candidates, but voter postions 
are biased toward the mean, especially for poorly informed voters. 
Poole (1998) generalizes Aldrich and McKelvey 1977 to multiple 
dimenmons and to handle mussing data. 


and many others (Cheung and Rensvold 2000; Johnson 
1998; Stewart and Napoles-Springer 2000). DIF poten- 
tially affects most survey-based research throughout 
political science and in a wide variety of other fields. 


SURVEY INSTRUMENTATION: ANCHORING 
VIGNETTES 


The usual procedure for measuring sophisticated con- 
cepts with surveys is to gather a large number of exam- 
ples and design a concrete question that covers as many 
of the examples as possible. Our idea is, in addition to 
this approach, to use the examples themselves in survey 
questions to estimate each person’s unique DIF, and 
to correct for it. Examples presented in vignettes to 
respondents have a long history of use for other pur- 
poses in survey research (e.g., Kahneman, Schkade, and 
Sunstein 1998; Martin, Campanelli, and Fay 1991; Rossi 
and Nock 1983). We use an adapted version of vignettes 
that generalize the ideas in application-specific DIF- 
related research in political science. 

We ask survey respondents in almost the same lan- 
guage for a self-assessment and for an assessment of 
several (usually five to seven) hypothetical persons de- 
scribed by written vignettes. For example, the anchor- 
ing vignettes for one particular domain of political ef- 
ficacy might be as follows. 


1. “[Alison] lacks clean drinking water. She and her 
neighbors are supporting an opposition candidate 
in the forthcoming elections that has promised to 
address the issue. It appears that so many people 
in her area feel the same way that the opposition 
candidate will defeat the incumbent representative.” 

2. “[Imelda] lacks clean drinking water. She and her 
neighbors are drawing attention to the issue by col- 
lecting signatures on a petition. They plan to present 
the petition to each of the political parties before the 
upcoming election.” 

3. “[Jane] lacks clean drinking water because the gov- 
ernment is pursuing an industrial development plan. 
In the campaign for an upcoming election, an oppo- 
sition party has promised to address the issue, but 
she feels it would be futile to vote for the opposition 
since the government is certain to win.” 

4, “[Toshiro] lacks clean drinking water. There is a 
group of local leaders who could do something about 
the problem, but they have said that industrial de- 
velopment-is the most important policy right now 
instead of clean water.” 

5. “[Moses] lacks clean drinking water. He would like 
to change this, but he can’t vote, and feels that no 
one in the government cares about this issue. So he 
suffers in silence, hoping something will be done in 
the future.” 


(We view these vignettes as falling on an ordered scale, 
from most to least efficacy; our empirical analyses, be- 
low, support this interpretation.) The following often- 
used question is then read to the respondent for each 
vignette and for a self-assessment: 
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How much say [does ‘name’/do you] have in getting the 
government to address issues that interest [him/her/you]? 


For the self-assessment and each of the vignette ques- 
tion, respondents are given the same set of ordinal re- 
sponse categories, for example, “(1) No say at all, (2) 
Little say, (3) Some say, (4) A lot of say, (5) Unlimited 
say.” Answers to this self-assessment question are nor- 
mally referred to as “political efficacy,” and we use this 
shorthand too. But what we are measuring in fact is no 
more or less than the concept defined by the vignette 
definitions, which is at best only one specific dimension 
of political efficacy. Other dimensions could be tapped 
with separate sets of vignettes. 

We recommend asking the self-assessment first, fol- 
lowed by the vignettes randomly ordered. We also often 
randomly shuffle vignettes from two domains together. 
When feasible, we change the names on each vignette 
to match match each respondent’s culture and sex. 


MEASUREMENT ASSUMPTIONS 


Our approach requires two key measurement assump- 
tions. First, response consistency is the assumption that 
each individual uses the response categories for a par- 
ticular survey question in the same way when providing 
a self-assessment as when assessing each of the hypo- 
thetical people in the vignettes. Respondents may have 
DIF in their use of survey response categories for both 
a self-assessment and the corresponding vignettes, but 
the type of DIF must be approximately the same across 
the two types of questions for each respondent. In other 
words, the type of DIF may vary across respondents, 
and also for a single respondent across survey questions 
(each with its own self-assessment and corresponding 
set of vignettes), but not within the self-assessment and 
vignette questions answered by any one respondent 
about a single survey question. This assumption would 
be violated if respondents who feel inferior to hypothet- 
ical individuals set a higher threshold for what counts 
as their having “a lot of say” in government than they 
set for the people described in the vignettes. 

Second, vignette equivalence is the assumption that 
the level of the variable represented in any one vignette 
is perceived by all respondents in the same way and 
on the same unidimensional scale, apart from random 
measurement error. In other words, respondents may 
differ with each other in how they perceive the level of 
the variable portrayed in each vignette, but any differ- 
ences must be random and hence independent of the 
characteristic being measured. (Of course, even when 
respondents understand vignettes in the same way on 
average, different respondents may apply their own 
unique DIFs in choosing response categories.) This as- 
sumption would be violated if one set of respondents 
saw the vignettes above as referring to say in govern- 
ment through elections, as we intended, and the other 
interpreted our choice of words in one vignette to be 
referring to say in government through one’s personal 
connections. 

Thus, although we allow and ultimately correct for 
DIF in using survey response categories, assuming uni- 
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dimensionality means that we assume the absence of 
DIF in the “stem question.” It seems reasonable to 
focus on response-category DIF alone because the vi- 
gnettes describe objective behaviors, for which tradi- 
tional survey design advice to avoid DIF (such as writ- 
ing items concretely and using pretesting and cognitive 
debriefing, etc.) is likely to work reasonably well. In 
contrast, response categories describe subjective feel- 
ings and attitudes, and so should be harder to lay out 
in concrete ways and avoid DIF without our methods. 
Whether our response-category DIF correction is suf- 
ficient is of course an empirical question. Future re- 
searchers may wish to try to generalize our methods to 
deal with both types of incomparability. 

Even more basic than vignette equivalence, but im- 
plied by it, is the assumption that the variable being 
measured actually exists and has some logically co- 
herent and consistent meaning in different cultures. 
For variables and cultures where the extreme version 
of the area studies critique is correct, so that differ- 
ent regions are truly unique and variables take on 
completely different meanings, then any procedure, 
including this one, will fail to produce comparable 
measures. 

How do response consistency and vignette equiva- 
lence help correct for DIF? The problem with self- 
assessment questions is that answers to them differ 
across respondents according to both the actual level 
and DIF (along with random measurement error). In 
contrast, answers to the vignettes differ across respon- 
dents only because of DIF (and random measurement 
error). Since the actual level of political efficacy of the 
people described in the vignettes is the same for all re- 
spondents, we are able to use variation in answers to the 
vignettes to estimate DIF directly. We then “subtract 
off” this estimated DIF from the self-assessment ques- 
tion to produce our desired DIF-free (or DIF-reduced) 
measure. 

The key goal of survey design under this approach, 
then, is not to design DIF-free vignette questions, which 
would be as difficult as for self-assessment questions, 
but rather to achieve response consistency and vignette 
equivalence. Thus, vignettes should be written to de- 
scribe, in clear and concrete language, only the actual 
level of political efficacy of the person described, with 
all other language in the vignette geared to encourage 
respondents to think the person described is someone 
just like themselves in all other ways. In that way, the 
respondent would find it easier to use the response 
categories in the same way for the vignette as for the 
self-assessment. 

The methods described below include some tests of 
aspects of these assumptions, but for the most part they 
require iterating among concept definition, question 
development, pretesting, and cognitive debriefing. Un- 
like purely observational research, the veracity of the 
assumptions here is under the active control of the in- 
vestigator in designing the research—as in political sci- 
ence laboratory (Kinder and Palfrey 1993), field (Green 
and Gerber 2001), and survey experiments (Sniderman 
and Grob 1996)—but of course having control does not 
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A SIMPLE (NONPARAMETRIC) APPROACH 


We now combine our survey instrumentation and mea- 
surement assumptions to show how to correct DIF 
without sophisticated statistical techniques. The sim- 
plicity of this approach is also helpful in illustrating the 
key concepts and in clarifying the source of the new 
information. 

This method can easily be used, and we use it be- 
low, but it also has two important disadvantages: First, 
it requires the vignette questions be asked of all the 
same respondents as the self-assessments, and so it 
can be expensive to administer. Second, as with many 
nonparametric methods it is statistically inefficient in 
some circumstances, which means that by foregoing as- 
sumptions some information is wasted. Our parametric 
approach, described in the section that follows, avoids 
these problems, However, since the nonparametric ap- 
proach makes none of the parametric models’ statistical 
assumptions and requires no explanatory variables, it 
makes possible several diagnostic tests of the paramet- 
ric model’s assumptions. 

Figure 1 portrays one self-assessment and three vi- 
gnette assessments for each of two individual survey 
respondents (labeled 1, on the left, and 2, in the middle). 
The self-assessed level of political efficacy is higher for 
Respondent 1 (and they agree on the ordinal ranking 
of the vignettes). However, the fact that Alison’s (or 
Jane’s or Moses’s) actual level of political efficacy is the 
same no matter which respondent is being asked about 
her makes it possible to make the two comparable by 
stretching Respondent 2’s scale so that the vignette as- 
sessments for the two respondents match. We do this in 
the scale on the right in Figure 1. With this adjustment, 
we can see that in fact Respondent 2 has a higher level of 
actual political efficacy than Respondent 1. This comes 
from the fact that Respondent 1’s rates herself lower 
than Jane, whereas Respondent 2 rates herself higher 
than Jane. 

Analyzing anchoring vignettes data by literally mark- 
ing and stretching rubber bands to match Figure 1 
would work fine, but we also offer an even simpler 
method. The idea is to recode the categorical self- 
assessment relative to the set of vignettes. Suppose that 


all respondents order the vignettes in the same way. 
Then for the vignettes in Figure 1, assign the recoded 
variable 1 if the self-assessment is below Moses, 2 if 
equal to Moses, 3 if between Moses and Jane, 4 if equal 
to Jane, 5 if between Jane and Alison, 6 if equal to 
Alison, and 7 if above Alison. (By this coding, the first 
respondent in Figure 1 is coded 3 and the second is 
coded 5.) The resulting variable is DIF-free, has eas- 
ily interpretable units, and can be analyzed like any 
other ordinal variable (e.g., with histograms, contin- 
gency tables, or ordered probit). This method assumes 
response consistency and vignette equivalence, but no 
additional assumptions or models are required. 

To define this idea more generally, let y, be the cate- 
gorical survey self-assessment for respondent i (i= 
1,...,m) and z be the categorical survey response 
for respondent i on vignette j (j =1,..., J). Then for 
respondents with identical ordinal rankings on all vi- 
gnettes (%)-1 < Zy, for alli, j), the DIF-corrected vari- 
able is 


if yu < 41, 
i “= 41, 
if Zi < Ys < %2, 


“WNP 
aa) 


2J+1 if y > Zw. 
Respondents with ties in the vignette answers would re- 
duce our knowledge of C, to a set of values rather than 
just one value. Inconsistencies in the ordinal ranking 
are grouped and treated as ties. When few survey re- 
sponse categories exist with which to distinguish among 
the categories of C, additional collapsing may occur. 
The inefficiencies in this method come precisely from 
the information lost due to these ties and ranking in- 
consistencies. (In contrast, our parametric method, de- 
scribed below, recognizes that some of these will be due 
to the random error always present in survey responses, 
and so it can extract more information from the data.) 
To study this method, we included the questions 
on the electoral dimension of political efficacy de- 
scribed above on a sample survey of two provinces in 
China (with n = 430 respondents) and three in Mexico 
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FIGURE 2. Nonparametric Estimates of an Electoral Dimension of Political Efficacy 
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(1 =551). The surveys were completed in June of 2002 
for the World Health Organization. Since these surveys 
were designed as pretests for subsequent nationally 
representative samples, each province surveyed was 
chosen to be roughly representative of the entire coun- 
try. In our experience, pretests such as these usually 
turn out similar to the results from our subsequent 
nationwide surveys, but obviously this analysis should 
only be considered a comparison of the provinces or 
people surveyed. 

Despite the absence of a gold standard measurement, 
the difference between these countries on political effi- 
cacy could hardly be more stark. The citizens of Mexico 
recently voted out of office the ruling PRI party in an 
election closely observed by the international commu- 
nity and widely declared to be free and fair. After a 
peaceful transition of power, the former opposition 
party took control of (and still controls) the reins of 
power. Despite the existence of limited forms of lo- 
cal democracy, nothing resembling this has occurred in 
China. Levels of political efficacy presumably also vary 
a good deal within each country, with, for example, po- 
litical elites in China having high levels and prisoners in 
Mexico having low levels, but the average differences 
would seem to be unambiguous. 

If we did not know these facts, and instead used stan- 
dard survey research techniques, we would have been 
seriously misled. The left graph in Figure 2 plots his- 
tograms of the observed self-assessment responses, and 
quite remarkably, it shows that the Mexicans think that 
they have less say in government than the Chinese think 
that they have. The right graph plots C, our nonpara- 
metric DIF-corrected estimate of the same distribution. 
The correction exactly switches the conclusion about 
which country has more political efficacy, and makes it 
in line with what we know. Indeed, the spike at C=1 
is particularly striking: 40% of Chinese respondents 
judge themselves to have less political efficacy then they 
think the person described in the fifth (“suffering in si- 
lence”) vignette has. This result, which we never would 
have known using standard survey methods, calls into 
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question research claims about the advances in local 
elections in China, even in the limited scope to which 
such elections are intended. 

Thus, the vignettes take the same logical place as the 
candidate position questions in Aldrich and McKelvey 
1977, except that vignette questions are under control 
of the investigator and applicable to a wider range of 
substantive problems. In addition to political efficacy, 
we have written survey questions with corresponding 
vignettes for political freedom, responsiveness of the 
political system in some areas of policy, and separate 
domains of health (mobility, vision, etc.). We have 
tested subsets of these questions and our method in 
surveys we designed in 60 countries. The full battery of 
questions is now being used in the World Health Sur- 
vey, which is presently in the field in about 80 countries. 
Other similar efforts are being used or considered by 
other survey organizations in several disciplines. We 
hope this paper will make it possible to apply the idea 
in other contexts.? 


A PARAMETRIC APPROACH: MODELING 
THRESHOLDS 


As a complement to our nonparametric approach, we 
now develop a parametric statistical model. This model 
enables researchers to save resources by asking vi- 
gnettes of only a random sample (or subsample) from 
the same population as the self-assessments. For exam- 
ple, researchers could include the vignettes only on the 
pretest survey; alternatively, for each self-assessment 
on the main survey they could add, say, one additional 
item composed of four vignettes asked of one-quarter 





3 We have also tried a series of other ways of using these vignettes 
that we hoped would be even simpler, such as asking respondents 
to choose the vignette closest to thear own level on the variable in 
question, but (in part because of the difficulty respondents have re- 
membering and assessing all the vignettes at once) we have found no 
direct measurement alternatrves that do as well as the approach we 
describe here. 
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of the respondents each. For panel studies or those 
with a series of independent cross sections, researchers 
could include the vignettes on only some of the waves. 
This model avoids the inefficiencies of the nonpara- 
metric approach by recognizing that the variable being 
measured is perceived with random measurement er- 
ror and, as we show below, is modeled with a normal 
error term. We further increase efficiency by allowing 
researchers to include multiple self-assessment ques- 
tions for the same underlying concept (in a single fac- 
tor analysis-type setup). We accomplish all these tasks 
by letting the thresholds (which turn the unobserved 
continuous variable to be measured into an observed 
categorical response) vary over individuals as a func- 
tion of measured explanatory variables. 

In broad outline, our model can be thought of as 
a generalization of the commonly used ordered pro- 
bit model, where we model DIF via threshold varia- 
tion, with the vignettes providing the key identifying 
information.’ Given the importance of thresholds in 
this model-based method, we first illustrate their role 
with an alternative simplified view of DIF using a vari- 
able measured in almost every survey, age. Age also 
has an expository advantage since its perceived value 
is typically indistinguishable from the actual age. Then, 
instead of asking survey respondents for their date of 
birth (which obviously would be preferable), we imag- 
ine trying to make inferences if the survey question 
only asked whether respondents described themselves 
as (A) elderly, (B) middle-aged, (C) a young adult, or 
(D) a child. 

Figure 3 considers interpretations two individuals 
might use to map their years of age into the avail- 
able survey response categories. The age scale is bro- 
ken at the threshold values 11, t2, and 73, but the two 
individuals have different values of these quantities. 
The scale on the left with lower threshold values (and 
hence, e.g., “elderly” defined as over 40 years of age) 
is what individuals might use in a country with a low 
life expectancy; the scale on the right is probably a bet- 
ter description of a developed country like the United 
States. If we knew only the response category chosen, 
we would not know much about that person’s actual 
age since, for example, “middle-aged” could mean com- 
pletely different things to different people. Without 
knowing the threshold differences, we could easily get 
the age rankings of the countries wrong. 

If we somehow knew the threshold values, the only 
issue in understanding a persons’ age would be group- 
ing error, which is straightforward to deal with statisti- 
cally (i.e., using an “interval regression model,” which 
is an ordered probit with known thresholds). The key 
to our approach, then, is that the vignettes enable us to 


4 We have also experimented with many alternative versions, includ- 
ing models that generalize the “graded response” or “partial credit” 
frameworks more common in the psychometnics literature (Linden 
and Hambleton 1997). We find that the empincal results across the 
range of alternative models tend to be quite smular. The version 
we present here has the advantage of building on components that 
are more familiar to political scientists, but we emphasize that the 
particular parameterization chosen is less important than the idea of 
umng anchoring vignettes to measure DIF directly in some way. 
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FIGURE 3. Categorizing Years of Age 
Soe genet en De aa Ae 
90 90 Elderly 
80 80 | +73 
70 
70 Elderly Middle aged 
60 60 
T 
50 50 
40 | +73 40 Young adult 
30 Middle aged 30 
Tl 
20 Stee adult 20 
10} +71 cy 10 Child 
0 
Note. The graph portrays possible mappings from actual age 
to an Individual’s choice among the four survey cate- 
gores, possibly for individuals In low (on the left) and high (on the 
right) Ife expectancy countnes. The t's are thresholds between 
the categories. 





estimate the threshold values, and with this information 
we correct the self-assessment responses. 

Our model contains, for each respondent and sur- 
vey question (continuous and unobserved), actual and 
perceived and (ordered categorical and observed) re- 
ported levels of the variable being measured. Respon- 
dents perceive their actual levels correctly on average 
but with noise (i.e., equal to the actual levels plus ran- 
dom measurement error), but when they turn their 
perceived values into an answer to the survey ques- 
tion, different types of people use systematically 
different threshold values. Hence, actual values are un- 
observed but comparable. Perceived values are compa- 
rable only on average due to random error, and are in 
any event unobserved. Raw survey responses are ob- 
served, but they are incomparable. The following two 
parts of this section define the self-assessment and vi- 
gnette components of the model, respectively; the third 
part then provides a substantive interpretation of the 
model (Appendix A derives the likelihood function, 
and Appendix B shows how to compute quantities of 
interest from it). To help keep track of our notation, 
Figure 4 provides a graphic summary of the model and 
all its elements. 


Self-Assessment Component 


Denote the actual level of respondent i as p, (i = 
1, ..., n) on a continuous, unbounded, and unidimen- 
sional scale (with higher values indicating more free- 
dom, political efficacy, etc.). Respondent i perceives 
py, only with random (standard normal) error, as in 
ordered probit, so that for self-assessment question 


s(s =1,..., 5), 
Yt ~ M(t, 1) (1) 


represents respondent i’s unobserved perceived level. 
The actual level varies over i as a linear function of 
observed covariates X, and an independent normal 
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FIGURE 4. Parametric Model Summary 
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random effect n;, 


la = XB+, (2) 


with parameter vector £ (and no constant term, for 
identification), so that 


n ~ N(0, o”) (3) 


is modeled as independent of X,. When S = 1, we drop 
n since w is then not identified. 

We elicit a reported answer for respondent i to self- 
assessment question s with K, ordinal response cate- 
gories (higher values indicating more political efficacy, 
freedom, etc.). Thus, respondent i turns the continuous 
perceived levels Y¥ into the reported category ys via 


go 
this observation mechanism: 


Ys=k if thle ytc ct (4) 


with a vector of thresholds ty (where t? = —oo, t™® = 


u 
oo, and t$! < +, with indices for categories k= 
1,..., K, and self-assessment questions s = 1,..., S) 
that vary over the observations as a function of a vector 
of covariates, V, (which may overlap X,), and a vector 
of unknown parameter vectors, y; (with elements the 
vector y$): 


5) 
(k=2,..., K, 51) 


(c£ Groot and van den Brink 1999 and Wolfe and Firth 
2002). 


Vignette Component 


Denote the actual level for the hypothetical person de- 
scribed in vignette j as 0, (for j =1,..., J), measured 
on a continuous and unbounded scale (higher values in- 
dicating more efficacy, freedom, etc.). The assumption 
of vignette equivalence is formalized by 6, not being 
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subscripted by (and thus assumed the same for every) 
respondent. 

We index respondents in the sample of people asked 
vignettes by Z (£ = 1,..., N). (To allow vignettes to be 
asked of separate samples, i and ¢ may index differ- 
ent individuals.) Respondent £ perceives 6, only with 
random (normal) error so that 


Z, ~ N(b,,0?) © 


represents respondent ¢’s unobserved real-valued per- 
ception of the level of the variable being measured de- 
scribed in vignette j (elements of which are assumed 
independent over j conditional on 6;). (Although we 
avoid complicating the notation here, we also often let 
a? vary over vignettes, since their estimates are conve- 
nient indicators of one aspect of how well each vignette 
is understood.) 

The perception of respondent £ about the level of 
the person described in vignette j is elicited by the 
investigator via a survey question with the same Kj; 
ordinal categories as the first self-assessment question. 
Our software also allows other self-assessment ques- 
tions, each with its own corresponding set of vignettes, 
but these notational complications are unnecessary for 
present purposes, since one set of vignettes correspond- 
ing to only one self-assessment question is sufficient to 
correct multiple self-assessments. 

Thus, the respondent turns the continuous Z}, into 
a categorical answer to the survey question z; via this 
observation mechanism: 


wok if $< ek, O 
with thresholds determined by the same y coefficients 
as in (5) for y,1, and the same explanatory variables but 
with values measured for units £, Vz: 

ta =y Ve 


=] i 
TH =h tent 


(8) 
(k=2,..., Kı —1). 


Response consistency is thus formalized by » being 
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the same in both the self-assessment and the vignette 
components of the model. 


Model Interpretation 


Identification for DIF Correction. Response- 
category DIF appears in the model as threshold varia- 
tion (Ts and Te varying over respondents i and £) and 
requires at least one vignette for strong identification. 
We can see the essential role of vignettes by what hap- 
pens if we try to estimate the self-assessment compo- 
nent separately and, also, set the explanatory variables 
X affecting the actual level to be the same as those V 
affecting the thresholds. In this case, £ (the effect of X) 
and y (the effect of V) would be dubiously identified 
only from the nonlinearities in the threshold model (5). 
This generlizes the well-known result in ordered probit 
that the thresholds are not all separately identified from 
the constant term (Johnson and Albert 1999, ch. 5). 

For another view of how vignettes correct for DIF 
consider this simpler model based on an analogue to 
Aldrich and McKelvey (1977). Suppose that a single 
self-assessment response y, and two vignette responses 
z (for j =1,2) are continuous, perceptual error is 
nonexistent (i.e., the variances in Eqs. [1] and [6] are 
zero), and vignettes and self-assessments are asked of 
the same people (i = £). Then we could specify the self- 
assessment response (contrary to, butin the spirit of, the 
model) to be a linear function of the actual level with 
parameters that vary over respondents, y, = t} + T fas 
and the same for the two vignettes, zy = T} + 1/0, 
(for j = 1,2 and gı < z2). Since their values are ar- 
bitrary, we make the identifying restrictions 6 = 0 
and 6, = 1. Finally, we solve: u, = (Yı — %1)/(Z2 — %1). 
This equation shows that the actual level is equal to the 
observed y, distorted by the values on the two vignette 
questions. Clearly, without the vignettes, y, would be of 
little use in estimating ,. Our model has a variety of 
useful features not in this simple model, but the intu- 
ition is closely analogous. 


Specifying the Substantive Model. Explanatory vari- 
ables Xin the substantive model (Eq. [2]) must be cor- 
rectly specified, just as in linear regression or ordered 
probit. Conditional on the model, £ is interpreted as a 
vector of effect parameters and 4, as the actual level 
(see Appendix B for details). The added random ef- 
fect n, is a strict improvement over the standard spec- 
ification (when œ? > 0), in that it recognizes that we 
are unlikely to be able to measure and include in X 
all reasons why actual levels differ across individuals. 
The random effect can greatly improve estimation of 
the actual level yz, and, of course, makes estimates less 
sensitive to specification decisions about X (due to the 
result in the last section in Appendix B). However, it 
can only provide this added benefit for the portions of 
unmeasured explanatory variables that are unrelated 
to X (and it is only possible to use when multiple self- 
assessment questions are available). If variables omit- 
ted from and correlated with X have an effect on 4, 


we could have omitted variable bias just as in linear 
regression. 
Specifying the Measurement Model. The explanatory 
variables V that predict threshold variation in the mea- 
surement model (Eqs. [5] and [8]) must also be correctly 
specified, but according to one of two different stan- 
dards depending on the purposes for which they will be 
used. For our main goal of estimating the actual level 
p or the effect parameters on the actual level £, V only 
need include enough information so that the Y and Z 
are independent given V (i.e.,so that the product can be 
taken in the likelihood function in Eq. [12]). In fact, we 
can test this assumption nonparametrically when multi- 
ple observations are available for each unique vector of 
values of V,. The test is to check that the crosstabulation 
of the values of Y and Z for observations that fall within 
each unique strata of V are unrelated. If not, then addi- 
tional variables must be included. We can also perform 
parametric tests of this assumption by checking that 
elements of y are significantly different from zero. 
Measurement model specification decisions must 
meet higher standards if the goal is to study why differ- 
ent individuals have different thresholds. Then we must 
avoid omitted variable bias according to rules analo- 
gous to those for linear regression. The measurement 
model includes no random effect (and including one 
would be computationally complex and would make 
it impossible to ask vignettes and self-assessments of 
separate samples) and so we are not protected in the 
same way as with the substantive model from omitted 
variables unrelated to V,. 


Tests for Vignette Equivalence. Our self-assessment 
questions are all assumed to measure the same unidi- 
mensional actual level. If the concept is actually mul- 
tidimensional, then separate questions and vignettes 
should be used for each dimension. Unidimensional- 
ity is best verified via standard survey techniques of 
extensive pretesting and cognitive debriefing. Our ap- 
proach does not mean that a researcher can ignore any 
of the advice on writing good survey questions learned 
over the last half-century. We still need to be careful 
of question wording, question order, accurate transla- 
tion of the meaning of different items, sampling design, 
interview length, social background of the interviewer 
and respondent, etc.* 

Under our parametric model, researchers can test to 
a degree for vignette equivalence by checking whether 
the @ values are ordered as expected. The extent of 
ranking inconsistencies in our nonparametric model 
can also be indicative of multidimensionality, although 
care must be used in interpretation since the same 
“inconsistencies” can also result under our parametric 


5 Working m different languages and cultures is of course particularly 
difficult For example, m our research we considered asking variants 
of how healthy a person is who can run 20 km. With some question 
wordings and translations, however, some of our pretest subjects in 
sub-Saharan Africa revealed in in-depth cognitive interviews that 
they thought anyone who would even consider running that far must 
be peculiar, ff not mentally ill, and so would clearly be judged less 
healthy than someone who could only run, say, 5 km! Missing cultural 
differences like these would obviously threaten our approach. 
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model from unidimensionality and large random mea- 
surement error. The key in detecting multidimension- 
ality is searching for inconsistencies that are systemat- 
ically related to any measured variable. 


Number and Type of Vignettes Needed. The optimal 
number of vignettes to ask (or whether to ask more 
vignettes or to ask the same vignettes of more respon- 
dents), in terms of the right trade-off in bias reduction 
vs. survey costs, depends on the nature of DIF and what 
information the investigator needs. For example, in 
some of our experiments with these methods, we were 
most interested in having higher resolution in measure- 
ment near the top of the scale and so we included more 
vignettes near that end. In general, only one vignette is 
needed to identify our parametric model, but we nor- 
mally advise including more. In the nonparametric 
model, the amount of information about the actual self- 
assessments increases with 2J + 1 (the number of cate- 
gories of the nonparametric estimate, C) in the number 
of vignettes J. In both methods, the vignettes only help 
when they divide up the distribution of self-assessment 
answers and so have discriminatory power. Since the 
vignettes identify y, the perfect vignette for our model 
is one with 0 that falls between the t’s predicted by 
categories of V. For example, if V includes a country 
dummy, then the optimal vignette is one with 0 between 
the values of the thresholds of the two countries. 

When possible, we recommend asking all respon- 
dents self-assessment and vignette questions during 
the pretest and then studying how much information 
is lost by examining the stability of the y parameters 
when dropping subsets of vignettes and respondents. 
In our experience, much of the benefit of our approach 
is realized after including the first two or three vig- 
nettes if they are carefully chosen to be near the self- 
assessments, although in practice at this early stage in 
using this methodology we have typically used five to 
seven. Similarly, in the literature on scaling roll calls, the 
values of only one or two legislators are typically used 
as anchors (e.g., Clinton, Jackman, and Rivers 2002 and 
Londregan 2000). 


Weights on Self-Assessment Questions. When mul- 
tiple self-assessment questions to measure the same 
construct are available, the model estimates a single 
actual level for all the questions. Although the vari- 
ance of the perceived level is the same for each, the 
variance of the reported answers can still differ be- 
cause the model allows the thresholds to vary across 
self-assessment questions. (Letting the variance at the 
perceived level differ also would not be separately iden- 
tified or needed.) As such, under the model, questions 
with less measurement or perceptual error, and those 
that are more highly correlated with the single dimen- 
sion of the concept being measured, provide more dis- 
criminatory power and are effectively weighted more 
heavily in estimating u. Thus, the model’s reported 
level provides the equivalent of the item-discrimination 
parameter in item-response theory or factor loadings in 
scaling theory. The consequence is that the actual level, 
4, and effect parameters in the substantive model, £, 
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will be fairly robust to self-assessment questions of dif- 
fering quality, but studies of how and why thresholds 
vary over respondents will be more model-dependent. 


MONTE CARLO EVIDENCE 


Our parametric model meets all the standard regular- 
ity conditions of maximum likelihood and so estimates 
from it are consistent and asymptotically efficient. In 
this section we offer Monte Carlo evidence that demon- 
strates it has similarly desirable small sample proper- 
ties. We do this by drawing 1,000 data sets from the 
model with a fixed set of parameters and examining 
the properties of the estimates of these parameters. 
The results reported in this section are therefore con- 
ditional on the model being correct and thus do not 
address issues such as robustness to misspecification. 

We summarize the results in Table 1, which shows 
that the maximum likelihood estimates and asymp- 
totic standard errors are unbiased (i.e., within Monte 
Carlo approximation error of zero). Similarly, the 95% 
asymptotic confidence intervals seem to cover the true 
value about 95% of the time. 

We designed this Monte Carlo experiment to simu- 
late the conditions for which the method was designed 
by shifting the actual level 4, in one direction (with the 
coefficient on the country dummy in X, f2 = 1) and 
shifting the threshold value for a country in the same 
direction (so that y} = 1). When DIF like this occurs, 
the absence of an anchor means that ordered probit will 
not detect the change in either the coefficient or the 
threshold, which we demonstrate in the top two graphs 
in Figure 5. These graphs plot a density estimate (a 
smooth version of a histogram) of the estimated values 
across the 1,000 simulated data sets for both ordered 
probit and our method. As expected, ordered probit 
finds no difference in the actual levels between the 
countries because it is not able to detect the threshold 
variation. 

A similar result occurs when studying estimates of 
the actual level, 1, which we illustrate using the first 
data set drawn with our simulation algorithm. The bot- 
tom graph in Figure 5 gives the true variation in the 
actual level 2 (with variation coming from the random 
effect) for a hypothetical 65-year-old respondent and 
compares it to the posterior density computed by or- 
dered probit and our model. As with the coefficients, 





é To draw the 1,000 data sets, we follow this algorithm: (1) Set 
P, 07, w*, 0, and y to the values in Table 1, as well as n= N = 2,000, 
S=2, K, =4, and J =3, (2) Set X to a variable corresponding to a 
country dummy and age, fixed across the two countries, and V to 
a constant and the country dummy, but (for stmplicity and to save 
computational time) for r! only. (3) Draw values for n,(1=1, .., n) 
from Eq. (3), orthogonalizing with respect to a constant and X for 
efficiency, and fix it for a set of smulatons. (4) Finally, draw m data 
sets Ors and z, ) by repeating this algorithm m times: (a) Draw one y,, 
(for: =1,. ,n,s—1,..., by calculating 4, from Eq. (2), draw- 
ing Y; from Eq (1) foreachs(s=1, .,S), calculating the 1,,’8 by 
Eq. (8), and calculating ys from Y* by using Eq (4). (b) Draw one 
value of z, (forém@1, . , Nand f=1,..., J) by drawing Z, from 
Eq. (6) and turning Z, into z with Eq. (7) We set m= 100 and then 
repeated the entire algorithm 10 times. 


American Political Science Review 


Vol. 98, No. 1 


American Political Science Review S o m I mamama amuma 


TABLE 1. Monte Carlo Analysis of Point Estimates, Standard Errors, and 
Confidence Interval Coverage 


























True Moan Digg 95% 
Parameter Value Polnt Estimate SE Coverage 
6 1 0.0042 0.000059 0.95 
62 —0.25 0.0027 —0.0034 0.96 
63 —0.7 0.0021 —0.0024 0.95 
By: age —0.02 0.000023 —0.000075 0.96 
fe: country 1 0.0034 —0.000097 0.95 
In(@) —1 —0.015 —0.0014 0.96 
In(c) 0 0.00066 0.0019 0.95 
yi —1 0.001 —0.0031 i 0.96 
yi? country 1 0.0029 0.00086 0.95 
ye —0.8 —0.0000056 0.0015 0.94 
ou —0.9 0.0018 0.0011 0.94 
ya" —i.3 0.00031 —0.0045 0.97 
y®: country 1 0.0028 0.0016 0.94 
ye —1 —0.0025 —0.00042 0.96 
yg —1 —0.0003 0.00058 0.95 
Note. All estimates are given to two significant digits. 





FIGURE 5. Monte Carlo Results 




















Note The top two graphs display the sampling distribution of 
parameters across 1,000 Monte Carlo experments, In compar- 
Ison to the truth. The bottom graph compares the unconditional 








ordered probit’s inability to correct for DIF makes it 
miss most of the true density, while estimates from our 
model are on target. 


EMPIRICAL EVIDENCE 


To illustrate the difference our parametric approach 
can make compared to the most common method of 
analyzing ordinal dependent variables, ordered probit, 
we include here two very different empirical examples: 
a political variable, which is an extension of the polit- 
ical efficacy example introduced during our discussion 
of the nonparametric method above, and a policy out- 
come variable, the visual acuity dimension of health. 
Although many possible uses of our technology are 
within a single country, we choose two especially diffi- 
cult examples, each requiring comparison across a pair 
of highly diverse countries. Since ordered probit and 
our model are scaled in the same way, the results from 
the two methods are directly comparable, although if 
DIF is present, only our approach would normally be 
eomparable across cultures and people.’ 


Polltical Efficacy 


As a baseline, we compare China and Mexico by run- 
ning an ordered probit of the response to the self- 
assessment question on a dummy variable for country 
(1 for China, 0 for Mexico), controlling for years of age, 
sex (1 for female, 2 for male), and years of education. 
The results appear in the first numerical column of 
Table 2. The key result is the country dummy, which 


7 We estimate the model with d generic optimizer and, when multi- 
ple self-assessments are available, sumple one-dimensional numerical 
integration. 
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AeL 2. Comparing Political Efficacy In Mexico and China 














Ordered Probtt Our Method 

Eq. Vanable Coeff. (SE) Coeff. (SE) 
L China 0.670 (0.082) —0.364 (0.090) 
Age 0.004 (0.003) 0.006 (0.003) 
Male 0.087 (0.078) 0114 (0.081) 
Education 0.020 (0.008) 0.020 (0.008) 
T! China —1.059 (0.059) 
Age 0.002 (0.001) 
Male 0.044 (0.036) 
Education —0.001 (0.004) 
0.425 (0.147) 0.431 (0 151) 
r China —0.162 (0.071) 
Age —0.002 (0.002) 
Male —0.059 (0.051) 
Education 0.001 (0.006) 
Constant —0.320 (0.059) —0.245 (0.114) 
T China 0.345 (0.053) 
Age —0.001 (0.002) 
Male 0.044 (0.047) 
Education —0.003 (0.005) 
Constant —0.449 (0.074) —0.476 (0.105) 
tí China 0.631 (0.083) 
Age 0.004 (0.002) 
Male —0.097 (0.072) 
Education 0.027 (0.007) 
Constant —0.898 (0.119) —1.621 (0.149) 
Vignettes 4 1.284 (0.161) 
Q 1.196 (0.160) 
6, 0.845 (0.159) 
O4 0.795 (0.159) 
Os 0.621 (0.159) 
Ino —0.239 (0.042) 

Notes. Ordered probit indicates counte and probably Incorrectty that the Chinese have higher political efficacy than the Mexi 

whereas our approach reveals that this Is because the Chinese have comparatively lower standards (t's) for moving from one categoncal 





response into the next highest category The result ts that although the Chinese give higher reported levels of polttical efficacy than the 


Mexicans, It ls the Mexicans who are In fact more politically efficacious. 





is in boldface. It shows the same remarkable result 
from Figure 2: Even though we have now also included 
controls, citizens of China choose response categories 
indicating higher levels of political efficacy than do cit- 
izens of Mexico. Since the underlying political efficacy 
scale being estimated is conditionally normal with stan- 
dard deviation 1, the coefficient on the China dummy 
of 0.67 is quite large, and its standard error indicates 
that a researcher using the ordered probit model would 
conclude that they have a high degree of confidence in 
this counterintuitive conclusion. 

We now use our parametric model to analyze the 
same example. (We include the same variables in the 
mean function as in the model for threshold variation. 
Our experiments indicate that the key results on the 
differences between the countries are not sensitive to 
many changes in these specifications.) Results appear in 
the last pair of columns in Table 2. The key conclusion 
to be drawn from our model is the opposite to that of 
ordered probit: the country dummy (in the top panel 
in boldface) has now switched signs. This means that 
once DIF is corrected we can see that Mexicans do 
indeed have higher levels of political efficacy than the 
Chinese. The effect (—0.364) is reasonably large and the 
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standard error indicates considerable precision, condi- 
tional on this improved model. (Note also that the sig- 
nificant positive effect of education on this dimension of 
efficacy has not changed appreciably between the two 
models, which shows that correcting DIF only affects 
parameters related to it.) 

The other parameters clarify the reason why esti- 
mates of the actual level switched and so provides 
some additional insight into how respondents are un- 
derstanding these survey questions. To begin, note that 
the estimates of the actual values of the vignettes (at the 
bottom of Table 2) are not constrained by our model to 
be ordered, but they all turn out to be ordered in exactly 
the way we expected (as in the list above). This provides 
some evidence that the concept being measured is as 
we understood it, and thus, for example, is likely to be 
unidimensional. 

Another important feature is the country dummy 
predicting each of the thresholds (given in boldface). 
The y coefficient on the China dummy variable in 
the equation predicting t! is the threshold between 
“no say” and “a little say.” This large and significantly 
negative coefficient (—1.059) indicates that the same 
actual low level of political efficacy is considerably 
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| FIGURE 6. Threshold Variation 
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Note. This figure gives a political efficacy scale from high (up) to 
low (down). Density estimates (a smooth version of a histogram) 
of the t's are given for Maxtco (on the left) and China (on the 
right). Also given ts the distribution of Y* and the 95% confidence 
Interval for the actual values of the first and last ignettes (@; and 
æ), In gray hortzontal bars. 








more likely to be judged by Chinese respondents than 
Mexican respondents to be a little rather than no say 
in government. Another way of saying this is that the 
Chinese have lower standards for what constitutes this 
particular level of efficacy. The parameterization in 
Eqs. (5) and (8) means that the other r’s are easier to in- 
terpret graphically, which we do in Figure 6. This figure 
plots the distribution of each t across respondents, for 
Mexico (on the left) and China (on the right). All four 
of the t distributions (pointed out in the middle of the 
graph) are all shifted substantially lower for China, in- 
dicating that they have lower standards for the level of 
efficacy in every category than the Mexicans. 

Figure 6 also presents the distribution of Y*, the un- 
biased self-perceptions, in each country, which shows 
how the +’s in each country divide up these perceived 
self-assessments. (The actual values, the ;, are not 
presented, but their average value, which is also the 
average of the Y* distribution, does appear.) The figure 
also displays the 95% confidence interval for the actual 
value of @, and 65 (the first and last vignette), which are 
constant across the two countries (see the two horizon- 
tal gray bars; the others are omitted to reduce graphical 
clutter). Since the power of the vignettes comes from 
breaking up the distribution of the thresholds, we can 
use the figure to evaluate the vignettes. It shows that 
the vignettes are best for identifying the coefficients 
in the t! and t? equations in Mexico and in t? and 
t* in China. The vignettes clearly provide much more 
information than necessary to identify the difference 
between the countries; indeed, to pick up the general di- 
rection of intercountry differences, one vignette would 
be sufficient. If instead we could afford to add vignettes 
to subsequent surveys, the extreme ends of the scale 
would be the most productive place to add them. Of 
course, other data sets need not follow this particular 
pattern. 





So what is happening is that the Chinese respon- 
dents have lower actual levels of political efficacy than 
the Mexicans. But the Chinese also have even lower 
standards for what qualifies as any particular level of 
“gay in government.” The combination of these effects 
causes the Chinese to report higher levels of efficacy 
than those reported by the Mexicans. Thus, relying 
on the observed self-assessment responses for a mea- 
sure of the political efficacy differences between China 
and Mexico would seriously mislead researchers. Using 
standard techniques like ordered probit to analyze 
these numbers would also produce badly biased results. 
Our parametric and nonparametric approaches reveal 
the problem with the self-assessments and fix it by using 
vignettes as anchors to generate interpersonally and 
interculturally comparable measures. 

Although our main purpose is to design a method 
that makes it possible to correct for DIF to improve 
measurement, the reasons for these threshold differ- 
ences seem well worth studying in and of themselves. 
This could be pursued by including other variables in 
the threshold portion of the model. If some of the un- 
derlying reasons for the intercountry differences were 
found and controlled, the coefficient on the country 
dummy would likely drop. We expect that research into 
these kinds of social_psychological questions would be 
a productive path to follow. 


Visual Aculty 


We included self-assessment and vignette questions to 
measure visual acuity, a fairly concrete policy outcome 
variable, on surveys for the World Health Organiza- 
tion in China (n=9,484; completed February 2001) 
and Slovakia (n=1,183; completed December 2000). 
Half of the respondents, randomly chosen, were asked 
vignette questions. 

These surveys were useful because we were also able 
to include a “measured test” for vision—the Snellen 
Eye Chart test—for half of the respondents, randomly 
chosen. This is the familiar tumbling “E” eye chart test, 
with each row having smaller and smaller Es, and with 
respondents having to judge which direction each E 
is facing. Although this test is subject to measurement 
error, the errors should be less subject to cultural differ- 
ences and so the test should provide a relatively DIF- 
free standard for comparison. 

Our vision self-assessment question was, “In the last 
30 days, how much difficulty did you have in seeing and 
recognizing a person you know across the road (i.e., 
from a distance of about 20 meters)?” with response 
categories (A) none, (B) mild, (C) moderate, (D) se- 
vere, (E) extreme/cannot do. We also included eight 
separate vignettes, such as “[Angela] needs glasses to 
read newsprint (and to thread a needle). She can rec- 
ognize people’s faces and pick out details in pictures 
from across 10 meters quite distinctly. She has no prob- 
lems with seeing in dim light.” We then followed our 
procedure of asking almost the same question about 
the people in the vignettes and with the same response 
categories as used in the self-assessments. 
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TABLE 3. Comparing Estimates of Vision In Slovakia and China Using the Snellen Eye Chart Test 

















with Analyses of Survey Responses Using Ordered Probit and Our roach 
Snellen Eye Chart Ordered Probl Our Method 
Mean (SE) 4 (SE) 4 (SE) 
Slovakia 8.006 (0.272) 0.660 (0.127) 0.286 (0.129) 
China 10.780 (0.148) 0.673 (0.073) 0.749 (0.081) 
Difference -2.774 __(0 452) _ —0.013 (0.053) —0.463 _ (0.053) 





Note All numbers indicate the badness of vislon, but the eye chart test Is measured on a different scale than the statistical procedures. 





To save space, we give results here only for our quan- 
tities of interest (see Table 3). All numbers in the table 
are measures of how bad the respondent's vision is. The 
first column is the Snellen Eye Chart test, which is an 
estimate of the number of meters away from an object a 
person with “20/20 vision” would have to stand to have 
the same vision as the respondent at 6 m. So the larger 
the number is over six, the worse the respondent’s vi- 
sion. In part because glasses are not generally available, 
and in part due to inferior health care, the Chinese, 
as expected, have considerably worse vision than the 
Slovakians. In contrast, the ordered probit model 1s not 
able to detect a significant difference between the coun- 
tries at all. The Slovakians have higher standards for 
their own vision, which translates into higher threshold 
values and hence more reported values in the worse 
vision categories, 

In contrast to the implausible and apparently incor- 
rect ordered probit results, our approach seems to cor- 
rect appropriately, producing an answer in the same 
direction as the measured test. The scale of the our 
parametric model (and ordered probit) results is not the 
same as the eye chart test, but we find that the Chinese 
have substantially worse vision than the Slovakians 
(0.463 on a standard normal scale with a small standard 
error), as in the measured test. 

Measured tests provide a useful standard of com- 
parison here for judging the relative performance of 
ordered probit and our model. They would also be a 
general solution to the problem of DIF if they could 
always be used accurately in place of survey questions. 
Unfortunately, administering these tests is far more ex- 
pensive, and maintaining quality control is much more 
difficult, than for traditional survey questions. Part of 
the problem is that interviewers are trained in solic- 
iting attitudes, not conducting medical tests. But even 
when highly trained medical personnel are used, the 
difficulties of conducting these tests in extremely di- 
verse environments can generate substantial measure- 
ment error. In some preliminary tests we have con- 
ducted of different types of measured tests for other 
policy outcomes, we have found that the error in 
some versions of these tests swamps the error that 
results even from unadjusted self-assessments. Al- 
though carefully administered measured tests can pro- 
vide us with a clear gold standard to evaluate our 
methodology for some constructs, they are infeasible 
for most concepts survey researchers measure, such 
as freedom, political efficacy, and partisan identifica- 
tion. 





CONCLUDING REMARKS AND EXTENSIONS 


The approach offered here would seem to be applica- 
ble to measuring a wide range of concepts routinely 
appearing in survey research. These include concepts 
like partisan identification, ideology, tolerance, polit- 
ical efficacy, happiness, life satisfaction, postmaterial- 
ism, health, cognitive attributes, attitudes, and Likert 
scale items measuring most attitudes, preferences, and 
perceptions. We do not know which of the presently 
used survey questions have bias due to DIF and would 
thus benefit from our corrections, but without some 
approach to verifying that survey responses are indeed 
interpersonally comparable, the vast majority of sur- 
vey research remains vulnerable to this long-standing 
criticism. 

We have found our survey instrumentation and sta- 
tistical methods useful even when DIF is not present, as 
they tend to make our survey measurements far more 
concrete. They also often lead us to discover, clarify, 
and define additional dimensions of complicated con- 
cepts, and they may ultimately help develop clearer 
concepts. 

Vignettes could be used with a modification of our 
model for survey responses that are closer to contin- 
uous, such as income, wealth, and prices. Indeed, our 
general approach might also be used to improve non- 
survey measures like the Consumer Price Index, which 
is derived from overlapping market baskets of goods 
from different historical periods. A similar approach 
could be used to create comparable measures of income 
or exchange rates over time or across cultures where the 
market baskets of goods chosen would also change. In 
these applications, instead of trying to identify some- 
thing New Yorkers and Ethiopians both routinely buy, 
we could use DVD players for the former and goats 
for the latter. That is, each anchor could be designed to 
span only a few years or countries, so long as the entire 
set of observations were linked at least pairwise since it 
would then be correctable in a chain by many anchors 
analyzed together. 

Ideally, our basic theoretical concepts would be suf- 
ficiently well developed so that neither vignettes nor 
a statistical model would be necessary. Perhaps even- 
tually we will improve our concepts and learn how 
to design survey questions that apply across cultures 
without risk of bias from DIF. Until then, we think 
that survey researchers should recognize that some 
approach, such as the one we introduce here, will be 
necessary. Anchors designed by the investigator, such 
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as with vignettes, do not solve all the problems, but they 
should have the potential to reduce bias, increase effi- 
ciency, and make measurements closer to interperson- 
ally comparable than existing methods. Moreover, re- 
searchers who are confident that their survey questions 
are already clearly conceptualized, are well measured, 
and have no DIF now have the first real opportunity 
to verify empirically these normally implicit but highly 
consequential assumptions. 


APPENDIX A: THE JOINT LIKELIHOOD 
FUNCTION 


If the random effect term n, were observed, the likelihood 
for observation i, for the self-assessment component, would 
take the form of an ordered probit with varying thresholds: 
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where I(y,. = k) is one if Ya = k and zero otherwise, and F is 
the normal CDF and where y, = {ys;5=1,..., S}. However, 
since 7, is unknown, the likelihood for the self-assessment 
component requires averaging over 7, in addition to taking 
the product over i: 
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In the special case where S = 1, this simplifies to 
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which is possible by writing out the definition of the normal 
CDF, invoking Fubini’s theorem, and solving. Equation (11) 
is also useful because it clearly shows that the variance of the 
perceived value of the vignette’s level (which is set to one in 
the model) and œ? would not be separately identified if this 
component were estimated alone. If S > 1, we evaluate (10) 
with one-dimensional numerical integration. 

The likelihood for the vignette component 1s a J-variate 
ordered probit with varying thresholds: 
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where the product terms are over observations, vig- 
nettes, and survey response categories, respectively. The like- 
lihoods from the two components share the parameter vector 
y and so should be estimated together. The complete likeli- 
hood 1s 


L(B, 07, 0, 6, y 1y, 2) = LB, 07, y 1y) L0, n, 0°12). 2) 


APPENDIX B: COMPUTING QUANTITIES OF 
INTEREST 


Several quantities are of interest from this model, which we 
describe here, along with computational algorithms. 


Effect Parameters 


The effect parameters £ that indicate how actual levels {n 
depend on X, can be interpreted as one would a linear re- 
gression of Y7 on X, with a standard error of the regression 
of one, just as in ordered probit. For example, if Xj is a 
researcher’s key causal variable, and the model is correctly 
specified, then f; is the causal effect—the increase in actual 
levels of freedom, or political efficacy, etc., when X increases 
by one unit. (Although we have scaled our model so that it 
is directly comparable to ordered probit, in applications we 
often scale u [and £] relative to the most and least extreme 
vignettes, so that the results will be simpler to interpret.) 
The other set of effect parameters y show how the thresh- 
olds t depend on explanatory variables V. They indicate how 
norms and expectations differ by cultures and types of people. 


Actual Levels, without a Self-Assessment 
Response 


Suppose that we are interested in the actual level for a (possi- 
bly hypothetical) person described by his or her values of the 
explanatory variables, which we denote X.. Since we have no 
direct information with which to distinguish this person from 
anyone else with the same X,, the posterior density of pe is 
similar to that in linear regression, 


P(e ly, 2) = N(He | X, XVX +67), 03) 


where we are using the asymptotic normal approximation 
to the posterior density of £ (with mean, the MLE £, and 
variance matrix V[A]) and are conditioning on the MLE of 
the random effect variance, *, and the full set of data y 
(although we do not observe y,). Sampling from the exact 
postenors of £ and w would be a theoretical improvement, 
but our Monte Carlos so far indicate that these complications 
are unnecessary. 

We can compute quantities of interest from this posterior 
density analytically or via simulation. For example, the actual 
level for a person with characteristics X, is E(u, |X.) = XB, 
and the point estimate is X. 8. Since the thresholds adjust from 
person to person on the basis of how they respond differently 
to the same questions, estimates of pe for any two people are 
directly comparable (conditional on the model). 


Actual Levels, with a Self-Assessment 
Response 


We could use the algorithm in the previous section for peo- 
ple we have asked a self-assessment question, but such a 
procedure would be inefficient, as well as more sensitive to 
model misspecification than necessary. Their properties are 
also highly dependent on the correct specification. Thus, when 
we have self-assessment information y, for person c, we shall 
estimate P(u, | y, z, ye) rather than P(x, | y, z) (following a 
strategy analogous to that of Gelman and King [1994] and 
King [1997}). 

To see the advantage of this strategy, suppose that we are 
trying to measure the actual levels of Respondents 1 and 2, 
who have the same explanatory variable values, X; = %2. By 
the unconditional method, these individuals will also have 
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the same posterior density, P(u; | y, z) = P(n | y, z). If they 
also have the same values of their explanatory variables on 
the thresholds, V; = V4, and hence the same threshold values, 
they will have the same posterior distribution of probabilities 
across each of their K survey responses. But suppose also that 
Respondent 1 has chosen the self-assessment category yı with 
the highest posterior probability, but Respondent 2 chose the 
ya with the lowest posterior probability. In this situation, it 
would make sense to adjust the predictions of Respondent 2 
(but not Respondent 1) in the direction of the observed value 
J, since we have this extra bit of information with which to 
distinguish the two cases. In other words, the observed yy 
looks like enough of an outlier to cause us to think that this 
person might not act like others with the same description 
and so should have an adjusted prediction for p that differs 
from the others. (We would not wish to adjust the prediction 
all the way to y, because of interpersonal incomparability and 
higher variance of this realized value; i.e., there is an advan- 
tage to borrowing strength from all the other observations 
that are used in the predicted value.) If we had covariates 
with a very high discriminatory power (i.¢., if œ? is small), 
very little adjustment would be n , whereas if our co- 
variates did not predict well (i.e, when om is large), we would 
adjust more. This, of course, is classic Bayesian shrinkage, but 
instead of shrinking the observed value toward a global mean, 
we shrink toward the common interpersonally comparable 
adjusted value our model assigns to all people with the same 
values of X, u2. 

To calculate P(uely, Z, Yc), we start with P(u,|y, z) 
from Eq. (13), and use Bayes theorem to condition on 
Ye also, P(e | y, z Ye) x Plc | Mer YZ) P(e | y, z), where 
P(ye | Me, y, z) is Eq. (9) integrated over t (and which we 
approximate by replacing t in [9] with its MLE). Thus, 
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which we could summarize with a histogram or a point esti- 
mate (such as a mean) and a (Bayesian) confidence interval.® 





8 We draw the univariate ue by discretization, with the inverse CDF 
method applied to trapezoidal approximations within each discrete 
area, which we find to be fast and accurate. If self-assessments and 
vignettes are asked of the same people, we can improve estimates 
even further by conditioning on both ye and z: 
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where we assume before conditioning on ye and ze that 6 and y 
are independent (which ıs closely approxmated emprrically), and 
we set 0, œw, and o (which are constant over c) at their MLEs This 
univariate density can be constructed by using the integral, which can 
be evaluated by averaging the expression for different simulations of 
yY, to scale the last normal at each of a grid of values on e. The 
uncertainty m 6, œ, and o can also be added here by drawing them 
from their posteriors during the simulation of the integral. 
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Campbell Craig 


This engrossing story of how the 
three chlaf architects of a powerful 


ideology struggled with the Implica- 


tions of their own creation offers 
crucial context for contemporary 


debates about the resort to war and 


weapons of mass destruction 


“A fascinating Intellectual history of 


three founders of Amencan 
Realism.” —Robert Jervis, 
Columbia University 


216 pages © $34 50 cloth 


Inventing Iraq 
The Failure of Nation Bullding and 
a History Denled 


Toby Dodge 


Offering a penetrating history of the 


formation of modem Irag, Dodge 
uncovers numerous troubilng 
parallels between the policies of a 


declining British empire and those of 


the current American goverment, 
which together form a timely and 
trenchant cautionary tale. 


“For Dodge, the Americans running 


things In Baghdad have learned 


ittie from the British experiance In 
iraq. This book ought to be required 


reading for them.” 
—WMike Schuster, MPR, 
“AH Things Considered” 
288 pages © $24 95 cioth 


~ New from East European Monographs 


Ceausescu 
From the End to the Beginnings 


Pavel Campeanu 


Based largely on hitherto unavailable 
documents, the book focuses on the 


escendance of Nicolas Ceausescu 
from a mere member of the 
Romanian Communist Party to that 
of leader of the monstrous Party 
and State apparatus that collapsed 
m 1989. 

320 pages © $62 50 cloth 


Nuclear North Korea 
A Debate on 

Engagement Strategies 

Victor D. Cha and 

David C Kang 


“Nuclear North Korea, written in a 


smooth and lucid style, Is the most 
thoughtful and analytical treatment 


of practical strategies for dealing 
with North Korea that exdsts In 
print."——Ashton Carter, 

Harvard Unhversity 

280 pages © 12 figures © $24.95 cloth 


After the Emplre 
The Breakdown of the 
Amerkan Order 


Emmanuel Todd 
Translated by C Jor Delogu 
Foreword by Michael Lied 
“A powerful antidote to hysterical 
exaggeration of American power 
and potential by American 
trtumphallsts and ant-Amertcan 
polemicist alike. A best-seller In 


Europe, Todd's book should be read 
by all thoughtful Americans for Its 


provocative and well-informed 
analysis of thelr nation and tts 


prospects.” —from the foreword by 


Michael Lind 
192 pages © $29 95 cloth 


EUROPEAN FERAPECTIONS, LAMHEMCE D KFU ZkbAd, EDITOR 


Ethnic Cleansing 
In Twentieth 


Century Europe 


Edited by Steven Bela Vardy 


and T. Hunt Tooley 

The result of a conference held at 
Duquesne University in November 
2000 which brought together shty 


scholars, as well as survivors of ath- 


New from Columbia 


The Financiers of 
Congressional 
Elections 

Investors, Ideologues, 

and Intimates 


Peter L. Francia, Paul S. 
Hermson, John C Green, 
Lynda W. Powell and 
Clyde Wilcox 


individual donors play a critical role 
In financing congressional elections, 
accounting for more than half of all 
money ralsed In House campaigns. 
But significant donors (defined here 
as those contrfouting more than 

$200) are the least understood 

participants In the system. This book 
focuses on those donors. 


“An Important book, well 
researched and crafted, about 
some very Important people.” 
—NMichael J. Malbin, Unversity at 
Albany, SUNY 

216 pages © $22 50 paper ° $59 60 coth 


PORER, COMFLICT, AND DEMOCRACY AMERICAN POLITICS 
INTO THE 21ST CENTURY 


Contemporary 
Portugal 

Politics, Society, and Culture 
Edited by Antonio 

Costa Pinto 

An Introduction to the evolution of 
Portuguese politics, soctety and cul- 
ture In the twentieth century. 
Eminent historians, political sclentists 
and experts In literature and art, 
explore a wide spectrum of topics. 


Be 
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Marilyn W. Thompson THE KILLER STRAIN 


Anthrax and a Government Exposed 
Marilyn W. Thompson 


; “A dramatic, chilling account of ‘when the anthrax 
Al NTHRAX : hit the fan’ in Flordia, New York City, and 
Washington, D.C. by one of the country’s best 
EXPOSED i investigative reporters.” 
' —Benjamin C. Bradlee, 
former Executive Editor, Washington Post 


and a Government 


ISBN 0-06-052279-8 © paperback * $13.95 ($21.95 Can.) ° 272 pages 





TEA vol nf 


AMERICA THE VULNERABLE A yi F R | C re 
How the U.S. Has Failed . 

to Secure the Homeland and Protect 

Its People from Terrorism 





Stephen Flynn 


An alarming and authoritative view of what America 
still needs to do to provide security in the face of new 
terrorist attacks. In a new world of heightened risk and 
fear, America the Vulnerable delivers a timely, forceful 
message that cannot be ignored. 


ISBN 0-06-057128-4 © hardcover © $25 95 ($39.95 Can.) © 288 pages 
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NEEDS TO KNOW 


WHAT’S RIGHT WITH ISLAM 
A New Vision for Muslims and the West 


Imam Feisal Abdul Rauf 


With an introduction by Karen Armstrong, What’s 
Right With Islam offers straightforward answers for 
today’s toughest questions about Islam, and a founda- 
tion and plan for reconciliation between Islam and the 
West. 





ISBN 0-06-058272-3 © hardcover * $21.95 ($33.95 Can.) * 240 pages 





BUYING 


THE BUYING OF THE PRESIDENT 2004 OF THE 


Who’s Really Bankrolling Bush and His Democratic PRESID ENT 
Challengers—and What They Expect in Return 


Charles Lewis & The Center for Public Integrity 


An expose of how much money each presidential candidate 
has, where they got it, and the strings that come attached, 
this is the authoritative guide to the big-money interests 
behind current presidential candidates. 

"Required reading." —Portland Oregonian 


ISBN 0-06-054853-3 © paperback * $14.95 ($22.95 Can.) © 528 pages AN | 
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EXILE PATRIOTISM AND 
TuE BALHAR wans 





Homeland Calling 

Exile Patriotism and the Balkan Wars 
Paul Hockenos 

“There ıs simply no way to understand 
how the different wars in the Balkans 
erupted dunng the 1990s without 
knowing the role played by the 
different exile and diaspora 
communttles Paul Hockenos has done 
truly pioneering work In descnbing this 
crucial aspect of the Balkans issues” 
—Carl Bildt, former UN Special Envoy for the 
Balkans and former High Representative In 
Bosnia and Herzegovina. $2795 


The Illusions of Egalttarlanism 
John Kekes 

“The Illusions of Egalrtartanism ìs an 
excellent and much-needed critidsm 
of the egalitananism that dominates 
contemporary Anglo-American ethics 
and political philosophy. John Kekes 

ıs devastating in showing how 
egalrtanan thinkers attempt to impose 
an inappropriately theoretical template 
on the real world which highlights Just 
one moral consideration above all 
others and ıs highly intolerant of 
diversity or world pluralism” 
—Anthony O’Hear, Honorary Director of 

the Royal Institute of Philosophy. $29 95 


Pivotal Deterrence 

Third-Party Statecraft and 

the Pursurt of Peace 

Timothy W. Crawford 

“In this pathbreaking book, Timothy 
W. Crawford substantially advances 
our understanding of the role that 
third parties can play In averting war 
between rivals. Prvotal Deterrence is 
elegant in theoretical construction, 
rich tn empirical narrative, and wise 
in guidance to the policy communtty.” 
—Charles A. Kupchan, author of The End of 
the American Era. Come STupes N Sacunmy 
Aras 4 illustrations. $39 95 
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Final Solutions 


Mass Killing and Genocide 

in the 20th Century 

Benjamin A. Valentino 

“Final Solutions deftly focuses on the 
responsibllity of the strategic calcula- 
tions of polttical leaders for mass killing 
in the twentieth century. By examining 
a wide selection of cases, including a 
number in which large-scale massacres 
and genocide did not occur, Benjamin A. 
Valentino ıs able to test his Ideas about 
the prevention of one of humantty’s most 
dire problems "—Norman M. Nalmark, 
author of Fires of Hatred: Ethnic Cheansing 


in Twentieth-Century Europe. Come STupies 
IN Secumy Anus $29 95 


Unwanted Company 

Foreign Investment in American Industries 
Jonathan Crystal 

“Jonathan Crystal's Innovative and 
interesting book explores the polrtical 
underpinnings of market openness 

In advanced Industnal economies. tt 
carefully elucidates the way firms in 
Important American industnes calculate 
the costs and benefits of competing 
strategles for dealing with incoming 
forelgn direct investment ”"—1ouls W. Pauly, 
author of Who Elected the Bankers?: Surveillance 
end Control in the World Economy. $35 00 


The Transformation of Central Asla 


States and Societies from Soviet Rule 

to Independence 

Edited by Pauline Jones Luong 

“In this superb collection of essays 
eight scholars bring new data and fresh 
insights to key questions of the relation- 
ship of state and society In post-Soviet 
Central Asla that require us to rethink 
conventional understandings of the 
region.”—Ronald Suny, Untverstty of Chicago. 
3 illustrations, 1 map $49 95 doth, $22 95 paper 





Whole World on Fire 


Organizations, Knowledge, 

and Nuclear Weapons Devastation 
Lynn Eden 

“Whole World on Fire is a thoughtful 
examination of the effect of organiza- 
tons on the science and technology of 
determining nuclear devastation Lynn 
Eden's book ıs thoroughly researched 
and well written.”—Dr. WHllam J. Perry, 
19th US Secretary of Defensa. Cornea. Sruoes N 
Secunsy Aras 18 illustrations, 6 maps $3250 








NEW IN PAPER 


Corporate Warriors 


The Rise of the Prrvatized 

Milttary Industry | - 

P. W. Singer 

“[P]rovides a sweeping survey of 
the work of MPRI, Alrscan, Dyncorp, 
Brown & Root, and scores of other 
firms that can variously put troops in 
the field, bulld and run milttary bases, 
train guerrilla forces, conduct alr 
surveillance, mount coups, stave 
off coups, and put back together 
the countnes that wars have just 
destroyed.”— The Atharetic Momthiy. 
Comment STUDES IN SECURITY AFFARS 

3 Illustratons,1 map $19 95 


Whose Detrolt? 


Polrtics, Labor, and Race In 

a Modern Amencan Crty 

Heather Ann Thompson 
“Thompson. .. uses Detroit In the 
1960s and earty 19705 to consider how 
the battles for civil and workers nghts 
have shaped Amencan dtes. [T]here’s 
plenty here for readers eager to think 
deeply about our hometown’s 
challenges "—Detrolt Free Press. 

20 illustrations $19 95 
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Justice, Legitimacy, and Self- 
Determination 
Moral Foundations for International Law Gi 


ALLEN BUCHANAN 

This book advances vigorous l 
criticisms of the central dogmas ae 

of international relations and sacra mained 
international law, arguing that Moral Peadar far 
the international legal system — 


should make justice, not simply 

peace among states, a primary goal, and rejects the 
view that it is permissible for a state to conduct its 
foreign policies exclusively according to what is in 
the “national interest.” 

(Oxford Political Theory) 


Working Together l 
How Workplace Bonds Strengthen a 
Diverse Democracy 

“To reverse Americas growing 
social fragmentation, especially 
in the context of increasing 
diversity, will require that we 
explore and exploit the civic- 
potential of the workplace. . 
Cynthia Estlund’s book admirably opens that 
debate and should be on the ‘must read’ list of any- 
one concerned to revitalize American democracy.” 
—Robert D. Putnam, author of Bowling Alone 
2003 $29.95 


"© Social Bases of Institutional Change ` 


The New Imperialism 

DAVID HARVEY 

In this closely argued and clearly 
written book, David Harvey, one 
of the leading social theorists of 
his generation, builds a concep- 
tual framework to expose the 
underlying forces at work behind 
the momentous shifts in US 
policies and politics. The com- 
pulsions behind the projection of US power on the 
world as a “new imperialism” are here, for the first 
time, laid bare for all to see. 

(Clarendon Lacteres in Geography and Environmental Studies) 

2003 $2200 
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Voices from the Edge 

Narratives about the Americans with 
Disabilities Act 

Edited by RUTH O'BRIEN 
Giving voice to the many types of 
discrimination the disabled 
face—at a small Southern 
College, in the Library of 
Congress, on a New York City 
sidewalk—while illustrating the 

personal stakes underlying legal disputes over the 
Americans with Disabilities this collection offers 
unparalleled insight into the lives behind the law. 
2004 paper $16.95 cloth $55 00 





How Congress Evolves 


NELSON W. POLSBY 
“Combining historical breadth 
and intimate detail, Nelson Polsby pben 
explains how and why the U.S. 
Congress came to change both its 
ideological makeup and its inter- 
nal decision-making processes at 
mid century. An important book 
by a gifted storyteller, How Congress Evolves is a trea- 
tise in its sweep and a memoir in its depth. No other 
political scientist could have delivered this combina- 
tion of strengths.” —Richard Fenno, author of 
Home Style: House Members in their Districts 

2003 $2995 


Islamic Political Identity in 
Turkey 

M. HAKAN YAVUZ 

“In this important book, Hakan 
Yavuz explains how Islamic iden- 
tity came to occupy a central 
place in modern Turkey. The first 
hand observations and interviews 
community leaders give this 
book an original and engaging quality that is sure to 
make it an indispensable source for understanding 
modern Turkey.” —Resat Kasaba, author of The 
Ottoman Empire and the World Economy 

(Relignen and Global Politics) ; 

2003 $49.95 
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Prices are subject to change and apply only In the US To order, or for more 
Informatlon, please call 1-800-451-7556. In Canada, call 1-800-387-8020. 
Visit our website at www.oup.com/us 


ASSISTANT DEAN, SPEA IN 


The School of Public and Environmental pla Seta at Indiana University Kokomo 1s seeking ap 
Il Professor. Applicants must have thelr Ph.D. 

research, and service. Of particular Interest are candidates with a primary substantive bac 

Criminal Justice, Political Science, or a closely related discipline) and a secondary interest in Public Affalrs.' The 
onentation, and a strong Interest in community 
at IU Kokomo, and will also assume certain res 


Dean with tenured facu 
established record of te 
Criminal Justice ( 


rank of Associate 
l 


Ideal candidate will also possess a student 
candidate will administer all aspects of 
SPEA System. 


SPEA ıs a multtdisciplinary, untvers de dmsion of Indiana Univers 
campuses. Organized as a profession 


issues of environmental science, public and environmental policy and man 


plicants for a postion of Assistant 
(a J.D. will not suffice), ae ee 
nd in 


networking. The successful 
ponslbilrtes wrthin' the broader 


with over 100 faculty (tenured or tenure track) on sk 
school, SPEA 1s committed to excellence in teaching, research, and service, addressing critcal 
agement. - 


One of elght campuses of Indiana Unrverstty, Indiana University Kokomo Is a comprehensive, nomresidental campus located 50 miles 


north of Indiana 


lis In Kokomo, Indiana, a city with a Population of about 47,000. Committed to student success, the campus serves 


i students from an 11-county area In north central Indiana. An addrtonal 350 students purs 


ty 
technology In cooperation with Purdue University. IUK has approximately 210 
Associate, baccalaureate, and master's degrees are offered. Additonal Info 
accessed at http://www.iuk.edu. 


has approximately 160 majors. 
Review of applications began January 15, 2004 and will continue until the position Is filled. Please submit a letter of applicaton, 


current 
of Public and Ennronmental Affairs, Indiana University Kokomo, 2300 South Washi 
Opportunity/Affirmattve Action Employer. Persons with disabilities that need 

application process should contact the Office 
Disablirbes Act (ADA) may be required. ; 


The Globalization of Human Rights: 
` The United Nations In the Twenty-first 


Century ' 
The Intemational community has embarked on 
an unprecedented effort to map out normative 


guidelines for global Justice. The core premisa 
of this effort Is to reach a more ethical 


til 
GLOBALIZATION 
al 


INMAN 
RIGHIS 


E.03 IIL_A4 
9280810804 
220pp. 
$26.95 


UNITED NATIONS PUBLICATIONS - ent of Public Information 
Sales and Ma Section, Room DC2-0853, A511, New York, NY. 10017 
Tel. (800) (212) 963-8302; Fax (212) 963-3489 - E-Mail: 





vitae, 'and names, address, e-mall, and telephone numbers of frve references to: Dr. Allen F. Anderson, Ass 





Dean, School 


n, Kokomo, IN 46904-9003. IUK ıs an Equal 
cations or adjustments to partclpate In this 
Affirmative Action at (765) 455-9529. Vertfcation of a disability under Americans with 





Regional Peacekeepers: The Paradox of 


Russian 

This publication Investigates Russian multary 
presence In its former Soviet tertttory, to 
determine whether these forces have been 
genuinely peacekeeping, or are In fact a post- 
imperial presence that seeks to maintaln former 
strategic Interests. The book Includes first 
hand accounts of the Commonwealth of 
independent States (CIS) peecekseping efforts 
In South Ossetia, Abkhazia, Moldova, and 
Taffkistan. Thelr efforts are assessed, as aro 
those of Russian 
peacekeepers and 
NATO forces. The 
sttuation In Chechnya 
Is also examined, 


E.08.111.A.3 


Visit us at weww.urLorg/publications 
Visa, MC and AMEX accepted. 
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Of Paradise and Power 
America and Europe in the New World Order 
by Robert Kagan 
New Expanded Edition in Paperback 
*[Kagan] writes elegantly, has an excellent command of history 
and consistently demonstrates superior intelligence and 


inught. He ranks. . . among analysts whose work must be 
read.”—Warren I. Cohen, Los Angeles Times 


Vintage | Paper | 176 pages | $11.00 


Eugene McCarthy 
The Rise and Fall of Postwar American Liberalism 
by Dominic Sandbrook 


"With each passing year Eugene McCarthy's remarkable 
career seems to grow ın importance. Dominic Sandbrook’s 
subtle, intelligent portrait explains why. It gives us Gene 
McCarthy in all his enigmatic brilhance, but it also shows 
us a man we hardly knew—boldly pragmatic, highly ambitious, 
a leader as well as a maverick.”—Sam Tanenhaus, author of 
Whittaker Chambers 


Knopf | Cloth | 416 pages | $25.95 


Responsibility and Judgment 
by Hannah Arendt 
edited by Jerome Kohn 


Responsibility and Judgment gathers together unpublished writings 


from the last decade of Arendt’s hfe, as she ed to 
explicate the meaning of Eichmann mn Jerusalem. “. . . [A] superb 
introduction to her views and a chance to probe, without 
hearsay or slander, one of the great thinkers of our time.” 
—Susan Neiman, author of Eal m Modern Thought 


Schocken | Cloth | 336 pages | $25.00 


The Origins of Totalitarianism 
by Hannah Arendt 
Recognized upon publication as the comprehensive account 
of its subject, this first and most enduring investigation of one 


of the 20th century's most crucial phenomena continues to 
be the definitive history of this political movement. 


Schocken | Cloth | 688 pages | $35.00 


For desk and examination copies: 


The Anatomy of Fascism 

by Robert O. Paxton 
A groundbreaking, authoritative work—the culmination of 
a lifetime of study—that will have a profound impact on our 
understanding of 20th-century history. This unprecedented 
analyns of fascism discusses the forms it has taken and the 
reasons for its stronghold in certain countries and not 
ın others. 


Knopf ! Cloth | 336 pages | $26.00 


Europe's Last Summer 
Who Started the Great War in 19147 
by David Fromkin 

In a gripping narrative, based on the latest scholarship, 
Fromkin argues that hostilites were started deliberately and 
reveals how and why diplomacy was destined to fail. Shedding 
light on such current issues as preventative war and how to 
combat terrorism, he provides detailed descriptions of the 
negotiations and incisive portraits of the key diplomats, 
generals, and rulers. 


Knopf | Cloth | 368 pages | $26.95 


Centennial Crisis 
The Disputed Election of 1876 
This compelling history of the 1876 presidential election 
gives an insider’s look at the maneuvering, corruption, and 
disputed outcome of the great political drama of the 19th 


century, as the conflicts and ramifications played out for 
nearly frre months. 


Knopf | Cloth | 288 pages | $26.00 


Imperial America 
The Bush Assault on the World Order 
by John Newhouse 
Woven through with illuminating anecdotes and vivid 
portraits of the players, Imperial Amanca is a brilliantly clear, 


timely, and powerfully thought-provoking exposé of recent 
American foreign policy: how it has been made and 
perilously mishandled. 


Knopf | Cloth | 208 pages | $23.00 


Visit: www.randomhouse.com/academic 


Emall: acmart@randomhouse.com 


Write: Knopf Academic Marketing, 1745 Broadway, 20th Floor, New York, NY 10019 
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How We Judge anigtica’s Presidents 
JAMES P. PFIFFNER 


THE CHARACTER FACTOR 


How We Judge America’s Presidents 
JAMES P. PFIFFNER 


To most Americans, the president's character matters deeply. In this volume, Pfiffner 
leads us through a survey of three aspects of presidential character that have proved 
problematic for recent chief executives: lies, promise-keeping, and sexual probity. 
His goal is not to tell us which presidents have been “good” or “bad,” but rather, to 
help us think critically about complex character issues. *. . . a splendid analysis of 
the vital—and sometimes perplexing—role that character plays in the performance 
of presidents.”—David M. Abshire, Center for the 

Study of the Presidency. 


$40.00 cloth; $16.95 paper 
POWER- 


POWER AND PRUDENCE 


ne Psting areoge sn son MARTE 


Ryan J. BARILLEAUK AND Marx J. ROZELL 


When George H. W. Bush left office in 1993, the American electorate resoundingly 
proclaimed his administration a failure. Here, Barilleaux and Rozell challenge this 
conventional wisdom and present a reinterpretation of the leadership of a poorly 
understood president. They suggest that the circumstances of Bush’s presidency 
may have limited his opportunities'to achieve far-reaching policy objectives. “. . . 
a matrix for future researchers.”—Herbert S. Parmet, City University of New York. 


$35.00 
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REAL DEMOCRACY 


ates 


NEW FRO 


The New England Town Meeting and How It Works 


Frank M. Bryan 


“Real Demecrecy is a magnificent analysis of the New England town meenung. It should be read again and 
again over the next hundred (indeed, the next thousand) years, Whether existing town mectings flourish : 
and provide a model for the rest of the world or whether they lose their power and begin to wither, this 
book will provide the scholarly worlds only besis for their systematic comparison ın the furure.” 


—Jane Mansbridge, Harvard University 


Asumican PouTics Aue POUTICAL Economy Sains 
Papar $19.00 





POLITICS 


REDESIGNING THE MEDICARE CONTRACT 
Politics, Markets, and Agency 

Edward F. Lawlor 

reveals the fundamental ambiguity at the core of Medicare's 
about what Medicare contracts ought to look like ” 
—Adam Atherly, Emory University 

Cora $39.00 


RECONSIDERING ROOSEVELT ON RACE 


How the Presidency Paved the Road to Brown 
Kevin J. McMahon 

tionship that existed between African Americans and the 
Democranc party of the 1930s-1950s, McMahon shows how 
Roosevelt came to be a avil rights reformer and explains why 
he decided w side with civil rights advocates egainst the pow- 
erful southern wing of his own party.”—Cornell Clayton, 
Parar $20.00 


TALKING ABOUT POLITICS 

Informal Groups and Social Identity In American Life 
Katherine Cramer Walsh 

“Walsh's arguments are highly original, well-researched, and 
creatively constructed. This is a fascinanng sudy of the dynam- 
ic role that social identity plays in political understanding.” 
—Ann Crigler, author of The Prychelogy of Political 
Communication 

STUDIES in COME CATION, BLIDIA, AXD PUBLIC Orion 


_ Parte $19.00 


PROJECTIONS OF POWER 

Framing News, Public Opinion, and U.S. Forelgn Pollcy 
Robert M. Entman 

"Projections of Power is « methodologically sensitrve and often 
original attempt to disentangle the causal relations among 
White House actions, news media coverage, and public opin- 
son. A timely book and a book for our umes.” 

—Herbert J. Gans, author of Demecnecy and the News 
Stupers in COMUNICATION, MDL, AXD PUBLIC OPTION 

Parer $16.00 
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Easy to learn yet programmable for the most _ sot 
demanding requirements dates. 
“ match merge acca 
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PE. E ROC. curves | | Customizable with 
i ae l ___.. regression fits < pübiication-qualtty results 
z= : iv > combined graphs 
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nested logit 
.. much more 


Stata 8 Is your complete system for managing, graphing, and analyzing data. It Is easy to learn through the extensive 
graphical Interface yet Is completely programmable for the most demanding data management and statistical requirements. 
Stata release 8 continues to bulld on Stata’s hallmark breadth of statistical methods and depth In data analysts. 


STATISTICAL SOFTWARE FOR PROFESSIONALS || 800-STATAPC || WWW.STATA.COM 
Stata Corporation ° 4005 Lakewey Drive • College Station, TX 77845 ° 800-782-8272 e 979-886-4600 ° FAX: 979-006-4801 © Info@stata.com 
Stata is a registered trademark of Stata Corporation, College Station, Texas U.S.A. 





ULITICAL PERSPECTIVES 


Outspoken 


Free Speech Stories 
Nan Levinson 


“Outspoken is heart- 
warming and inspir- 
ing, as it reminds us 
that there are people 
who refuse to be 
cowed into obedi- 
ence, who maintain 
their resistance to 
war and injustice 
against all odds.” 
—Howard Zinn, 
author of 
A People’s History of 
the United States 
$29.95 cloth 


The Fractious Nation? 


Unity and Division in Contemporary 
American Life 


Jonathan Rieder, Editor; 
Stephen Steinlight, Associate Editor 


“Offers a bracing challenge to widely held 
beliefs about cultural and political frag- 
mentation in the United States today. The 
Fractious Nation? may well change the 
debate on issues ranging from multicultur- 
alism and race relations to governance and 
public philosophy.” 

—William A. Galston, author of 

Liberal Purposes 

$50.00 cloth, $19.95 paper 


Food Politics 


How the Food Industry Influences 
Nutrition and Health 


Marion Nestle 


“Provocative and highly readable.” 

—Economist 
2002 Award for Excellence, Association of American 
Publishers; James Beard Foundation Awards, James 
Beard Foundation; Harry Chapin Media Awards, 
World Hunger Year; California Studies in Food and 
Culture, $15.95 paper 
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Notes from the Editor 


Readers of a certain age who are still able to summon 
up memories of those thrilling days of yesteryear may 
recall that the previews for Hollywood movies used 
to feature such come-ons as “Years in the making!,” 
“With a cast of thousands!,” and “In living color!” The 
articles in this issue of the APSR may or may not 
have been years in the making, though I am pleased 
to attest that the review and production phases of 
their creation fell well short of epic proportions. Nor, 
although some of these articles are co-authored, did 
their dramatis personae ever exceed a sub-DeMillean 
three. As Louis B. Mayer might have said, though, 
color we've got. As usual, our cover shimmers. But 
this time the color comes in multiple hues rather than 
the normal monochrome, and the color can be found 
not just on the cover but in our lead article as well, 
where the cover graphic reappears along with several 
multi-color accompaniments. Where, the traditional- 
ists among us may wonder, will it all end? Brightly 
colored covers were bad enough—but when the next 
issue of the APSR arrives, should three-dimensional 
scattergrams be expected to pop up out of its pages? 
Will question-wording appendixes be intoned in the 
basso profundo of James Earl Jones? Will the textual 
analyses self-deconstruct? The mind boggles; the slope 
is slippery; a little color is a dangerous thing. 

Because my job as editor consists in large measure— 
though, thankfully, by no means exclusively—of de- 
livering bad news to 90 percent or so of those who 
submit their work for consideration here, I hope I can 
be forgiven for what may appear to be an unnatural 
fixation on bright colors and striking graphics. That 
is a “fun” part of the job. A source of far greater 
pleasure, however, is the intellectual diversity of the 
papers that we publish, and this issue is an excellent 
case in point. Geographically and temporally, this issue 
ranges from ancient Athens to the NeverNeverLand of 
Beita. Methodologically, we have simulation, textual 
analysis, policy analysis, formal modeling, case stud- 
ies, symbolic interactionism, high-powered statistical 
analysis, constructivism—you name it and it’s likely 
to be here, or forthcoming in our pages. Please stay 
tuned. 


IN THIS ISSUE 


Our lead article, the colorful “Secessionism in Multi- 
cultural States: Does Sharing Power Prevent or En- 
courage It?” has a dual agenda. Substantively, Ian S. 
Lustick, Dan Miodownik, and Roy J. Eidelson’s point 
of departure is the often-contradictory array of results 
that have been reported by “small-n” and “large-n” 
researchers interested in ethnic mobilization and se- 
cession. Methodologically, in their attempt to bring 
some order to this field in disarray, Lustick and his 
associates pursue an analytic approach—agent-based 
modeling—that is simultaneously high-tech and highly 
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compatible with a perspective—constructivism—that 
is not ordinarily associated with high-tech approaches. 
This article, featuring one of the first forays of agent- 
based modeling into a mainstream social science jour- 
nal, should be of interest to an unusually broad spec- 
trum of readers, and not just for its colorful graphics. 

In “The Inefficient Use of Power: Costly Conflict 
with Complete Information,” Robert Powell maintains 
Lustick et al.’s (and our discipline’s) focus on power, 
but sets out in a different direction. Powell wonders 
why, when players bargain over the costly “pie” of con- 
flict, their use of power fails to achieve a Pareto optimal 
outcome. Rather than treating asymmetric information 
as the cause, Powell argues that commitment problems 
are to blame for inefficient uses of power. Identify- 
ing a rapidly shifting environment as a condition of 
inefficiency, Powell demonstrates how all equilibria of 
a two-actor stochastic game will be inefficient despite 
the perceived advantage of having complete informa- 
tion. Powell’s analysis not only promises to provide 
a starting point for explaining why the use of power 
produces unintended consequences in various contexts, 
but also signals the need for additional research to 
clarify why rapid environmental shifts occur in the first 
place. 

Conventional wisdom about public policy in the 
United States, especially as it bears on the welfare 
state, centers on its inertial character—its bias toward 
the status quo and the difficulty of changing directions 
once they have been established. However, in an im- 
portant new analysis, Jacob S. Hacker challenges the 
conventional view of inertia and what is required to 
overcome it. In “Privatizing Risk without Privatizing 
the Welfare State: The Hidden Politics of Social Policy 
Retrenchment in the United States,” Hacker argues 
that even in the absence of visible, legislated policy 
change, substantial alterations to welfare policy can 
occur through the mechanisms of drift, conversion, and 
layering—strategies that are used to sidestep political 
and institutional barriers to outright policy change. This 
model of policy change, both overt and hidden, has the 
potential to guide analyses not only of social policy 
in the United States, but more broadly of policy pro- 
cesses, institutional politics, and political change in a 
wide array of settings. 

“Remodeling the Competition for Capital: How Do- 
mestic Politics Erases the Race to the Bottom,” by Scott 
J. Basinger and Mark Hallerberg, provides an excellent 
companion piece to the Hacker article. Basinger and 
Hallerberg want to know why the outcome that would 
have been expected based on the “race to the bot- 
tom” model of interstate competition for capital has 
not materialized. Based on their analysis of tax policy 
choices in 20 OECD countries, they conclude that the 
answer lies in domestic politics. Analyzing policy de- 
cisions from a “tournament” perspective, they argue 
that states take into account not only the tax reforms 
that their competitors have instituted, but also such 
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domestic political considerations as ideological resis- 
tance to the reforms and the operation of domestic veto 
players. This analysis, then, turns our attention toward 
the domestic factors that shape policies resulting from 
interstate competition for capital and make tax reforms 
possible. 

In “Diverging Orbits: Situation Definitions in Cre- 
ation of Regimes for Broadcast and Remote Sensing 
Satellites,” M.J. Peterson operates from an analytical 
perspective seldom seen in political science, symbolic 
interactionism, which spotlights interactions among 
those involved in a particular situation. Peterson’s anal- 
ysis centers on two simultaneous sets of negotiations 
concerning cross-border information flows that took 
place during the 1960s and ended with different results. 
Rather than simply looking at the immediate interests 
of the involved states, Peterson (echoing the domestic 
politics emphasis of the Basinger-Hallerberg analysis) 
focuses on how domestic actors who were not immedi- 
ately involved in the international arena came to influ- 
ence national policies and the international negotiation 
process. Peterson’s symbolic interactionist approach is 
the key element in her explanation of the development 
of a state’s preferences and its behavior in the interna- 
tional arena. 

From space in Peterson’s (remote sensing satellites) 
sense we turn to space in the very different sense 
of Jonathan Bendor and Adam Meirowitz’s “Spatial 
Models of Delegation.” Bendor and Meirowitz expand 
our understanding of when and to whom delegation 
occurs by modeling delegation in ways that go beyond 
the assumption of risk-aversion. Especially because 
their nested family of models helps pinpoint condi- 
tions responsible for various outcomes, Bendor and 
Meirowitz’s approach combines simplicity, rigor, and 
generality in a model that promises to be a fertile source 
for future research on a wide-ranging set of issues. 

Ancient Athens has long been held up as an exem- 
plar of classical and direct democracy for the modern 
age. Given Athens’ special importance in Western po- 
litical traditions, its political arrangements should be 
of widespread interest, but most political scientists’ un- 
derstandings of how Athenian democracy functioned 
are rudimentary at best, and their attempts to use the 
Athenian case to bolster their arguments about current 
issues accordingly fall short. In “Athenian Democracy 
and Legal Change,” Melissa Schwartzberg unearths ev- 
idence that the ease of modifying institutions and laws 
was a defining characteristic of Athenian democracy. 
Schwartzberg’s underlying premise is that by exam- 
ining the balance that was struck in ancient Athens 
between flexibility and innovation and between legal 
continuity and predictability, contemporary democrats 
can draw important insights about the merits of their 
own attempts to balance responsiveness and stability. 

The next three articles in this issue all fall into the 
“methodological” category, and in so doing demon- 
strate the extraordinary scope of that subfield, which 
extends all the way from highly abstract treatises 
about ways of knowing to extremely concrete devel- 
opments in data analysis technology. In the first of 
these contributions, Ryan Patrick Hanley reintroduces 


Isaiah Berlin’s oft-overlooked consideration of politi- 
cal inquiry. In “Political Science and Political Under- 
standing: Isaiah Berlin on the Nature of Political In- 
quiry,” Hanley points to the need for moral excellence 
and “humanism” when solving contemporary political 
problems, two important points that often fall victim 
to the pressures of scientific rigor. Hanley’s analysis 
re-establishes the priority of one of Berlin’s important 
messages: that social scientists must continue to act 
responsibly in formulating their questions. As Hanley 
puts it, “More necessary than method... is what comes 
before method, namely the judgment that enables us 
to determine which ideas are worth our attention and 
efforts, and which are—to use a favorite locution of 
Berlin’s—‘justly forgotten.” 

John Gerring asks and answers two key method- 
ological questions in “What is a Case Study and What 
is it Good For?” By assessing the virtues and limita- 
tions of the case study approach to social science re- 
search, Gerring advances our understanding of social 
science methodology and the trade-offs that confront 
researchers. Preferences differ: Some political scien- 
tists would prefer to explain 10 percent of the variance 
in 100 cases, while others would prefer to explain 90 
percent of the variance in a few cases. Gerring finds 
value in both types of analysis. Rather than disparaging 
case studies as a lesser or even primitive mode of in- 
quiry, he argues that they should be accepted as integral 
components of social science research, complementary 
to the full range of other social science methodologies. 

Closer to the technical end of the methodological 
continuum is Joshua Clinton, Simon Jackman, and 
Douglas Rivers’s “The Statistical Analysis of Roll Call 
Data.” The ability to estimate spatial parameters from 
roll call voting data has produced a burgeoning litera- 
ture in the past two decades, originally among students 
of Congress but with important spillovers into other 
research subfields as well. Continuing to develop these 
methodological tools, Clinton, Jackman, and Rivers use 
a Bayesian procedure to “revise and extend” the statis- 
tical underpinnings of the now-familiar NOMINATE 
scores, which have become a key component in quan- 
titative analyses of legislative politics. Their findings 
demonstrate, among other things, how others can in- 
novate in developing models that are attuned to specific 
formal and other theories. 

In an article that was published in the March 2000 
issue of the APSR, Nathaniel Beck, Gary King, and 
Langche Zeng asserted that the widely used and ac- 
cepted logit statistical model was not the best tool 
for deriving good forecasts of militarized disputes in 
the international arena. Instead, they suggested that 
neural network models are superior for this purpose. 
However, in “Untangling Neural Nets,” which appears 
in our “Forum” section, Scott de Marchi, Christopher 
Gelpi, and Jeffrey Grynaviski take issue with Beck, 
King, and Zeng’s characterization of the logit model 
and argue that logit models compare favorably to neu- 
tal network models in terms of parsimony, interpre- 
tation, and model fit. Their critique and the accom- 
panying response by Beck, King, and Zeng (“Theory 
and Evidence in International Conflict: A Response to 
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de Marchi, Gelpi, and Grynaviski”) shed new light on 
the relative merits and demerits of these statistical tech- 
niques and, in so doing, promise to inform quantitative 
modeling methods efforts for some time to come. 


INSTRUCTIONS TO CONTRIBUTORS 


General Considerations 


The APSR strives to publish scholarly research of 
exceptional merit, focusing on important issues and 
demonstrating the highest standards of excellence 
in conceptualization, exposition, methodology, and 
craftsmanship. Because the APSR reaches a diverse 
audience of scholars and practitioners, authors must 
demonstrate how their analysis illuminates a significant 
research problem, or answers an important research 
question, of general interest in political science. For the 
same reason, authors must strive for a presentation that 
will be understandable to as many scholars as possible, 
consistent with the nature of their material. 

The APSR publishes original work. Therefore, au- 
thors should not submit articles containing tables, 
figures, or substantial amounts of text that have already 
been published or are forthcoming in other places, or 
that have been included in other manuscripts submit- 
ted for review to book publishers or periodicals (in- 
cluding on-line journals). In many such cases, subse- 
quent publication of this material would violate the 
copyright of the other publisher. The APSR also does 
not consider papers that are currently under review 
by other journals or duplicate or overlap with parts of 
larger manuscripts that have been submitted to other 
publishers (including publishers of both books and 
periodicals). Submission of manuscripts substantially 
similar to those submitted or published elsewhere, or 
as part of a book or other larger work, is also strongly 
discouraged. If you have any questions about whether 
these policies apply in your particular case, you should 
discuss any such publications related to a submission in 
a cover letter to the Editor. You should also notify the 
Editor of any related submissions to other publishers, 
whether for book or periodical publication, that occur 
while a manuscript is under review by the APSR and 
which would fall within the scope of this policy. The 
Editor may request copies of related publications 

If your manuscript contains quantitative evidence 
and analysis, you should describe your procedures 
in sufficient detail to permit reviewers to understand 
and evaluate what has been done and, in the event 
that the article is accepted for publication, to per- 
mit other scholars to carry out similar analyses on 
other data sets. For example, for surveys, at the least, 
sampling procedures, response rates, and question 
wordings should be given; you should calculate re- 
sponse rates according to one of the standard formulas 
given by the American Association for Public Opinion 
Research, Standard Definitions: Final Dispositions of 
Case Codes and Outcome Rates for Surveys (Ann AT- 
bor, MI: AAPOR, 2000). This document is available 
on the Internet at <http://www.aapor.org/default.asp? 
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page = survey_methods/standards_and best_practices/ 
standard_definitions>. For experiments, provide full 
descriptions of experimental protocols, methods of 
subject recruitment and selection, subject payments 
and debriefing procedures, and so on. Articles should 
be self-contained, so you should not simply refer read- 
ers to other publications for descriptions of these basic 
research procedures. 

Please indicate variables included in statistical anal- 
yses by capitalizing the first letter in the variable 
name and italicizing the entire variable name the first 
time each is mentioned in the text. You should also use 
the same names for variables in text and tables and, 
wherever possible, should avoid the use of acronyms 
and computer abbreviations when discussing variables 
in the text. All variables appearing in tables should 
have been mentioned in the text and the reason for 
their inclusion discussed. 

As part of the review process, you may be asked 
to submit additional documentation if procedures are 
not sufficiently clear, the review process works most 
efficiently if such information is given in the initial 
submission. If you advise readers that additional infor- 
mation is available, you should submit printed copies 
of that information with the manuscript. If the amount 
of this supplementary information is extensive, please 
inquire about alternate procedures. 

The APSR uses a double-blind review process. You 
should follow the guidelines for preparing anonymous 
copies in the Specific Procedures section below. 

Manuscripts that are largely or entirely critiques or 
commentaries on previously published APSR articles 
will be reviewed using the same general procedures as 
for other manuscripts, with one exception. In addition 
to the usual number of reviewers, such manuscripts will 
also be sent to the scholar(s) whose work is being crit- 
icized, in the same anonymous form that they are sent 
to reviewers. Comments from the original author(s) to 
the Editor will be invited as a supplement to the advice 
of reviewers. This notice to the original author(s) is 
intended (1) to encourage review of the details of 
analyses or research procedures that might escape 
the notice of disinterested reviewers; (2) to enable 
prompt publication of critiques by supplying criticized 
authors with early notice of their existence and, there- 
fore, more adequate time to reply; and (3) as a courtesy 
to criticized authors. If you submit such a manuscript, 
you should therefore send as many additional copies of 
their manuscripts as will be required for this p 

Manuscripts being submitted for publication should 
be sent to Lee Sigelman, Editor, American Political 
Science Review, Department of Political Science, The 
George Washington University, 2201 G Street N.W., 
Room 507, Washington, DC 20052. Correspondence 
concerning manuscripts under review may be sent to 
the same address or e-mailed to apsr@gwu.edu. 


Manuscript Formatting 


Manuscripts should not be longer than 45 pages in- 
cluding text, all tables and figures, notes, references, 


Notes from the Editor 


and appendices. This page size guideline is based on the 
U.S. standard 8.5 x 11-inch paper; if you are submitting 
a manuscript printed on longer paper, you must adjust 
accordingly. The font size must be at least 11 points for 
all parts of the paper, including notes and references. 
The entire paper, including notes and references, must 
be double-spaced, with the sole exception of tables 
for which double-spacing would require a second page 
otherwise not needed. All pages should be numbered in 
one sequence, and text should be formatted using a nor- 
mal single column no wider than 6.5 inches, as is typical 
for manuscripts (rather than the double-column format 
of the published version of the APSR), and printed on 
one side of the page only. Include an abstract of no 
more than 150 words. The APSR style of embedded 
citations should be used, and there must be a separate 
list of references at the end of the manuscript. Do not 
use notes for simple citations. These specifications are 
designed to make it easier for reviewers to read and 
evaluate papers. Papers not adhering to these guide- 
lines are subject to being rejected without review. 

For submission and review purposes, you may place 
footnotes at the bottom of the pages instead of using 
endnotes, and you may locate tables and figures (on 
separate pages and only one to a page) approximately 
where they fall in the text. However, manuscripts ac- 
cepted for publication must be submitted with end- 
notes, and with tables and figures on separate pages at 
the back of the manuscript with standard indications of 
text placement, e.g., [Table 3 about here]. In deciding 
how to format your initial submission, please consider 
the necessity of making these changes if your paper 
is accepted. If your paper is accepted for publication, 
you will also be required to submit camera-ready copy 
of graphs or other types of figures. Instructions will be 
provided. 

For specific formatting style of citations and refer- 
ences, please refer to articles in the most recent issue 
of the APSR. For unusual style or formatting issues, 
you should consult the latest edition of The Chicago 
Manual of Style. For review purposes, citations and 
references need not be in specific APSR format, 
although some generally accepted format should be 
used, and all citation and reference information should 
be provided. 


Specific Procedures 
Please follow these specific procedures for submission: 


1. You are invited to submit a list of scholars 
who would be appropriate reviewers of your 
manuscript. The Editor will refer to this list 
in selecting reviewers, though there obviously 
can be no guarantee that those you suggest will 
actually be chosen. Do not list anyone who has 
already commented on your paper or an earlier 
version of it, or any of your current or recent 
collaborators, institutional colleagues, mentors, 
students, or close friends. 

2. Submit five copies of manuscripts and a diskette 
containing a pdf file of the anonymous ver- 


May 2004 


sion of the manuscript. If you cannot save the 
manuscript as a pdf, just send in the diskette 
with the word-processed version. Please ensure 
that the paper and diskette versions you sub- 
mit are identical; the diskette version should 
be of the anonymous copy (see below). Please 
review all pages of all copies to make sure that 
all copies contain all tables, figures, appendices, 
and bibliography mentioned in the manuscript 
and that all pages are legible. Label the diskette 
clearly with the (first) author’s name and the 
title of the manuscript (in abridged form if need 
be), and identify the word processing program 
and operating system. 


. To comply with the APSR’s procedure of 


double-blind peer reviews, only one of the five 
copies submitted should be fully identified as 
to authorship and four should be in anonymous 
format. 


. For anonymous copies, if it is important to the 


development of the paper that your previous 
publications be cited, please do this in a way 
that does not make the authorship of the sub- 
mitted paper obvious. This is usually most eas- 
ily accomplished by referring to yourself in the 
third person and including normal references 
to the work cited in the list of references. In no 
circumstances should your prior publications be 
included in the bibliography in their normal al- 
phabetical location but with your name deleted. 
Assuming that text references to your previous 
work are in the third person, you should include 
full citations as usual in the bibliography. Please 
discuss the use of other procedures to render 
manuscripts anonymous with the Editor prior 
to submission. You should not thank colleagues 
in notes or elsewhere in the body of the paper or 
mention institution names, web page addresses, 
or other potentially identifying information. AH 
acknowledgments must appear on the title page 
of the identified copy only. Manuscripts that are 
judged not anonymous will not be reviewed. 


. The first page of the four anonymous copies 


should contain only the title and an abstract 
of no more than 150 words. The first page of 
the identified copy should contain (a) the name, 
academic rank, institutional affiliation, and con- 
tact information (mailing address, telephone, 
fax, e-mail address) for all authors; (b) in the 
case of multiple authors, an indication of the 
author who will receive correspondence; (c) any 
relevant citations to your previous work that 
have been omitted from the anonymous copies; 
and (d) acknowledgments, including the names 
of anyone who has provided comments on the 
manuscript. If the identified copy contains any 
unique references or is worded differently in 
any way, please mark this copy with “Con- 
tains author citations” at the top of the first 
page. 


No copies of submitted manuscripts can be returned. 


American Political Science Review 


ELECTRONIC ACCESS TO THE APSR 


Back issues of the APSR are available in several elec- 
tronic formats and through several vendors. Except for 
the last three years (as an annually “moving wall”), 
back issues of the APSR beginning with Volume 1, 
Number 1 (November 1906), are available on-line 
through JSTOR (http://wwwjstor.org/). At present, 
JSTOR’s complete journal collection is available only 
via institutional subscription, e.g., through many col- 
lege and university libraries. For APSA members who 
do not have access to an institutional subscription to 
JSTOR, individual subscriptions to its APSR con- 
tent are available. Please contact Member Services at 
APSA for further information, including annual sub- 
scription fees. 

Individual members of the American Political Sci- 
ence Association can access recent issues of the APSR 
and PS through the APSA website (www.apsanet.org) 
with their username and password. Individual non- 
member access to the online edition will also be avail- 
able, but only through institutions that hold either a 
print-plus-electronic subscription or an electronic-only 
subscription, provided the institution has registered 
and activated its online subscription. 

Full text access to current issues of both the APSR 
and PS is also available on-line by library subscription 
from a number of database vendors. Currently, these 
include University Microfilms Inc. (UMI) (via its CD- 
ROMs General Periodicals Online and Social Science 
Index and the on-line database ProQuest Direct), On- 
line Computer Library Center (OCLC) (through its 
on-line database First Search as well as on CD-ROMs 
and magnetic tape), and the Information Access Com- 
pany (LAC) (through its products Expanded Aca- 
demic Index, InfoTrac, and several on-line services 
[see below]). Others may be added from time to 
time. 

The APSR is also available on databases through 
six online services: Datastar (Datastar), Business 
Library (Dow Jones), Cognito (LAC), Encarta Online 
Library (LAC), IAC Business (Dialog), and Newsearch 
(Dialog). i 

The editorial office of the APSR is not involved in the 
subscription process to either JSTOR for back issues 
or the other vendors for current issues. Please contact 
APSA, your reference librarian, or the database ven- 
dor for further information about availability. 


BOOK REVIEWS 


The APSR no longer contains book reviews. As of 2003, 
book reviews have moved to Perspectives on Politics. 
All books for review should be sent directly to the 
Perspectives on Politics Book Review Editors, Susan 
Bickford and Greg McAvoy. The address is Susan 
Bickford and Gregory McAvoy, Perspectives on Pol- 
itics Book Review Editors, Department of Political 
Science, University of North Carolina at Chapel Hill, 
CB No. 3265, Chapel Hill, NC 27599-3265. E-mail: 
bookreviews@unc.edu. 


VoL 98, No. 2 


- If you are the author of a book you wish to be con- 
sidered for review, please ask your publisher to send 
a copy to the Perspectives on Politics Book Review 
Editors per the mailing instructions above. If you are 
interested in reviewing books for Perspectives on Poli- 
tics, please send your vita to the Book Review Editors; 
you should not ask to review a specific book. 


OTHER CORRESPONDENCE 


The American Political Science Association’s address, 
telephone, and fax are 1527 New Hampshire Avenue, 
N.W., Washington, D.C. 20036, (202) 483-2512 (voice), 
and (202) 483-2657 (fax). E-mail: apsa@apsanet.org. 
Please direct correspondence as follows. 


Information, including news and notes, for PS: 


Dr. Robert J-P. Hauck, Editor, PS 
E-mail: rhauck@apsanet.org 


Circulation and subscription correspondence (domes- 
tic claims for nonreceipt of issues must be made within 
four months of the month of publication; overseas 
claims, within eight months): 

Elizabeth Weaver Engel, 

Director of Member Services 

E-mail: membership@apsanet.org 


Reprint permissions: 
E-mail: reprints@apsanet.org 


Advertising information and rates: 


Advertising Coordinator, 
Cambridge University Press 
E-mail: journals_advertising@cup.org 


EXPEDITING REQUESTS FOR COPYING 
APSR AND PS ARTICLES FOR CLASS USE 
AND OTHER PURPOSES 


Class Use 


The Comprehensive Publisher Photocopy Agreement 
between APSA and the Copyright Clearance Center 
(CCC) permits bookstores and copy centers to re- 
ceive expedited clearance to copy articles from the 
APSR and PS in compliance with the Association’s 
policies and applicable fees. The general fee for arti- 
cles is 75 cents per copy. However, current Associa- 
tion policy levies no fee for the first 10 copies of a 
printed artide, whether in course packs or on reserve. 
Smaller classes that rely heavily on articles (i.e., upper- 
level undergraduate and graduate classes) can take 
advantage of this provision, and faculty ordering 10 
or fewer course packs should bring it to the attention 
of course pack providers. APSA policy also permits 
free use of the electronic library reserve, with no limit 
on the number of students who can access the elec- 
tronic reserve. Both large and small classes that rely on 


Notes from the Editor 


these articles can take advantage of this provision. The 
CCC’s address, telephone, and fax are 222 Rosewood 
Drive, Danvers, MA 01923, (978) 750-8400 (voice), 
and (978) 750-4474 (fax). This agreement pertains only 
to the reproduction and distribution of APSA ma- 
terials as hard copies (e.g., photocopies, microfilm, 
and microfiche). 

The Association of American Publishers (AAP) 
has created a standardized form for college faculty 
to submit to a copy center or bookstore to request 
copyrighted material for course packs. The form is 
available through the CCC, which will handle copyright 
permissions. 

APSA also has a separate agreement pertaining to 
CCC’s Academic E-Reserve Service. This agreement 
allows electronic access for students and instructors 
of a designated class at a designated institution for a 
specified article or set of articles in electronic format. 
Access is by password for the duration of a class. 

Please contact your librarian, the CCC, or the APSA 
Reprints Department for further information. 


APSR Authors 


If you are the author of an APSR article, you may use 
your article in course packs or other printed materials 
without payment of royalty fees and you may post it at 
personal or institutional web sites as long as the APSA 
copyright notice is included. 


May 2004 


Other Uses of APSA-Copyrighted Materials 


For any further copyright issues, please contact the 
APSA Reprints Department. 


INDEXING 


Articles appearing in the APSR before June 1953 were 
indexed in The Reader's Guide to Periodical Literature. 
Current issues are indexed in ABC Pol Sci; America, 
History and Life 1954-; Book Review Index; Current 
Contents: Social and Behavioral Sciences; Econ- 
Lit; Energy Information Abstracts, Environmental 
Abstracts; Historical Abstracts; Index of Economic . 
Articles; Information Service Bulletin; International 
Index; International Political Science Abstracts; the 
Journal of Economic Literature, Periodical Abstracts; 
Public Affairs; Public Affairs Information Service 
International Recently Published Articles; Reference 
Sources; Social Sciences and Humanities Index; Social 
Sciences Index; Social Work Research and Abstracts; 
and Writings on American History. Some of these 
sources may be available in electronic form through 
local public or educational libraries. Microfilm of the 
APSR, beginning with Volume 1, and the index of the 
APSR through 1969 are available through University 
Microfilms Inc., 300 North Zeeb Road, Ann Arbor, 
MI 48106 (www.umi.com). The Cumulative Index to 
the American Political Science Review, Volumes 63 to 
89: 1969-95, is available through the APSA. 


American Political Science Review 


Vol. 98, No.2 May 2004 


Secessionism in Multicultural States: Does Sharing Power Prevent 


or Encourage It? 
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University of Pennsylvania 


nstitutional frameworks powerfully determine the goals, violence, and trajectories of identitarian 

movements—including secessionist movements. However, both small-N and large-N researchers 

disagree on the question of whether “power-sharing” arrangements, instead of repression, are more 
or less likely to mitigate threats of secessionist mobilizations by disaffected, regionally concentrated 
minority groups. The PS-I modeling platform was used to create a virtual country “Beita,” containing 
within it a disaffected, partially controlled, regionally concentrated minority. Drawing on constructivist 
identity theory to determine behaviors by individual agents in Beita, the most popular theoretical positions 
on this issue were tested. Data were drawn from batches of hundreds of Beita histories produced under 
rigorous experimental conditions. The results lend support to sophisticated interpretations of the effects of 
repression vs. responsive or representative types of power-sharing. Although in the short run repression 
works to suppress ethnopolitical mobilization, it does not effectively reduce the threat of secession. Power- 
sharing can be more effective, but it also tends to encourage larger minority identitarian movements. 


surprising that the maintenance of state boundaries 

would appear as a vital problem and that “state con- 
traction,” “secession,” or “partition” would be figured, 
by most scholars and politicians, as evidence of public 
policy failure or as desperately exercised options of last 
resort. To be sure, in the 1990s some scholars revived 
interest in territorial self-determination via partition 
of existing states as a sometimes useful policy option 
for individual states and for the international commu- 
nity. Against a background of severe political instability 
in the Balkans, central Asia, Africa, the Middle East, 
and the former Soviet Union, it has been suggested 
that some political conflicts might be managed best by 
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“rightsizing” states, i.e., adjusting their boundaries or 
creating new states (Hoppe 1998; O’Leary, Lustick, and 
Callaghy 2001). Others have emphasized the occasional 
necessity for forcible partitions and even population 
transfers to achieve a correspondence between eth- 
nopolitical or sectarian allegiances and the contours of 
states legitimized by them (Kaufmann 1998; Tullberg 
and Tullberg 1997). On balance, however, the bulk of 
the scholarly and public policy community continues 
to oppose such approaches on moral, legal, practical, 
and other grounds. Instead they direct much more at- 
tention to how conflict, especially violent conflict, can 
be managed while protecting existing state boundaries 
(Carley 1997; Horowitz 1985, 588-92; 1997, 435; Kumar 
1997; Sambanis 2000). 

Many factors, considered independently or in inter- 
action, have been prominent in recent studies of the eti- 
ology of secessionism. They include the implications of 
economic advantage/disadvantage, topography, world 
region, demographic patterns, globalization, cultural 
distinctiveness, inter-group antipathy, type of identities 
in conflict, political entrepreneurship, and outside in- 
tervention by irredentist or culturally related powers. 
In this paper, however, we focus specifically on one key 
thread in this sprawling conversation—the relation- 
ship between institutionalized empowerment of poten- 
tially secessionist groups and the appearance of seces- 
sionism. 

Indeed it can be argued that the single most popular 
line of argument offered by scholars to policy makers 
has been to suggest political and institutional arrange- 
ments to satisfy demands by whatever regional, reli- 
gious, ethnic, or other groups with secession potential 
appear to threaten the integrity of the existing terri- 
torial states. Under this rubric various techniques and 
approaches have been elaborated, including affirma- 
tive action, multicultural liberalism, federalism, auton- 
omy, cantonal arrangements, and power-sharing (e.g., 
Danspeckgruber 1996; Lapidoth 1996; Lijphart 1977, 
1985; Sambanis 2000). The general view here is that by 
responding positively and integratively, if only partially, 
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to the demands of disgruntled minorities, secessionism 
can be abated and secession prevented while preserving 
the predominance and stability of the central state. By 
making government more responsive to the concerns of 
disgruntled minorities, potentially secessionist groups 
will be encouraged to feel confident of representation 
and protection for their most vital concerns. Such in- 
stitutional responses by the central state are deemed 
capable, if designed and implemented properly, of re- 
ducing the intensity of separatist demands by those who 
otherwise might make them. In Hirschman terms, the 
impetus for exit is to be blunted by providing oppor- 
tunities for voice and reasons for loyalty (Hirschmann 
1970). 

However the opposite view is also strongly argued— 
that creating autonomous, federal, or otherwise 
devolved institutions of self-government or self- 
administration, especially if they allow regionally con- 
centrated groups to mobilize within them, is liable 
to contribute to secessionism by affording elites and 
groups the political resources they need to undertake 
mass mobilization and wage separatist struggles. This 
literature supports a widely acknowledged fear among 
state elites that granting regionally concentrated mi- 
Norities special forms of autonomy, devolved powers, 
or privileges within a system of asymmetric federalism 
would not so much assuage demands for representa- 
tion, control, or resources, as lead the country down a 
slippery slope to separatism (Cornell 2002; Hale 2000; 
Mozaffar and Scarritt 2000; Roeder 1991). 

A distinct but related line of argument was spurred in 
part by the collapse of the Soviet bloc, successful seces- 
sion (or partition) in Yugoslavia and Czechoslovakia, 
and eruption of secessionist conflicts in many parts of 
the former Soviet Union. Some researchers during the 
last decade have argued that democratization might 
not be able to be fine-tuned enough to prevent devo- 
lution and empowerment from producing more rather 
than less secessionism and attendant violence. From 
this position there is a tendency to minimize the im- 
portance of minority group dissatisfaction with cen- 
tral state policies as an explanation for secessionism. 
Instead, a “supply-side” interpretation is favored. On 
this account, illegal mobilization against the state for 
secessionist purposes does not increase as a function of 
demands for more responsiveness, democracy, or au- 
tonomy. Rather it arises when the human and material 
wherewithal to make such risky behavior profitable is 
available to political entrepreneurs and the small group 
of activists actually engaged in mobilizational or violent 
activities (Collier 2000; Fearon and Laitin 2003; Laitin 
2001). 

Another form of the position that dangerous instabil- 
ities arise from the abilities of peoples to rebel, rather 
than from the depredations of states, has been articu- 
lated by researchers who point to apparent correlations 
between democratization and the eruption of severe 
ethnic violence and secessionism. Jack Snyder offers a 
neo-Huntingtonian argument that stresses the impor- 
tance of establishing strong political institutions prior 
to democratization to prevent demands for participa- 
tion from disrupting the political stability of the state. 
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Whereas Donald Horowitz used peaceful interethnic 
accommodation in Malaysia and violent secessionism 
in Sri Lanka to support his thesis that properly designed 
democratic institutions prevent ethnic violence and se- 
cessionism, Snyder (2000), writing in the late 1990s, 
argues that Malaysia illustrates the “advantages of au- 
thoritarianism” while Sri Lanka shows the “dangers of 
democratization” (275, 280).! 

This line of analysis reflects a subtheme in the litera- 
ture on containing potentially secessionist and violent 
ethnic mobilizations via repression, control, or domi- 
nation. The idea that “repression works” is implicit in 
the argument that states can maintain their borders, 
prevent secessionism, and preserve the political posi- 
tion of ascendant groups by refraining from offers of 
institutional compromises or added resources to ac- 
commodate out-group demands (Collier 2000; Laitin 
2001). 

The current state of the scholarly debate regarding 
the general relationship between institutionalized em- 
powerment of potentially secessionist regional groups 
is confused. Hechter and Okamoto address this head- 
on in their 2001 metastudy. They observe that “there 
is little consensus about the kinds of political insti- 
tutions that are most likely to contain nationalism” 
(203). As an example, they describe three positions 
held by important groups of scholars regarding the ef- 
fect of federalism on nationalist mobilization: (1) fed- 
eralism reduces nationalism, (2) federalism increases 
nationalism, and (3) federalism itself does not deter- 
mine strength of nationalism (204). Similar to other 
of the more sophisticated approaches to this question, 
Hechter and Okamoto conclude that causal relation- 
ships between institutionalized empowerment of po- 
tentially secessionist minorities and the rise of seces- 
sionism are complex, nonlinear, and highly sensitive to 
context. Thus in his own work Hechter (2000) suggests a 
possible reconciliation of the two opposing arguments 
in the literature: “Whereas decentralization may pro- 
vide cultural minorities with greater resources to engage 
in collective action ... at the same time it may erode the 
demand for sovereignty.”? 

A similarly nuanced view of the curvilinear patterns 
associated with different combinations of pairs of im- 
portant variables is advanced by Atul Kohli. He ar- 
gues that if the central authority structure is strong 
but willing to be accommodating and responsive to 
demands by potentially secessionist minorities, the re- 
sult will be short-term increases in ethnopolitical mobi- 
lization but long-term decreases in likelihood that the 
state will face potent secessionist threats). Unaccom- 
modating strong states can expect continuing cycles of 
mobilization and repression. Weak but accommodating 


1 Snyder makes a similar argument about patterns of political vi- 
olence and secesmonism in India and the “perils of pluralism and 
power-sharing” in Rwanda and Burundi. See also Cornell (2002), 
McGarry and O'Leary (1994, 94), and Saideman (1998). 

2 For broader consideration of strategies of control and domination 
see Adam (1971), Fearon (1998), Fearon and Laitin (2000), Lustick 
(1979, 1980), McGarry and O'Leary (1993, 23-26), and Yittachel 


(2000). 
3 Emphasis m the original. 
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states face increased possibilities of peaceful breakup. 
Weak and unaccommodating states can be expected 
to experience turbulence or secession/collapse (Kohli 
1997).* More common, however, among scholars study- 
ing the relationship between autonomy, devolution, 
power-sharing, federation, affirmative action, elec- 
toral reform, democratization, etc., and ethnically 
charged secessionism is the admission, à la Hecther and 
Okamoto, that available findings are contradictory and 
inconclusive (Freeman 1999, 369-70; Hechter 2000, 9; 
Spencer 1998, 3). 


AN AGENT-BASED MODELING APPROACH 
TO SECESSIONISM: ISSUES OF VALIDITY 
AND THE ROLE OF CONSTRUCTIVIST 
THEORY 


We turn now to consideration of the methods used to 
produce these incomplete but intriguing results. Two 
methods are prominent in scholarly work on this sub- 
ject. One is to conduct process tracings of a significant 
case or cases, using synchronic or diachronic compar- 
isons. The other is to draw on one or more large data 
sets describing events, countries, or minority groups. 
Here the technique is to examine whether or not hy- 
potheses about general relationships between institu- 
tional practices and outcomes are consistent with the 
patterns discernible from consideration of the data in 
these arrays. 

There is much to be learned in these ways. Clearly 
there is no substitute for studying the phenomena them- 
selves, in the real world, through careful scrutiny of 
crucial cases, through artfully structured comparisons 
of small numbers of episodes, and through statistically 
sophisticated treatments of highly processed, standard- 
ized, but rigorously conceived data sets. But it is our 
view that with so many variables involved and so many 
interaction effects present, with the great difficulty of 
gathering data relevant to those variables deemed theo- 
retically most interesting, and with the extreme scarcity 
of episodes of secessionism and secessionist conflict, 
compared to the array of potential contexts for their 
emergence, a third technique—agent-based computer 
simulation—has a crucial role to play in advancing the 
study of questions about the conditions that shape the 
likelihood of serious internal threats to the integrity 
and stability of states. 

Among the most daunting challenges to scholars us- 
ing the small-N approach, drawn from one case or from 
structured focused comparisons of a small number of 
cases, is the stringent limit on the array of possible 
natural experiments that can be arranged. This limit 
arises from the rarity of full-blown secession and the 
infrequency of severe ethnic conflict relative to the 
number of interethnic encounters that might have but 
did not produce such conflict. Typically the investiga- 
tor must settle for some level of contamination of the 


4 For smularly nuanced views on the difficulty of making linear pre- 
dictions about the effect on secessionism of more inclusive electoral 
arrangements, see Crawford (1998), Reilly and Reynolds (1999), and 
Scarritt, McMillan, and Mozaffar (2001) 
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comparison based on factors that were either incon- 
veniently different or similar about the cases or about 
the episodes being compared within a single case study. 
Ideally the investigator would be able to select cleanly 
sorted comparisons, leaving all possibly relevant values 
identical except for differences on either the depen- 
dent or the independent variable. In the real world, 
investigators have to settle for best available, partial, 
confounded, and therefore clouded comparisons.’ 

Among the most daunting challenges facing schol- 
ars using large-N approaches is the inevitable gap that 
arises between the imputed meaning of the variables 
under investigation, as specified in theories under test, 
and the real measurements used to indicate the values 
taken on by those variables in specific cases. These gaps 
are in part produced by the need to use best available 
quantifiable surrogates for the complex sentiments, 
opinions, behavior patterns, and circumstances that are 
of theoretical interest. Such gaps are then widened by 
the need to treat data gathered in different ways in 
different countries as comparable across cases and by 
serious differences across cases and time periods in the 
reliability of the information gathered. Such problems 
are then further compounded by the challenge of es- 
tablishing and enforcing coding routines that avoid is- 
sues of selection bias, pass tests of intercoder reliability, 
and are updated regularly based on new or improved 
knowledge available about cases or historical episodes. 
Combined, these challenges pose serious risks that find- 
ings may be driven as much or more by artifactual as- 
pects of data collection and processing as by the under- 
lying patterns putatively reflected in the data collected. 
It is probably safe to assume that a substantial portion 
of the disagreements that arise among researchers us- 
ing large-N approaches springs from these problems 
(Chandra 2001, 10; King and Zeng 2001).° 

The kind of “bottom-up” simulations offered by 
agent-based modeling provides researchers a third way. 
Using computers to produce simulations of social and 
political phenomena based on widely distributed but 
interactive processes is not new. Important work has 
been done on mobilization or repression in political 
contexts (Bhavnani and Backer 2000; Epstein, Stein- 
bruner, and Parker 2002; Lustick and Miodownik 2002; 
Srbljinovic et al. 2003), identity diffusion (Axelrod 
1997; Hoffmann 2003; Lustick 2000; van der Veen 
2002), the emergence of ethnocentrism (Axelrod and 
Hammond 2003), and the endogenization of borders 
(Cederman 1997, 2002). Excellent metaanalyses have 
identified emerging trends, challenges, and opportuni- 
ties in this cluster of research programs (Cederman 
2001; Macy and Willer 2002).’ 


5 This us the case even as scholars pick and choose from an array 
of historical accounts that 1s considerably more differentiated than 
whatever it is that those accounts are secking to describe (Lustick 


1996) 

6 Important work is underway to improve the reliabihty of these 
See Doyle and Sambanis (2000), Laitin (2000), Laitin 

and Posner (2001), and Wilkinson (2000, 2001). 

7 An older tradition of computer simulation relies on differential 

equations rather than on interactions among multitudes of adaptive 
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If theoretical expectations are relatively clear, but 
data are hard to find that reliably match theoretical 
categories, if available natural experiments do not al- 
low crucial questions to be posed cleanly because of 
inconvenient confounds, and if key aspects of the phe- 
nomenon of interest are relatively rare, computer sim- 
ulation should be considered a logical complement to 
other techniques of analysis. Such simulation involves 
creating a virtual world in which the basic theoretical 
relationships among individuals or groups are imple- 
mented directly, obviating the need for surrogate mea- 
sures or indices of key variables. Large batches of “his- 
tories” or “futures” of these worlds can be produced 
by randomizing initial conditions or the pattern of per- 
turbations to which the world is subjected. Standard 
statistical tests can then be run on the distributions of 
outcomes produced under specified conditions. 

By controlling initial conditions and/or the valences 
and sequence of the streams of perturbations, the ef- 
fects of change in individual parameters of interest or 
the interaction between two or more specific variables 
of interest can be identified. Such theoretically signif- 
icant variables could include predominance of differ- 
ent identities across the population, geographical con- 
centration patterns, indices of difference or similarity 
across groups of agents, amount of variation in agent 
influence, etc, By randomizing perturbations and/or ini- 
tial conditions and collecting data on the trajectories 
produced by the “landscape” as it moves forward in 
time (with “agents” interacting and taking on or main- 
taining values depending on the algorithms with which 
they are endowed), researchers can systematically con- 
duct the thought experiments that they cannot conduct 
or observe in the real world and cannot perform in 
their heads because of the hundreds of thousands or 
even millions of calculations involved in every step. 

It is common and natural for questions to be raised 
about the “empirical validity” of computer simulation 
models. The important version of this question is not 
about whether or not the theory being tested with the 
model is true (a question that is obviously central in the 
evaluation of any model—whether computer deployed 
or not) but about whether the assumptions built into 
the virtual world are so radically different from con- 
ditions in the “real world” that inferences about the 
latter from the former cannot be justified. To under- 
stand why the problem of establishing the “empirical 
validity” of agent-based computer models is no more 
difficult, and possibly even less difficult, than in non- 
computer simulation models, it is useful to recall that 


agents. We have not found useful applications of this “systems dy- 
namics” approach to ethnopolitical mobilization or secesalonism 
We do know of a substantial amount of work m this area done by 
Russians scholars It is based largely on computer operationaliza- 
tions of Parsonian categories and variables. See, for example, Laptev 
(http:/Avww.univer omak.su/MEP/) For a discussion of this earlier 
tradition in relation to agent-based modeling approaches, see Macy 
and Willer (2002). 

8 By keeping the mitial conditions and pattern of perturbations con- 
stant, and changing the algorithms or microrules controlling agent 
behavior, problematic aspects of the theories of identity change, po- 
Ittical behavior, and elite recruitment these rules reflect can be tested 
as well 
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such questions pertain to the “generalizability” of the 
findings produced with the model. Can findings pro- 
duced in a virtual world, with a model whose terms are 
implemented systematically and exactly via computer, 
be generalized to cognate phenomena the real world? 

Of course all applications of scientific models to the 
teal world must include, or should include, discussion 
of limits on their generalizability. Such limits are associ- 
ated with any kind of research project. Hypotheses are 
always tested, not against the “real world,” but against 
a proxy for it that arises from the assumptions of the 
theoretical approach adopted and the rules used to ob- 
serve and code data. Accordingly, there is no intrinsic 
difference between the use by agent-based modelers 
of an explicitly constructed virtual world as that proxy 
and the use by researchers employing traditional small- 
N or large-N techniques of the implicitly virtual worlds 
that arise from their assumptions and coding rules as 
the backgrounds against which their hypotheses are 
tested (or “validated”). For each of these approaches 
limits to generalizability must be identified in terms of 
the versimilitude believed to exist between the surro- 
gate world created for the testing of hypotheses and 
the “real world.” Indeed, for each of these approaches, 
including agent-based simulation modeling, an impor- 
tant source of validation is productive communication 
across theoretical assumptions and methodological di- 
vides. If researchers using diverse methods, assump- 
tions, and coding rules produce findings that corre- 
spond with or usefully articulate with one another, 
that cumulative discussion itself constitutes validation. 
It represents a kind of triangulation of a stable “real 
world” to which each participating theoretical effort 
would appear to have access. 

If the validation question is fundamentally the same 
across approaches, each of which tests its hypotheses 
against a surrogate for the real world, what does dis- 
tinguish an agent-based modeling approach from oth- 
ers? It is the completeness and understandability of the 
virtual world it is using as its surrogate. Unlike other 
approaches, its machinery can be both complex and 
transparent, for everything in it is decidable, visible, and 
registered. Some approaches (such as rational choice) 
are transparent but radically simple in terms of the 
number of interacting parts and variables. Other ap- 
proaches using more complex models obscure the limits 
imposed by their substantive theories and conceal the 
vast number of arbitrary, uncontrolled, or unknown 
“parameter values” under a sweeping ceteris paribus 
assumption. 

In work with PS-I in general, and specifically in 
our studies of secessionism, we use specific substantive 
theories and “pretheories”—theories of constructivist 
identity, social identity, and collective identity mobi- 
lization. We find, just as virtually all social scientists do, 
that even the best of these theories and the most care- 
ful of research designs fail to produce an experimental 
world in which every element in any way related to the 
phenomenon under study is assumed as a constant or 
is incorporated in the research design as a controlled 
variable. An obvious example is the constructivist posi- 
tion that individuals and collectives have “repertoires” 
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of identities that are relatively easily presented or con- 
cealed. Exactly how much easier it is to present a con- 
cealed identity than to integrate a new identity formerly 
not within the repertoire is seldom if ever explored, let 
alone set out as a theoretical proposition or research 
finding. The advantage of agent-based modeling is not 
that it eliminates stipulated parameter values, but that, 
as in our work described below, every stipulation is 
noted. In contrast to other approaches, especially non- 
formal approaches, there are no unrevealed or unsee- 
able assumptions. An added value of computer sim- 
ulations of this sort is the opportunity for systematic 
sensitivity tests in the face of good (i.e., theoretically 
based) arguments that specific parameters might be 
particularly liable to produce artifactual results. 

The constructivist basis of political identity and 
identity change is now well established.? Studies of 
identity—ethnic, national, or otherwise—begin almost 
as commonly with a statement decrying the practice of 
comparing primordialist and constructivist approaches 
as they do with that comparison itself As Kanchan 
Chandra (2001) put it in her introduction to a sym- 
posium on the constructivist consensus prevailing in 
political science: 

The constructivist approach, developed across the disci- 

plines of anthropology, sociology, political science, history 

and literature, has discredited the primordialist approach 
by showing that ethnic groups are fluid and endogenous 
to a set of social, economic and political processes. Those 
who subscribe to the constructivist approach agree on two 
basic propositions: First, individuals have multiple, not sin- 
gle ethnic identities; and second, the identity with which 
they identify varies depending upon some specified causal 
variable, Changes in the value of these causal variables are 
likely to lead to changes in individual identifications. (7) 


From the now standard constructivist position, iden- 
tities at both the individual and the collective lev- 
els are ultimately fluid, chosen, instrumentalizable, re- 
sponsive to change in relevant incentive structures, 
and susceptible to manipulation by cultural or polit- 
ical entrepreneurs. Examples of more or less stan- 
dard positions within this approach are Aronoff (1998), 
Brass (1980), Laitin (1998), and Nagel (1994). Ex- 
tremes within this general perspective are anchored 
on one side by arguments of the sort made by David 
Brown (1988), Walker Connor (1998), A. D. Smith 
(1981, 1986), and some sociobiologists such as Gary R. 
Johnson (1997). These scholars emphasize the psycho- 
logical affinity of identity appeals based on kinship pat- 
terns or tropes to explain why communities imagined 
in common descent terms are so prevalent, stable, and 
politically potent (i.e., why the effects of opportunities 
for choice are so limited). At the other extreme are 
those scholars more impressed with the extraordinary 
variability of ethnic and other forms of cultural identifi- 
cation over time and in different circumstances. Indeed 


9 For early examples of the constructivist ch to ethnic and 
national identity see Barth (1959), Brass (1974), and Hechter (1975). 
For representative applications and expositions of the approach see 
Eley and Suny (1996), Kowert and Legro (1996), Nagel (1994), and 
Verdery (1991). 
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some sociologists, anthropologists, and cultural studies 
scholars operating in this mode shift their attention 
entirely from the individuals and groups themselves 
(and their putative “identities”) to the prevailing cate- 
gories of identification and the relationship of changing 
circumstances to “performances” of those categories. 
Rogers Brubaker (1996) is one of the most influential 
scholars taking this perspective. Some political econ- 
omy approaches to identity also tend toward extreme 
positions of the sort that implicitly imagine any iden- 
tity as presentable by any actor or group depending 
on incentives and the choices those incentives make 
attractive (Hardin 1995). 

To be sure, constructivism performs its fundamental 
theoretical role not by producing final answers but by 
framing questions. For example, how large are reper- 
toires of identities? How easily can they change? What 
is the effect on larger political processes of variation in 
the size of identity repertoires, the fluidity with which 
identities are traded, or the volatility and turbulence of 
the incentive structure affecting competing identities? 
These questions are implicit in constructivist identity 
theory, but until the development of agent-based mod- 
eling approaches they have been extremely difficult to 
investigate, or even pose properly (Lustick 2000). But 
the fact that there are good but unanswered questions 
implied by constructivism is evidence of its fruitfulness 
as a framework for research, not of its inability to pro- 
vide guidelines for designing mechanisms to animate a 
virtual world of competing political identities carried 
in the repertoires of multiple, but different, agents. 


BEITA—A VIRTUAL MULTIETHNIC STATE 


To exploit constructivist identity theory for the explo- 
ration of patterns of secession and secessionism a vit- 
tual state was created, named, for convenience, “Beita.” 
Beita was designed using the PS-I simulation platform 
to capture in composite form certain common features 
of multicultural or multiethnic states that might en- 
counter threats of secession (Dergachev 2003; Lustick 
2000, 2002; Lustick and Miodownik 2000, 2002). 

As displayed in Figure 1, Beita is a square with 66 
cells, or agents, per side (4356 cells). The external edge 
of Beita is comprised of a fixed, unbroken, and im- 
permeable array of 260 black-colored “border” cells. 
Within these borders are located 4,096 agents that com- 
prise the Beita polity.” Each square-shaped cell, or 
agent, in this array is endowed at time zero (¢ = 0) with 
a repertoire or portfolio of identities, one of which is 
“activated,” i.e., visible to the agents in its neighbor- 
hood (the eight agents bordering it on its four sides and 
its four corners). Different colors represent different 
identities. As Beita moves forward in time the rules gov- 
erning agent behavior permit the rotation and trading 
of identities as functions of changing advantages and 
disadvantages associated with individual identities and 
with local conditions. These identity substitutions are 


10 Agents can be magined as modeling individuals, families, villages, 
or any unit of political aggregation that may seem appropriate 
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FIGURE 1. Beita—A Virtual Multiethnic State 
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in turn reflected in patterns of color change and cluster- 
ing as some identities coalesce into control of particu- 
lar regions or lose their grip on those regions. Statis- 
tics describing changing characteristics of Beita are 
automatically collected for diachronic and synchronic 
analysis. 

The polity is divided into four quadrants, not by 
boundaries but by changes in patterns of overlap- 
ping and shared political identities. The upper left, or 
“northwest” (NW), quadrant is the core of the state. 
A national bureaucracy, comprised of a web of agents 
drawn from one of three “agent classes” of “influen- 
tials” (see Table Al, Appendix) and having double. 
triple, or quadruple the “influence” of “basic” agents, 
radiates out from this quadrant into the other quad- 
rants. In the “standard” version of Beita, used as a base- 
line for experimental purposes, these “bureaucrats” (or 
“officials”) all have the currently dominant, i.e., incum- 
bent, identity, identity “5,” in their repertoires and al- 
most all of them are activated on identity 5 at t = 0. 
Two other identities, identities 4 and 13, are “loyal op- 
position” national identities. All national bureaucrats 
have these identities in their repertoires. 

Top-echelon bureaucrats have an influence level of 
4, compared to the influence of a basic agent, whose 
influence level is 1. Top-echelon bureaucrats are few in 
number, relatively centrally located within the radiat- 
ing bureaucratic web, comprised (initially, at least, since 
all identities in Beita are tradable) of only the three 
national identities, and marked visually with a circle 
inside the normal agent square. Mid-echelon bureau- 
crats are marked with a spiral, have an influence level 
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of 3, and a slightly larger identity repertoire than top- 
echelon bureaucrats—reflecting the regionally preva- 
lent identities as well as national identities. Lower- 
echelon bureaucrats, those with influence level 2, 
have within their repertoires both regionally preva- 
lent and parochial identities in their region along with 
the three national identities. Thus, for example, lower- 
echelon national officials in the northeast (NE; upper 
right) quadrant have regionally prevalent identities 3 
and 15 in their repertoire, along with a less prominent 
“parochial” identity, identity 9, and all three national 
identities, 4, 5, and 13. For detailed information about 
the exact representation of different kinds of agents 
in Beita and their identity complexions, see Table A2 
(Appendix); for color coding see Figure 3. 

Beita was produced as a composite rendering of a 
multinational or multiethnic country, corresponding di- 
rectly to no one particular country but containing com- 
mon aspects of many. In the NE and southwest (SW) 
quadrants regionally prevalent identities have sepa- 
rate small authority structures. These can be identified 
as local arrays of officials activated on colors associ- 
ated with regionally prevalent identities—bureaucrats 
whose loyalties to the central state are reflected in the 
presence of national identities in their repertoires even 
as they begin Beita histories activated on their par- 
ticular ethnic identity. The intent in these quadrants 
was to model relationships of multinational democ- 
racy based on principles of federalism and/or multicul- 
tural liberalism. Thus national identities were included 
within the repertoires of both bureaucrats and ordi- 
nary inhabitants, regardless of their activation on more 
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particularist identities or the local prevalence of such 
identities within the repertoires of agents in that region. 
The radiating web of the national bureaucracy in each 
of these quadrants is substantial, though not as dense 
as in the state’s “national core” in the NW. The tolerant 
and accepting relationship between the national state 
and the ethnic groups in the NE and SW is also reflected 
in the presence of regionally prominent ethnic identi- 
ties in the repertoires of mid- and low-echelon national 
bureaucrats in those regions. 

But things are different in the southeast (SE). The 
SE is modeled as a region controlled by the state but 
inhabited by a disgruntled regional minority (DRM) 
whose identity (10) is present in the repertoires of 
79% of the agents in this quadrant.!! The fearful and 
alienated relations between the state and this group 
are reflected in the low activation rate of this identity 
and by the complete absence of identity 10 from the 
repertoires of the national bureaucrats stationed in SE. 
Beita also features a minority group in the SE region, 
associated with identity 16, that is historically distinct 
from and even antagonistic to the regionally dominant 
identity (10). As is quite typically the case, it is thus 
attractive as an ally against identity 10 by the national 
center.!? 

Such features in Beita were designed to make it a 
kind of “ideal type”—a country that actually does not 
exist but that, in the deliberate clustering of elements 
held to be crucial by relevant theories, bears stronger 
resemblances to countries relatively predisposed to se- 
cessionism linked to processes of identity-based self- 
determination movements. Indeed Beita is not pre- 
sented as a universal template for simulating the many 
kinds of economic, political, legal, international, and 
cultural pressures that may be involved in the entire 
range of ethnopolitical mobilization and secessionist 
activity. It should be regarded as a specialized tool, use- 
ful for exploring the extent to which some patterns of 
ethnopolitical mobilization and secessionism could be 
accounted for by focusing specifically on “identitarian” 
processes and pressures. 

In addition, particular aspects of Beita’s institutional 
design render it more like some countries than oth- 
ers. For example, Beita features a dominant but not 
unitary regime authority structure, emanating from a 
secure core. The institutions of the regime within the 
borders of Beita include a variety of relatively de- 
centralized structures with overlapping loyalties in ar- 
eas of identitarian diversity. Beita also includes one 
area of the country in which the regime appears as 


11 Tn the SE quadrant, Identity 10 is activated at t = 0 by 12% of the 
agents with that identity in their repertoire compared, for example, 
to an activation rate of 24% for identity 15 in the NE quadrant, where 
it is a regionally prominent identity. 

12 Reflecting the favor it enjoys from the regime, identity 16 has a 
very high activation rate (54% of agents in SE with identity 16 m 
their repertoire are activated on that identity at t = 0) It also has a 
high rate of overlap with the mcumbent national identity—identity 5. 
Indeed more than 80% of SE agents activated on 16 have all three na- 
tional identities present within their repertoires. In contrast, identity 
16 is present within the repertoires of only 8% of SE agents activated 
on 10 att = 0. 
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a rigid, unresponsive, and alien set of institutions in 
contrast to a regionally predominant identity group de- 
prived of any substantial authority structure of its own. 
Countries and their disgruntled regional minorities to 
which our findings may best apply include Iraq/Kurdish 
region; Turkey/Kurdish region; Spain/Basque coun- 
try; Canada/Quebec; Yugoslavia under Milosevic; 
Sri Lanka/Tamil Northeast; United Kingdom/Ireland 
(19th century); France/Algeria (late 19th to mid-20th 
centuries); Pakistan/East Pakistan; Pakistan/regions 
such as the Sindh or the Northwest Frontier 
Province; India/Kashmir, Assam, or the Punjab; 
France and Corsica; and Indonesia/Papua or Aceh. 
Needless to say, none of these countries corresponds 
exactly to Beita, and in some of these cases, clearly, vari- 
ables other than the political/institutional variables in- 
vestigated in this article are decisively at work. Never- 
theless, for different reasons these cases do contain the 
elements and dynamics of the kind of countries Beita 
is designed to simulate more than, for example, the 
United States/Native Americans, Iraq/southern Shia 
areas, France/Brittany or Provence, Morocco/Western 
Sahara, and Israel/Palestinian areas. 

As Beita moves forward in time, the patterns of iden- 
tity activation and hence the patterns of visible color 
across the landscape change. Individual cells change 
identity activation as a result of interactions with their 
neighbors (agents directly touching their sides or cor- 
ners). Each agent registers the activated identities and 
influence levels of its neighbors but not the composi- 
tion of their repertoires. Simple calculations of relative 
“identity weight” lead each agent to either remain ac- 
tivated on its currently activated identity; rotate into 
activation an alternative identity from its repertoire; 
substitute an identity from outside its repertoire for one 
inside its repertoire; or, in cases of a fairly overwhelm- 
ing discrepancy in favor of an identity not in its reper- 
toire, actually substitute and activate on an identity pre- 
viously absent from its repertoire. (See the Appendix 
for technical details.) Clusters of agents activated on the 
same identities form—a process usually accelerated in 
regions featuring webs of “influential” agents activated 
on a common identity or having that identity in their 
repertoires, The clusters can then expand to include 
agents who bring that identity into their repertoires 
after being surrounded or nearly surrounded by agents 
activated on that identity. For purposes of illustration, 
Figure 2 is provided, showing a typical run, or history, 
of Beita at time step 50 (t = 50). 

Just as in the real world the origin of signals that 
shape the identity-based behavior of people and groups 
is not wholly local, so too can agents in Beita (and in 
PS-I models in general) have access to some nonlocal 


13 One important variable excluded from the sets of experiments 
with Beita reported here is the direct influence of cross-border pres- 
sures from neighboring countries possibly sympathetic to secessionist 
tendencies withm Beita. The impermeable border surrounding Beita 
prevents such pressures, though in other work, here, we 
have explored the implications of increasing the porosity of the bor- 
der separating the southeastern quadrant from variously constituted 
neighboring states 
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FIGURE 2. Beita Baseline—Typical Run (Time = 50) 











SE 








information. At every point in time each identity is as- 
signed a “bias,” that is, a negative number, a positive 
number, or zero.'* According to what can be thought of 
as a “mass media” mechanism, each agent is aware of 
these signals and how they change. Each agent updates 
its activation by adding these signals to its calculations 
about local identity weights. 

Figure 3 displays all the identities present in the spec- 
trum of Beita (that is, present as a subscribed identity in 
at least one agent). Each identity is labeled by number 
and color. The figure also shows, for each identity, the 
bias assigned to it at time step 50. The assignment of bi- 
ases is random, though for experimental purposes that 
randomness can be adjusted to introduce more or less 
volatility in the way the world is changing, a greater or 
smaller range within which it can change, and more or 
less predictability in the way it will change.'> At every 
even-numbered time step each identity is eligible for 
the assignment of a new bias. In the Beita history from 
which these illustrations are taken, the volatility was 
set so that the probability of being eligible for a bias 
change at any one time step was .005% (with a bias 
range of between —2 and +2). 





'4 A bias of “+1,” for example, would count in the identity weight 
calculation of an agent as much as would one extra basic agent in its 
neighborhood activated on that identity. 

'S Predictability is a measure, assuming that a change of bias assign- 
ment takes place, of how likely it is that that change will entail a move 
of more than one integer step away from its original bias assignment. 
As with most experiments reported here, this illustrative history was 
set at a low predictability setting, such that large bias shifts were just 
as likely as small ones. 


STUDYING SECESSIONISM WITH BEITA 


Beita was designed as a state in which secession is 
not ruled out in any region but, in which under typi- 
cal conditions of governance and political relations, in 








FIGURE 3. Beita Identity Display 
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most regions, it does not occur.!® Secessionism among 
identity 10 agents in the southeast region is modeled in 
Beita as the emergence of boundaries within the state 
that separate a sizable region dominated by identity 10 
from the rest of the state—a region featuring substan- 
tial cultural homogeneity in terms of both activated and 
subscribed identities. Borders appear as some cells con- 
taining agents are transformed into small zones of sepa- 
ration that may be considered either as “immutable and 
inactive agents” or as “border cells” (black squares). K 
Such transformations can be considered “secession- 
ist” activity in the sense that the appearance of such 
zones of separation corresponds to the crystallization 
of identitarian and political differences associated with 
regionally based emergent processes of alienation and 
out-group mobilization. 

The results of various factors (such as institutional 
responsiveness, repression, and devolution) on the oc- 
currence of secessionism can then be investigated by 
adjusting those aspects of Beita and comparing the re- 
sults by analyzing large batches of runs under other- 
wise tightly controlled conditions. Apart from studying 
secessionism (how many border cells are produced in 
what regions), the emergence of a sufficient number of 
border cells combined with a sufficiently high level of 
activated homogeneity within the regions dominated 
by the secessionist group can be used as a measure of 
secession itself.18 

The rules governing the circumstances in which indi- 
vidual basic agents become border cells were designed 
to conform to basic and consensual understandings of 
secessionism as they appear in the scholarly literature. 
For although there is precious little agreement on the 
sufficient conditions for producing secessionism, i.e., 
the relative contribution of variables such as regional 
economic standing, degree of ethnic or identity differ- 
ence, irredentist influences, and institutional features, 
there is broad, if usually implicit, consensus on the nec- 
essary conditions for secessionism. These conditions 
are (1) polarization or alienation of the potentially se- 
cessionist identity group from the dominant identity 


16 Indeed in the thousands of histories we have generated, secession 
by groups other than identity 10, or ın regions other than the SE, 
where identity 10 1s concentrated, have almost never been observed. 
17 Technically the cells in the Beita array are “positions” that may 
or may not contain agents of different lands at any given time. Since 
almost every cell in Beita is always inhabited by an agent with par- 
ticular characteristics, we refer to them as “agents.” But since the 
transformation of a basic agent into a “border agent” is actually the 
transformation of an inhabited cell into a cell that is uninhabited 
but rather serves as a small “zone of separation,” we use the phrase 
“border cells” rather than “border agents” to describe the emergence 
of local manifestations of secessionism. 

18 By using the complexion of identity activation (via low tension 
levels within the newly bounded region) as a part of our index 
of secessionism, we do not reject the role of a reorganization or 
transformation of authority structures in the process of secession. 
Indeed, though we have not reported the data, secessions m Beita 
are frequently associated with the emergence of separate authority 
structures. That is, portions of the webs of influential agents previ- 
ously activated on regime identities are captured by the secessionist 
movement and transformed into webs of influentials activated on the 
secessionist identity and separated from the rest of Beita by walls of 
border cells 
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groups in the state; (2) regional concentration of the 
identity on a scale that is not negligible; and (3) some 
significant level of tension or disharmony in the rela- 
tions between individuals expressing the “alienated” 
identity and others in their social environment. Indeed, 
few if any analysts have produced theories of seces- 
sion that do not assume the satisfaction of these three 
conditions—alienation, nontrivial size, and individual- 
level tension with others. 

The conditions under which such behavior occurs 
thus pertain to both macro circumstances of which the 
individual agents are unaware and local realities of 
which they are aware. When for any particular agent 
in any particular time step each of these conditions is 
met, a low but nontrivial probability is created that that 
agent will be transformed into a border cell.!? The rules 
we have implemented to operationalize these condi- 
tions governing the production of border cells can be 
stated, nontechnically, as follows: 

1. Polarization/alienation: Secessionist activity can be 
expected to be unlikely or impossible to the extent 
that members of a potentially secessionist group also 
harbor the identity of the dominant group. Accord- 
ingly, no agent, at any particular time, can transform 
into a border cell if 20% or more of the agents acti- 
vated on that identity at that time have the dominant 
identity within their repertoires. 

2. Size of a qualifying identity: Secession of the lead- 
ing group in a society is excluded from these ex- 
periments insofar as the leading group is considered 
to be the identity activated at any particular time 
by a plurality of agents in the polity. Similarly, se- 
cessionism cannot be produced by a group unless it 
constitutes a substantial proportion of its region. In 
Beita no agent is allowed to transform into a border 
cell unless its activated identity is activated by at 
least 10% of Beita agents.” 

3. Individual action: Some otherwise qualifying agents 
are more likely than others to engage in secessionist 
activity, and those lacking very much contact with 


19 In the expermments reported in this paper the probability of an 
otherwise qualifying basic agent turning into a border cells was 20% 
per time step in which the basic agent remained qualified. This prob- 
ability can easily be adjusted for experimental purposes. Sensitivity 
tests iudicated that adjusting this probability value between 15% and 
25% did not alter the frequency with which border cells appeared. 
The effects on these adjustments on the number of border cells pro- 
duced were linear, relatively small, and in the expected directions 
2 Since the size of countries and the sizes of disaffected populations 
within those countries vary widely, the 10% rule for the minimum size 
of what we refer to as a “subordinate identity” cannot be considered 
absolute. Instead the rule used to calculate the minimum size of a 
regionally disgruntled minority capable of producing secessionism is 
40% of the ratio of the population of the region to the total population 
of the state. Thus, the SE quadrant of Beita represents 25% of the 
entire state. Forty percent of that is 10%, so the minimum size of an 
activated group in Beita capable of producing secessionrem is 10% of 
the size of Beita or 409 agents. Roeder (2003) presents an extensive 
discussion of the statistical issues involved in coding secession for 
comparisons across large and small states. Roeder’s findings, though 
arrived at with different techniques and for somewhat different pur- 
poses, are consistent with our coding rules. We are grateful to Paul 
Brass for posing the question that clarifled our understanding of this 
general point. 
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agents activated on identities other than their own 
can reasonably be expected to be less inclined to take 
the risks of secessionist action than liminal agents, 
exposed to other identities but not harboring those 
identities within their repertoires. Accordingly, no 
agent can transform into a border cell unless half 
or more of the agents it is in direct contact with are 
activated on an identity other than its own activated 
identity. 


With these rules implemented for the operation of 
Beita, we proceeded to investigate the relationship 
among variables of interest with respect to secession- 
ism and institutional schemes of repression, responsive- 
ness, and representation. This was done by comparing 
histories produced under slightly different conditions 
or with slight changes introduced into the polity to op- 
erationalize such institutional strategies, 


SIMULATION EXPERIMENTS 


Several groups of simulation experiments designed to 
explore relationships among institutions, ethnopolit- 
ical mobilization, secessionist activity, and secession 
were conducted. Each series of experiments focused 
on changing aspects of the authority structure in the 
SE region. The first series studied the effects of repres- 
sion, modeled as increasing the range and density of 
the distribution of regime officials in SE. The second 
focused on the effects of increasing the responsiveness 
of the regime officials to the demands of the potentially 
secessionist identity. The third group of experiments 
was aimed at investigating the effect of increasing the 
representativeness of the authority structure, whether 
by increasing the proportion of existing regime officials 
expressing the potentially secessionist identity or by 
expanding the authority structure by adding networks 
of regime officials expressing that identity. 

Each series of experiments entailed creation of dis- 
tinctive versions of Beita—each captured in a different 
“snapshot.” Each snapshot featured a carefully con- 
trolled group of parameter settings and/or changes 
in the complexion of agents in SE. Each version of 
Beita was used to produce 100 separate and unique 
histories—histories run through 508 time steps to time 
508 (t = 508). The uniqueness of each specific history 
was ensured by the application of a distinctive stream 


2 As a manipulation test for our baselme template we ran several 
experiments entailmg the removal of key operationalizations (van- 
able bias values, variable agent influence levels, the impermeable 
boundary surrounding Beita, and rules for the production of border 
cells). We then added these elements and observed the results in 
batches of 100 runs each. The results conformed to our expectations 
of the role each of these operationalmations, both individually and 
in combination, plays in producing the overall effect of Beita as a 
simulated country disposed toward, but not necessarily destined for, 
separatism. The results of these manipulations are not reported here 
but are available from the authors. 

2 The length of each history 1s considered to be 500 tme steps Our 
experience with Beita strongly suggests that only rarely would ex- 
tending the examination of dynamic processes beyond this pomt yield 
statistically different results. The reason observations are conducted 
att = 508, and nott = 500, is that each run includes eight initial time 
steps during which the likelihood of change of a bias value associated 
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of randomly generated “biases” affecting the sequence 
and extent of changes in the relative attractiveness of 
activating on different identities Our analysis com- 
pared the distributions of histories produced by incre- 
mental changes in these variables to the distribution 
of histories produced by the standard Beita landscape. 
Table 1 presents a summary of all of the experimental 
se 

In the first series of experiments, the effect of re- 
pression was studied by extending the size of the state 
authority structure in the SE region by an additional 
25%, 50%, 75%, or 100% of its original size. The ad- 
ditional regime bureaucrats were created by adjusting 
the identity complexions of agents in the SE region 
with existing identity complexions that excluded iden- 
tity 10, activating them on the identity of the regime 
officials in SE and the dominant incumbent identity at 
t = 0 in Beita (5), and endowing them with an influ- 
ence level of 2. The combination of a larger number 
of influentials, arrayed in relatively close proximity to 
one another, activated on the originally activated dom- 
inant regime identity (5), and not including within their 
repertoires the DRM (Disgruntled Regional Minority) 
identity (10), implements “repression” in that it makes 
it much more difficult for agents in the region of this 
web of influentials to activate on identities other than 
the identity activated by the regime bureaucracy. In 
other words, in a region with a repressive bureaucracy 
more agents will be constrained to activate on an iden- 
tity that is not assigned a bias as high as that assigned 
to some other identity in those agents’ repertoires. Re- 
pression thus interferes with the ability of agents to 
do the best they can for themselves with the political 
resources under their control. 

These repression manipulations produced four dis- 
tinct versions of Beita at t = 0, i.e., four different snap- 
shots. Running each of these snapshots, each with a 
larger regime bureaucracy in SE, produced 400 histo- 
ries (with one set of 100 observations att = 508 for each 
of the four renderings of the regime officials in SE). 

In the second series of experiments we studied the 
effects of increasing responsiveness by adding identity 
10 to the repertoires of 25%, 50%, 75%, and 100% 


with any grven identity is very high (50%, as opposed to 0.005% dur- 
ing the history rtself). This has the effect of “scrambling” the biases 
and allowing the history to begin “m media res,” rather than from 
an artificially “calm” position. Other settings that remained constant 
through all manipulations as well as m the standard or baseline Beita 
included a bias range of —2, +2, a moderated jump factor of 10,000 
(meaning that the likelihood of a bias changmg from one value to 
another was equal, regardless of the size of the increment entailed in 
that change), and a sight radius for all agents of 1, meaning that all 
agents could see the activated identities of the eight agents adjacent 
to them and no others. 


B Grven exactly the same parameter settings, exactly the same array 
of agents operating by the same rules, grven the same complexion 
of identities in agent repertoires, and assuming that all agents are 
activated on the same identities, applying any particular stream of 
bias values will always produce exactly the same (unique) history. Ex- 
peruments reported here are based on running “treated” landscapes 
100 times using the same set of 100 randomly generated streams of 
bias values and comparing the distribution of results to the histo- 
ries generated from the standard Beita landscape when those same 
streams of biases were applied. 
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TABLE 1. mental Manipulations 


Condition Manipulation 


Baseline 
Repression 


None 


Extending the presence of 
low- and mkt+-echelon RO 
In the SE by a share of thelr 
orginal size (25%—100%) 


Increasing affillation with DRM 
among Increasing proportions 
(25%—100%) of low-echeton 


Responsiveness 


RO In the SE 


Representativeness 
Power-sharing Increasing affillation with DRM 
among Increasing proportions 
(25%—100%) of low-echelon 
RO In the SE 


Semlautonomous Increasing the presence of 
Instttutions low-echelon DRM In the SE by 
a share of the regime's SE 
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No change compared 4 


Same as power-sharing 


Number of Simulations 
I Complexion Snapshots 
1 100 
100 
to baseline 


Affected agents remaln active 


on the regime's Identity (ID 5) 
and add the DRM (ID 10) 


to their repertoire 


Affected agents activate DRM 


(ID 10) and keep the regime's 

Identity (ID 5) In their 

repertoire 

100 


authortty structure (25%—100%) 
Note. RO, regime official; DRM, disgruntled regional minority; SE, southeast region. x 


of the original set of regime officials in SE. These re- 
sponsiveness operationalizations entailed the use of 
four separate snapshots at t = 0 and the generation 
of 400 histories, yielding four sets of 100 observations 
each. This was an attempt to operationalize a kind of 
regime bureaucracy less repressive in its effects and 
more oriented toward the provision of public goods to 
the inhabitants of SE. Thus, even though activated on 
an identity alien to that of the majority of agents in the 
region, the influentials in SE comprising the regime au- 
thority structure were nevertheless disposed to enable 
that population to realize its potential for activation on 
its own identity under favorable conditions, ie., when 
the bias for identity 10 might be relatively high.“ 

The third series of experiments included simulations 
of (a) power-sharing and (b) the granting of differ- 
ent degrees of autonomy to the potentially secessionist 
identity. Power-sharing was operationalized by increas- 
ing the proportion (25%—100%) of SE regime officials 
activated on the identity of the disgruntled regional 
minority (10). The effect of semiautonomous institu- 
tions was studied by extending the size of the authority 
structure. In this set of experiments the existing au- 
thority structure in SE was unchanged, but increasingly 
substantial webs of officials were added to it. These 
added arrays were comprised of agents with identity 
10 not only in their repertoires, but activated. Each of 
these new officials was also endowed, in its repertoire, 
with identity 5, the identity of the regime officials in SE 
and the dominant incumbent identity at t = 0 in Beita. 
The four levels of manipulation (25%, 50%, 75%, and 
100%) and two types of representativeness produced 
800 histories (eight sets of 100 histories each). 

Three criterion measures were developed to study 
the questions of interest. First, the number of agents ac- 


M We are grateful to one anonymous reviewer for this journal whose 
comments on our operationalizations helped us clarify their intent. 





tivated on identity 10 at t = 508 was used as a measure 
of ethnopolitical mobilization—the extent to which the 
potential for aggregated public expressions of a particu- 
lar collective identity was realized. Second, the amount 
of secessionist activity—the extent to which the neces- 
sary conditions for secessionism were met—measured 
as the number of border cells at t = 508, was used. 
Third, we identified those specific histories that met two 
criteria for outright secession. Histories at t = 508 were 
coded as secession if they included a substantial and 
clearly demarcated zone (or zones) within SE that was 
both (a) heavily dominated by agents activated on iden- 
tity 10 and (b) separated from the central state by coher- 
ent though not necessarily fully closed boundaries of 
border cells. Figure 4 displays an example of secession. 
Figure 5 displays an example from the same group of 
Beita histories that features some secessionist activity 
but is not coded as an instance of secession.” Within the 
sample of 100 baseline histories the three criterion mea- 
sures were significantly intercorrelated (all p < .01). 
The correlation between ethnopolitical mobilization 
and secessionist activity was r(100) = .68, the correla- 
tion between ethnopolitical mobilization and secession 
was r(100) = .43, and the correlation between seces- 
sionist activity and secession was r(100) = .78. 

In the following sections we compare the results from 
specific manipulations to the results obtained from 100 
histories produced with Beita “standard” (our baseline 


D Specifically, a history was treated as secession if two conditions 
were met at f = 508: (1) there were at least 84 border cells in the SE 

t, and (2) the average tension of agents activated on iden- 
tity 10 in the SE quadrant was lees than .80 (which means that, on 
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TABLE 2. Experimental Results 








Dependent Measure 
Amount of Ethnopolitical 
Manipulati Mobllizaton: Amount of Secessionist Frequency of Secession: 
___s manipulation Ss Agents Activated on ID 10 Activity: Border Cells Secessions per 100 
Condition Size at t = 508 (Mean) at t = 508 (Mean) Histories at t = 508 
Basellne 398.71 34.95 18 
Repression 25% 359.67 35.29 15 
50% 324.95 34.44 14 
75% 302.93 30.04 8 
100% 279.26 27.07 1 
Responsiveness 25% 404.33 35.78 17 
50% 416.31 33.71 14 
75% 422.49 33.36 14 
100% 430.18 31.18 13 
Representativeness 
Power-sharing 25% 464.84 37.76 15 
50% 549.76 37.60 9 
75% 693.81 7.24 0 
100% 736.35 0.97 0 
Semlautonomous 25% 488.87 26.10 6 
Institutions 50% 521.75 16.27 1 
75% 567.10 6.04 0 
100% 583.75 1.42 0 


Note: Numbers In boldface are statistically different from the baseline (p < .005). All values are in reference to SE only. 





condition). Table 2 presents our findings for all manip- 
ulations and for the standard baseline condition. Two 
types of tests were used. For the two continuous crite- 
rion measures, ethnopolitical mobilization and seces- 
sionist activity, statistical comparisons were made using 
paired sample t-tests (e.g., comparing the baseline his- 
tories with each set of experimental histories). For the 
dichotomous criterion measure, secession frequency, 
comparisons were made using McNemar’s chi-square 
test for nonparametric distributions. 


Repression 


We first examined repression—the effect of increas- 
ing the size of the regime bureaucracy in SE—and its 
effect on ethnopolitical mobilization, secessionist ac- 
tivity, and secession by identity 10. The findings for 
all of the repression manipulations are summarized 
in the first set of four rows in Table 2. In the column 
labeled “Amount of Ethnopolitical Mobilization” are 
the mean number of agents activated on identity 10 in 
SE at t = 508 for separate sets of 100 Beita histories 
using four different snapshots (with regime bureau- 
cracies in SE that are, respectively, 25%, 50%, 75%, 
and 100% larger than the regime bureaucracy in SE 
within standard Beita). We see that increasing the size 
of the regime bureaucracy progressively and effectively 
teduced public expressions of attachment to the poten- 
tially secessionist minority community as measured by 
activation on identity 10. 

However, the “success” of repression was not so 
clearly demonstrated with regard to our measures of 
secessionist activity and outrightsecession. Examining 


the column in Table 2 labeled “Amount of Secession- 
ist Activity,” we see that increasing repression did not 
significantly reduce the amount of secessionist activ- 
ity within SE compared to the baseline condition, al- 
though the trend was in this direction. As indicated in 
the column labeled “Frequency of Secession,” when the 
threat of secessionism to the central state is measured 
in terms of the frequency of actual secession, sufficient 
repression did reduce its occurrence. This reduction ap- 
proached statistical significance at the 75% level and 
was highly significant when the size of the original bu- 
reaucracy was doubled. 


Responsiveness 


This set of experiments with Beita was designed to in- 
vestigate the impact of institutional changes very dif- 
ferent from the repressive changes described above. 
Specifically, how would changes in the bureaucracy to 
make it more responsive to the potentially secession- 
ist identity affect patterns of mobilization, secession- 
ist activity, and secession in SE? As discussed in the 
introduction, various specific schemes of institutional 
reform are quite commonly recommended to reduce 
the threat of confrontation and secessionist struggles 
in the kind of situations Beita simulates. 

The results of the responsiveness manipulations are 
summarized in the second set of four rows in Table 2. 
With respect to changes in the amount of ethnopo- 
litical mobilization, our findings indicate that increas- 
ing responsiveness modestly but significantly increased 
the prevalence of agents activated on identity 10 in 
SE. The average number of Beita inhabitants publicly 
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expressing identification with identity 10 increased by 
roughly 8% from the baseline condition, when 100% 
of the entire authority structure in SE was responsive. 
While these results may disappoint those who would 
hope to reduce tendencies toward mobilization of com- 
peting ethnic identities by strategies of bureaucratic 
cooptation, they support those who contend that of- 
fering positions of power in the state’s authority struc- 
ture to regionally predominant minority groups is likely 
to result in more robust mobilizations. On the other 
hand, results from our experiments seem contrary to 
the worst fears of those who believe that any conces- 
sions to a potentially secessionist group will lead the 
state down a slippery slope toward secession. Our re- 
sponsiveness manipulations produced only a modest 
effect on the average amount of secessionist activity 
even when the entire network of regime officials was 
made more responsive (see Table 2), and the direc- 
tion of this change was toward diminished rather than 
heightened secessionism. At the same time, our findings 
do not provide support for those who believe that sim- 
ply reforming the governing bureaucracy to be more 
attentive to the concerns of a disaffected minority can 
substantially reduce the likely incidence of secession- 
ist activity. The results obtained for our third criterion 
measure, the frequency of outright secession, paralleled 
those reported for secessionist activity: a statistically 
nonsignificant decline with increasing responsiveness 
compared to the baseline condition. 


gd Hh pee pt Power-Sharing and 
Bullding Semlautonomous Institutions 


Most proposals for mitigating tendencies toward se- 
cessionism and confrontations with regional minori- 
ties seeking national self-determination entail more 
than reforming or reorienting the existing authority 
structure, whether through democratization or affirma- 
tive action programs. Almost all schemes for devolving 
power to potentially secessionist groups require an in- 
crease in the number of power-holders publicly repre- 
senting the identity of the disaffected group. This can 
be achieved in either of two ways: (a) through power- 
sharing by recruiting representatives of the minority 
group for positions within the existing authority struc- 
ture or (b) by building semiautonomous institutions 
designed to provide representation, expression, access, 
resources, and symbolic satisfaction to members of the 
out-group. The claim is that by increasing the represen- 
tativeness and/or scope of institutions exercising power 
over the lives of members of the potentially secessionist 
group, challenges to the regime can be avoided. Eth- 
nopolitical mobilization that otherwise might lead to 
confrontation with the regime would instead be chan- 
neled within boundaries constrained by loyalty to the 
central state and acceptance of the integration of the 
region within its jurisdiction. 

The bottom half of Table 2 presents summary data 
from the experiments designed to explore represen- 
tativeness using the operationalizations described ear- 
lier for the simulation of power-sharing and semiau- 
tonomous institutions. The “sharing” of power and the 
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“semi” autonomous aspect of these institutions is ex- 
pressed in the fact that all officials, whether or not ac- 
tivated on identity 10, have within their repertoires at 
the beginning of these histories identity 5—the dom- 
inant governing identity most prominently associated 
with the central state. The picture here is of a small au- 
thority structure, formerly domineering and rigid, that 
becomes not only sensitive to, but representative of and 
controlled by, agents identifying with the disaffected 
group. Examining first the impact on the amount of 
ethnopolitical mobilization, we found that both power- 
sharing and the building of semiautonomous institu- 
tions produced significant increases in mobilization of 
the DRM (10)}—even with only relatively small in- 
creases in the representativeness of the existing regime 
officials (i.e., the 25% and 50% conditions). In the 75% 
and 100% conditions, the resulting levels of public iden- 
tification with identity 10 were especially high. Overall, 
as the degree of representativeness increased, the im- 
pact of power-sharing became more and more potent 
in comparison to the parallel effects of building semi- 
autonomous institutions. 

Despite these increases in ethnopolitical mobiliza- 
tion, we find striking evidence of the constraining ef- 
fect of power-sharing or building semiautonomous in- 
stitutions on our second criterion measure, secessionist 
activity. When agents expressing DRM (10) were giyen 
real opportunities to exercise public power, secession- 
ist activity was significantly reduced even in the midst 
of robust increases in size of the population actively 
associating themselves with that identity. Interestingly, 
under conditions of “tokenism,” when only 50% or 
fewer of existing officials display identity 10, power- 
sharing did not diminish secessionist activity. However, 
a significant decline in secessionist activity did appear 
under power-sharing arrangements when a clear major- 
ity of the regime officials in SE began Beita histories by 
publicly expressing their attachment to identity 10. In 
contrast, the building of semi autonomous institutions 
effectively reduced secessionist activity even with only 
limited devolution of poilitical power. 

The data describing the frequency of secession itself 
mostly recapitulate the results reported regarding the 
reduction of secessionist activity associated with power- 
sharing and building semiautonomous institutions. It is 
worth emphasizing the support that these findings seem 
to offer to the argument that opening up very substan- 
tial opportunities for participation in local governance, 
while tending to increase rates of ethnopolitical mo- 
bilization, can greatly reduce secessionist activity and 
make secession itself extremely unlikely. 


DISCUSSION AND CONCLUSION 


Our overall results help explain why scholarship in this 
area has produced many seemingly conflicting claims. 
Even when, as in the experiments reported here, key 
variables (such as international involvement, relative 
economic position of the potentially secessionist re- 
gion, and distinctiveness of group identity) are held 
constant, the data often suggest nonlinear patterns. On 
the other hand, our findings also encourage support for 
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the analyses of researchers such as Hechter and Kohli. 
These scholars discern trade-offs that may be expected 
in choosing between repressive policies and policies 
that emphasize responsiveness and representation. The 
former are likely to produce short-term quiescence and 
longer term threats of secessionism and secession, while 
the latter are likely to lead to broader and noisier mobi- 
lizations but substantially lower threats of secessionism 
and, particularly, of secession. In this regard, it is worth 
reviewing the results of these experiments in terms of 
the three outcome variables of interest: extent of eth- 
nopolitical mobilization, amount of secessionism, and 
frequency of secession. 


Ethnopolltical Mobilization 


Analysts who have argued or implied that repression, 
domination, or control can work (Adam 1971; Fearon 
and Laitin 2003; Lustick 1980) receive strong support 
insofar as limiting the amount of ethnopolitical mo- 
bilization is concerned. Indeed, rates of mobilization 
decrease steadily as the size of the control apparatus 
increases. Relatedly, the wider mobilizations produced 
by increasing either the representativeness of the ex- 
isting regime bureaucracy or the size of a representa- 
tive gemiautonomous bureaucratic apparatus support 
at least parts of the arguments of scholars who warn of 
the energizing effects of democratization for patterns 
of nationalist or ethnopolitical mobilization (Snyder 
2000). 

To be sure, our model does not distinguish between 
violent and nonviolent forms of mobilization. If one 
imagines this mobilization as violent, then regardless 
of the longer-term consequences, regimes may find it 
advisable to engage in repressive or control policies, 
avoiding responsiveness or representativeness in the 
governance of a regionally concentrated, potentially 
secessionist minority, if only to avoid violent confronta- 
tions. However, if one imagines that secessionist strug- 
gles, per se, are more likely to be violent than others, 
then to judge the attractiveness or necessity of such 
policies a regime would also have to consider their ef- 
fect on the impetus given among those mobilized to 
engage in secessionist activity. 


Secesslonist Activity 


Repression, i.e., strengthening a rigid and unrespon- 
sive regime bureaucracy, may decrease mobilization by 
the latently secessionist identity, but it does not sig- 
nificantly decrease (nor increase) secessionist activity 
within that community. There is no strong rt in 
our experiments, in other words, for a direct relation- 
ship between increased repression and the robustness 
of secessionist movements. Similarly, increasing the re- 
sponsiveness of the authority structure did not substan- 
tially alter secessionist activity. 

In contrast, increasing representativeness encour- 
aged public participation by the potentially secession- 
ist, regionally concentrated minority and did in fact 
decrease the secessionist activity. Particularly strong 
effects in this direction followed the creation of semiau- 
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tonomous governing structures. Low levels of power- 
sharing, that is, small or moderate increases in the rep- 
resentativeness of the existing bureaucracy, have no 
significant effect on the amount of secessionist activ- 
ity. But with the regime bureaucracy in the less well- 
integrated region fully or nearly fully staffed by rep- 
resentatives of the latently secessionist minority, we 
registered significant decreases in secessionist activ- 
ity. Of particular interest, when the regime maintained 
its existing bureaucratic array and then added even a 
small-sized bureaucracy staffed by representatives of 
the regionally dominant identity, secessionist activity 
by this group decreased quite sharply. 

These findings lend support to the body of scholar- 
ship referred to above that has argued in favor of self- 
administration, limited autonomy, or other schemes 
to create settings for the exercise of public power by 
representatives of potentially secessionist groups as a 
means of preventing the development of secessionist 
struggles. Ready to accept higher levels of ethnopoliti- 
cal mobilization, these authors imagine that with semi- 
autonomous institutions to channel that mobilization, 
secessionist struggles can be averted. The mechanisms 
that are understood to produce this combination of ef- 
fects for these authors vary and are not always clearly 
specified. An advantage of the agent-based simulation 
approach is that we can understand how this seemingly 
odd combination of effects arises from the simple as- 
sumptions made about the conditions widely believed 
to contribute to secessionism. 

As the proportion of the population that publicly 
identifies with identity 10 in SE increases, larger num- 
bers of those who come to activate on that identity also 
have within their repertoires the identities they were 
previously activated on, including the original govern- 
ing identity 5 or other identities that may become dom- 
inant in Beita. As these agents are integrated into the 
ethnopolitically mobilized group, a larger proportion of 
the entire group comes to contain within their individ- 
ual repertoires the dominant identity in Beita—usually 
identity 5. This reduces the likelihood of secessionist 
activity. Moreover, as identity 10 becomes the prevail- 
ing activated identity in SE, the number of agents pub- 
licly expressing this identity, and who find themselves 
in neighborhoods inhabited predominantly by other 
agents also activated on identity 10, increases. With 
lower tension levels secessionist activity is also likely to 
decrease. Representative institutions, even if not fully 
autonomous, thus seem to inhibit secessionism. They 
do so by reducing the overall amount of alienation or 
polarization between regime-dominant and regionally 
dominant identity groups and by reducing the likeli- 
hood that individuals identifying with the regionally 
dominant group will be in tension with the majority of 
those with whom they have direct contact. 


Secession 
Increasing the size of a repressive bureaucracy did sig- 
nificantly reduce the frequency of secession itself, but 


only when the regime was willing to invest the re- 
sources necessary to double (or nearly so) the size of 
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the original bureaucracy in the SE region. Although in- 

ing the responsiveness of the existing bureaucracy 
did not decrease the frequency of secession, significant 
decreases were achieved by increasing power-sharing 
or by erecting semiautonomous institutions that com- 
plemented the existing regime identity-dominated bu- 
reaucracy. 

There are two reasons why—from the point of view 
of the regime—sharing power or building semiau- 
tonomous institutions for the regionally concentrated 
and disaffected minority group may be a more at- 
tractive approach than repression. First, significant de- 
creases in secession frequency were achieved with rel- 
atively smaller-scale increases in representativeness 
compared to the scale of increases in the repression nec- 
essary to produce comparable decreases in secession 
frequency (See Table 2). Second, it may reasonably be 
assumed that under conditions of both power-sharing 
and creation of a subsidiary bureaucracy fully staffed 
by agents activated on identity 10, revenues from taxa- 
tion would be more dependable and more robust than 
under the repression condition. On the other side of 
the political ledger, of course, may be the regime’s 
need to accept, in the absence of greatly increased re- 
pression, higher levels of ethnopolitical mobilization 
by the potentially disaffected minority—increases that 
are indeed associated with maintaining a more rep- 
resentative bureaucracy. It is striking, however, that 
even a relatively small investment in the development 
of minority self-governance through semiautonomous 
institutions reduced the probability that even the larger 
mobilizations associated with this condition resulted in 
a break from the state. 

Looking across all of our conditions and dependent 
measures it appears that the broadest and most com- 
pelling finding emerging from our data is that explana- 
tions for variation in amounts of ethnopolitical mobi- 
lization, even by members of communities that seem 
primed for secessionism, cannot be expected to cor- 
respond to explanations for patterns in the variation 
of amounts of secessionism or outright secession. This 
result not only helps explain the difficulty scholars have 
had making sense of cumulative findings, but also jus- 
tifies the search for techniques of investigation permit- 
ting more precision and control in the examination of 
the effects of specific variables on patterns of outcomes. 
In part it was the search for such techniques, by schol- 
ars who emphasize process-tracing in small-N studies 
and by those who search for their insights through the 
construction and refinement of large databases, that 
inspired our effort to deploy agent-based computer 
simulation modeling to this problem. 

Brief consideration of several real-world cases in 
terms of our findings illustrates what may be our most 
important conclusion—that noncoercive policies avail- 
able for the reduction of secessionism and secession 
may work only at the “cost” of a state accepting a sig- 
nificantly larger role in the public political arena for 
political expressions of historically “out-group” identi- 
ties. For what Rogers Brubaker has called “nationaliz- 
ing states,” featuring regimes dedicated to the exclusive 
expression of a particular identity and to serving the in- 
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terests of those attached to it, this cost may be perceived 
as too high (Brubaker 1996). Such states will be rather 
likely to face a choice between extended and extensive 
regimes of repression or the emergence of robust and 
possibly successful secessionist movements. 

Canada’s extraordinarily generous response to the 
potent threat of Quebecois separatism included chang- 
ing the country’s flag and transforming its political cul- 
ture from an “Anglo” state to a truly binational state. 
The extent to which French-speaking and Quebecois- 
identifying inhabitants of Quebec were granted the 
kind of semiautononous and representative institutions 
whose effects we have sought to investigate is reflected 
in the palpable discomfort many English speakers have 
felt in Montreal and other cities and towns in Quebec 
and the significant out-migration of these Canadians 
from the region. On the other hand, another effect of 
this real policy change by the Canadian regime has 
been to create conditions in which Quebecois sepa- 
ratists have repeatedly failed to gain majority support 
for referenda on secession itself. 

In July 2003 the French government sponsored a 
referendum in Corsica on a new statute that would 
significantly expand the power and prestige of semiau- 
tonomous political institutions in Corsica. This measure 
was explicitly justified as a means to combat Corsican 
separatism, which had produced chronic violence and 
protest against French rule of the island. Adding drama 
to the referendum was a French police announce- 
ment days before of the arrest of Ivan Colonna, a 
Corsican separatist leader charged with assassinating 
a French official. The referendum failed by a narrow 
margin. Though the overwhelming majority of non- 
Corsican French on the island opposed it, the statute 
had been supported by many mainstream Corsican na- 
tionalists and by the French government. Observers 
subsequently suggested that the timing of the arrest of 
the separatist leader, reflecting a successful exercise in 
“repression,” had shifted what would have been many 
“Yes” votes among Corsican nationalists into the “No” 
column. 

This episode would appear to illustrate several of 
our findings with Beita. First, the French government 
does appear to have concluded that strengthening rep- 
resentative and semiautonomous institutions would be 
a more attractive and expedient way to reduce seces- 
sionist pressures in Corsica than continued repression. 
Support of most Corsican nationalists for the referen- 
dum, at least prior to Colonna’s arrest, would seem to 
indicate their preference for a regime within which their 
identity would be expressed more vigorously and their 
interests honored more substantially than a concerted 
and direct struggle to separate from France. The strong 
opposition of the “French” inhabitants of Corsica is 
also understandable as an expression of their realiza- 
tion that the implication of combating secessionism via 
autonomy and Corsican representativeness rather than 
a centralized and repressive order enforced from Paris 
would be an unacceptable “indigenization” of the is- 
land’s political culture and public space. 

Finally, consider the case of India in its E 
ship with Kashmir—its only, Muslim majority state. 
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‘Aside from relatively meaningless symbolic frame- 
works adopted in the late 1940s, India has generally 
refused to recognize the ethnopolitical distinctiveness 
of Kashmir, where a nationalist movement with strong 
secessionist overtones has been active for 50 years de- 
manded, with varying degrees of intensity and violence. 
Elsewhere within India’s borders the state has empha- 
sized a federal structure that acknowledges and gives 
expression to a host of regional cultural and ethnopo- 
litical or linguistic variants. But whether dominated by 
the secularist Congress Party or, more recently, by the 
Hindutva-oriented BJP, the regime in New Delhi has 
always firmly opposed Kashmiri demands and empha- 
sized rigorous repression and direct rule of the territory 
by non-Kashmiris. 

Our finding that under such conditions repression 
can, if rigorous enough, prevent robust public ethnop- 
olitical mobilizations, but only at great cost and with- 
out eliminating the threat of secessionism, is well il- 
lustrated in this case. Although support for separatists 
from Pakistan and other external sources is clearly sig- 
nificant and lies outside the terms of the experiments 
reported here with Beita, the combination of Kashmiri 
exclusion from the public arena, continued high levels 
of separatist violence, and high levels of costly regime 
repression is quite consistent with our findings regard- 
ing the trade-offs likely to prevail between repression 
and representative, semiautonomous or responsive in- 
stitutions, as well as between preservation of the ex- 
clusiveness of a public political space and the stabi-, 
lizing consequences of increasing the Tesponsiveness, 
autonomy, and/or representativeness of governing in- 
stitutions. s 

As there are limits to other approaches, so too are 
there limits to agent-based modeling simulation in gen- 
eral, and PS-I in particular. We as yet have no reliable 
method for portraying a simulation of Beita of a par- 
ticular length as corresponding to a particular length 
of time in a country comparable to Beita in the real 
world. We are in the process of integrating other key 
variables of interest to scholars in this area into Beita 
and our study of its dynamics, but as-we do so we run 
the risk of creating, in our virtual space, the intractably 
entangled and complex causal relationships that make 
it difficult to isolate effects in the real world. The vari- 
able appearance of violence as an aspect of regime 
repression or of ethnopolitical mobilization and of se- 
cessionist struggles is of great interest but is not readily 
captured in a computational model such as Beita, based 
as it is on a cellular automata design. It is our hope, 
however, that having demonstrated the usefulness of 
a user-friendly platform such as PS-I, researchers will 
add this technique to their methodological quiver, 
whether as an “idea pump” or as a method for ex- 
ploring the logic of their arguments independent of the 
accidents of available data and confounding circum- 
stances that may be associated with particular cases or 
databases. 

Finally, having considered the results of our inves- 
tigations with Beita, having interpreted them as in- 
terventions in the scholarly debate over relationships 
among ethnopolitical mobilization, secessionism, and 
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various institutional responses, and having illustrated 
their application in a few real world cases, it is worth 
returning to the issue of “empirical validity.” From 
a Lakatosian (1970) perspective in which science ad- 
vances via the competition of technically incommen- 
surable research programs, the real test of a scientific 
enterprise is whether the results of its work articulate 
with the results of work done in other programs to form 
an interesting and progressive conversation within the 

>~gscholarly community. In this context we take satisfac- 
tion in the extent to which the findings drawn from 
our experiments with Beita yield nuanced answers to 
questions being asked on the cutting edge of the field, 
questions of great interest to researchers using radically 
different techniques, and questions politicians must an- 
swer to alleviate the real suffering that ignorance of 
processes associated with ethnopolitical conflict helps 
produce. 


APPENDIX: BEITA TECHNICAL DETAILS, 
MODEL SPECIFICATION, AND NOTATION 


This appendix presents the BEITA simulation environment 
in greater detail. It does not provide general information 
about the PS-I toolkit, nor does it present technical details 
regarding all of its simulation capabilities. We wish to em- 
phasize, however, that the templates, scripts, and software 
necessary to replicate our findings exactly are available to 
all researchers at the URL listed in the Acknowledgments. 
At least as important, however, is that researchers who may 
wish to explore the robustness of our findings to variation 
in particular parameter settings or coding rules may do so 
quite straightforwardly and systematically by adjusting those 
settings in the templates (snapshots) provided and observing 
the consequences when those adjusted templates are used to 
generate batches of 100 histories. 

The Appendix is divided into three sections. The first sec- 
tion addresses issues related to the definition of agents in 
BEITA and provides more details regarding the distribution 
of agents of different types in the landscape. In the second 
section we present the rules for repertoire and identity trans- 
formation in BEITA. The third section discusses the condi- 
tions for the emergence of border cells. 


Agents In BEITA 


BEITA is designed to approximate the characteristics of a 
multicultural, multiethnic, multiregional state, The landscape 
is a square grid (66 x 66) and contains 4,356 cells. At the be- 
ginning of each Beita history 4,096 of these cells are inhabited 
by agents. The remaining 260 “border cells” form an external 
and unchanging boundary along its four straight edges. (see 
Figure 1). 

BEITA contains agents from different types or agent 
classes. Agents belonging to different classes can be distin- 
guished visually by different icons. Agent behavior is af- 
fected by differences they may have in four parameters: (a) 
influence—the effect of one agent on other adjacent agents; 
(b) sight radius—the size of the agent’s “social” neighborhood 
or the number of agents in its proximity that it surveys before 


26 Such information has been provided elsewhere. See Dergachev 
2003 and Lustick 2002. 
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TABLE A1. A 
ht 


t Classes In BEITA 


Graphic Icon 


Sig 
Influence Radius Inactive Immutable (Figure 1) 


Basic agent 1 1 
Influential 2 


Influential 4 


2 
Influential 3 3 
4 
Border cell 0 


1 
1 
1 
0 


implementing rules governing identity activation and reper- 
toire composition; (c) imacttve—whether an agent affects 
other agents at all; and (d) immutable—whether an agent 
can change its activated identity, i.e., whether it can be aff- 
ected at all. Combinations of values on these parameters pro- 
duce the five agent classes in BEITA (Table A1). Note that 
only a few of the many agent classes producible in PS-I are 
included in Beita. 

In addition to the parameters discussed above, each agent 
in BEITA has its own unique location and a particular array 
of identities, with one identity activated at any time. Each 
agent’s identity repertoire is a subset of the total number of 
identities available in BEITA. (There is a total array of 20 
identities present in Beita—its “spectrum.”) But at any par- 
ticular time step, only the identities within an agent’s identity 
repertoire are available to that agent to be rotated into acti- 
vation on any particular time step. Identities in the spectrum 
are each assigned a number and a corresponding color. Each 
agent’s activated identity is viewable via this color coding (see 
Figure 1). Table A2 offers a summary of authority structures 
and patterns of activation for agents in BEITA (Baseline). 






Kentities associated with one of the other regions. 





TABLE A2. Summary of Authority Structures and Activation Patterns In BEITA (Baseline) 


Reglon Agent Class Repertolre Avg. Repertoire Sizə Influence Number 
NW High-echelon RO {[Reglme]} 3.00 4 4 
Mid-echelon RO {[Reglme], regional, parochial} 5.00 3 6 
Low-echelon RO {[Reglme], regional, parochial} 5.02 2 81 
RA {[Reglme], regional} 5.06 1 490 
RA {[Reglme}, other} 6.00 1 52 
PA {Regime, [other]} 6.02 1 391 
NE & SW High-echelon RO {[Regime], regional} 3.50 4 8 
Mid-echelon RO {[Regime], regional, parochlal} 5.00 3 10 
Low-echelon RO {[Reglme], regional, parochial} 6.00 2 135 
RA {[Reglme], regional, parochial} 6.40 1 622 
High-echelon k {Regime, [reglonal]} 4.00 4 1 
Mid-echelon Reg {Reglme, [regional], parochial} 4.60 3 5 
Low-echelon RegO {Reglme, [reglonal], parochlal} 6.00 2 33 
RegA {Reglme, [regional], parochial} 6.47 1 831 
RegA {[Regional], parochial} 6.00 1 43 
PA {Reglme, regional, [parochlal]} 6.00 1 403 
SE High-echelon RO {[Reglme]} 3 4 1 
Mid-echelon RO {[Reglme]} 3 3 3 
Low-echelon RO {[Regime]} 3 2 54 
RA {[Reglme], 16, 10, parochial} 5.61 1 113 
Strong DRM Influential = {[10], parochial} 3 4 2 
Mid DRM Influential {[10], parochlal} 3 3 1 
Low DRM Influential {[10], parochlal} 3 2 1 
DRM {Regime, [10], parochlal} 4.67 1 93 
FRM {Regime, [16], parochlal} 4.37 1 129 
PA Regime, 10, 18, 5.38 1 627 


Note: Brackets, active identity; curly braces, Identthes m repertoire; RO, regime official; RA, 
PA, agent activated on one of the parochial identities; RegA, agent actrvated on one of the regional identities; 
DRM, disgruntled regional minortty, FRM, favored regional mmomity; other, In the NW region some agents are 


False False 
False False 
False False 
False False 
True True 


None 
Cross 
Splral 
Circle 
Black color 


Repertolre and Identity Transformation 
In BEITA 


Once BEITA begins to move forward in time agents begin 
to interact, guided by a set of simple, but precise algorithms 
that govern their adaptive responses to information avail- 
able from their local neighborhoods and from bias signals. 
These rules are designed to operationalize the main tenets of 
constructivist identity theory and to remain consistent with 
social psychological analyses of individual and group behav- 
ior, including social identity theory and self-categorization 
theory (Tajfel and Turner 1986; Turner et al. 1987; Turner and 
Reynolds 2001). Under specified conditions the evaluation of 
the information may result in changing the agent’s activation 
or the composition of an agent's identity repertoire. 

In general, agents act in a boundedly rational fashion, 
adapting to their world by monitoring biases and align- 
ing their activated identities with the identity most promi- 
nently activated in their neighborhoods, For every mutable 
agent, the algorithms controlling these calculations take into 
account identities available within the agent’s repertoire, 





agent activated on one of the regime identites; 
O, regional officials; 
ubscribed to 
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identities activated within its neighborhood, its currently ac- 
tivated identity, biases currently assigned to different identi- 
ties, and the influence levels of agents in its neighborhood, 
including itself 

Each mutable agent updates its activation on every other 
time step. On each updating time step each agent counts all 
activated identities in its neighborhood (including its own 
identity), taking into account each agent’s influence, as well 
as the biases assigned to the identities. This operation returns 
an “identity weight,” or “count,” for each activated identity 
in an agent’s neighborhood. 

count, = Xe Influence[ A] + bias,. 
Ferera 

In addition to calculating the relative weight of each iden- 
tity in its neighborhood, agents must also identify candidate 
identities that could be discarded, rotated, swappedout, or ac- 
quired. 

A discard candidate is the identity with the smallest count 
in an agent’s repertoire (including the active identity): 

discard_candidate = argmin count,. 
{G@rperters 


A swapout candidate is the identity with the smallest count 
among the agents’ nonactive identities (i.e., all excluding the 
active identity): 
swapout_candidate = argmin count,. 
lernpariest 
rysectrrated 
A rotate candidate is the identity with the highest count 
among all the subscribed identities: 
rotate_candidate = argmax count,. 
1ewpartere 


Finally, an acquire candidate must not be part of the sub- 
scribed set and is the one with the highest count in the identity 
spectrum: 

acquire_candidate = argmax count,. 
1 frapar tere 


As time progresses the repertoire of active agents evolves 
according to the following rules. In this example the triggering 
thresholds for each operation are those assigned in BEITA 
to basic agents. 


1. If the count of the activated identity equals or is larger than 
the count for any other identity, the agent’s repertoire and 
activated identity stay the same. 

2. Lf the count for the rotate candidate is larger than or equals 
two, the agent activates on the rotate_candidate. 

3. If a discard_candidate and an acquire_candidate have been 
defined and the count for the acquire_candidate is equal 
to or larger than seven, the agent discards the candidate 
identity and activates the acquired identity. 

4. If both swapout and acquire identities have been de- 
fined, and the count of the acquire identity is equal to 
or larger than five, the acquire_candidate replaces the 
swapout candidate in the agent’s repertoire. 


new_repertoire = 

COUN greed > MAX count,, old repertoire. 

COUN rosats candace 21 activate(rotate_candidate). 

COUN ncpre _cancidiase > 7, activate(acquire.candidate), 
unsubscribe(discard_candidate). 

COUN repent casd = 5, subscribe(acquire_candidate), 
unsubscribe(swapout_candidate). 

otherwise old_repertotre. 
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Agent Transformation In BEITA 


In PS-I one can set conditions governing the transformation 
of an agent from one agent class to another. In BEITA we 
implemented a set of conditions that would allow the trans- 
formation of basic-agents into border cells. To be eligible for 
border transformation (bt) an agent must meet a set of three 
conditions. 

First, the activated identity must be defined as an oppo- 
sitional identity (OI). Oppositional identities are a subset of 
identities that are not the dominant identity (DI). DI is the 
identity, at any particular time, with a plurality of Beita agents 
activated on it. It is computed from the distribution of acti- 
vated identities: 


DI = argmax activated, . 


To qualify as an OJ, no more than 20% of the agents 
activated on that identity can have the DI identity in their 
repertoires: 


on 1 opposition_count, > 0.8 - activated, 
‘10 otherwise. 


Second, the activated identity agent must be defined as a 
subordinated identity (SI). This condition is met in any time 
step by identities activated by at least 10% of the agents in 
BEITA, excluding the DI. 


1 
st =| 
0 


Third, no individual agent meeting the first and second 
conditions can transform into a border cell unless it is expe- 
riencing a high level of tension (a value of at least three). 
Tension is calculated for each agent by summing the num- 
ber of agents in its neighborhood with an activated identity 
different from its own. 


tension[_A] = > 1. 


acirotek Ny = actrrated A) 


activated, > 0.1 - active_agent, 
otherwise. 


As indicated before, all three conditions (OJ, SJ, and ten- 
sion) must be met before an agent becomes eligible to trans- 
form into a border cell. 


eligible for_bt[A] = Slemantf) N Olara N (tension[A] > 3). 


During an updating time step an eligible agent has a 20% 
probability of actually transforming into a border cell. 
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ecent work across a wide range of issues in political economy as well as in American, comparative, 

and international politics tries to explain the inefficient use of power—revolutions, civil wars, high 

levels of public debt, international conflict, and costly policy insulation—in terms of commitment 
problems. This paper shows that a common mechanism is at work in a number of these diverse studies. 
This common mechanism provides a more general formulation of a type of commitment problem that 
can arise in many different substantive settings. The present analysis then formalizes this mechanism 
as an “inefficiency condition” that ensures that all of the equilibria of a stochastic game are inefficient. 
This condition has a natural substantive interpretation: Large, rapid changes in the actors’ relative power 
(measured in terms of their minmax payoffs) may cause inefficiency. 


ivil wars, revolutions, litigation, strikes, eco- 

nomic sanctions, international conflict, and the 

use of power in general pose an inefficiency puz- 
zle. Suppose that a group of actors is bargaining about 
how to resolve an issue or, more abstractly, about how 
to divide a “pie.” One or more of them can affect the 
outcome and possibly even impose a division through 
the use of some form of power—be it military, eco- 
nomic, legal, or more broadly political. The exercise 
of power, however, consumes resources, and, conse- 
quently, the pie to be divided among the bargainers 
before anyone tries to impose a settlement is larger 
than it will be afterward. As a result, there usually are 
divisions of the larger pie that would have given each 
bargainer more than it will obtain from an imposed 
settlement. The use of power, in other words, leads to 
Pareto inefficient outcomes. Why, then, do the bargain- 
ers sometimes fail to reach a Pareto superior agreement 
prior to the explicit use of power? 

A standard explanation of inefficiency appeals to 
asymmetric information.! Indeed, recent formal work 
in international relations theory on the causes of war 
frames the problem in terms of efficiency and focuses 
almost entirely on informational asymmetries* But a 
growing body of work across a wide range of issues 
in political economy as well as in American, compara- 
tive, and international politics explains inefficiency in 
terms of commitment problems that can arise even if 
the bargainers have complete information. The issue 
here is that bargainers are sometimes unable to com- 
mit themselves to following through on an agreement 
and have incentives to renege on it. These incentives 
may undermine the efficient outcomes. When they do, 
complete-information bargaining breaks down in the 
inefficient use of power. 


Robert Powell is Robson Professor, Department of Political Sci- 
ence, 210 Barrows Hall, 1950, University of California, Berkeley, 
CA 94720-1950 (RPowell@Socrates Berkeley edu) 

I appreciate helpful comments and criticisms from Daron 
Acemoglu, Jonathan Bendor, Rui de Figueiredo, James Fearon, 
Sven Feldman, Jeffry Frieden, Kira Markiewicz, Dan Reiter, James 
Robinson, Gerard Roland, Chns Shannon, Timothy Simcoe, Leo 
Simon, and Barry Weingast. I also gratefully acknowledge the sup- 
port of the Natonal Science Foundation (SES-0315037) 

See Ausubel, Cramton, and Deneckere 2002 for a recent review. 
2 Fearon (1995) discusses this approach and Powell (2002) surveys rt. 


For example, Acemoglu and Robinson (2000, 2001) 
link democratic transitions, costly coups, and revolu- 
tions to the inability of the faction in power to commit 
to future redistribution policies, Fearon (1998, 2003) 
shows that the inability of a central government to com- 
mit to honoring a power-sharing agreement can lead 
to prolonged civil wars. In Alesina and Tabellini 1990 
and Persson and Svensson 1989, political parties cre- 
ate inefficient levels of public debt because the parties 
cannot commit to future spending levels. Democratic 
decision-making in Besley and Coate 1998 may lead to 
inefficient outcomes if the current decisions of those in 
power affect the identity or preferences of future deci- 
sion makers. Political parties in de Figueiredo 2002 may 
impose inefficient administrative procedures to protect 
their programs from their political opponents because 
competing parties cannot commit to refraining from 
overturning each other’s policies when they take of- 
fice. And Fearon (1995, 404-8) and Powell (1999, 128- 
32) demonstrate that a rapidly shifting distribution of 
military power combined with the states’ inability to 
commit to an agreement can lead to war. 

Despite the importance of commitment problems, 
we lack many general results about the basic mecha- 
nisms through which actors’ inability to commit leads 
to inefficient outcomes. The results we do have typically 
focus on specific models, as in the examples above, 
or the analysis demonstrates the existence of ineffi- 
cient equilibria in settings where there are Pareto supe- 
rior, efficient equilibria.” Absent a compelling theory 
of equilibrium selection, inefficient equilibria that are 
dominated by efficient ones provide at best a weak ex- 
planation of inefficiency. 

This paper shows that a common mechanism is at 
work in a number of the diverse studies cited above. 


3 For example, the inefficient equilibria in infinitely repeated games 
are Pareto dominated by efficent ones if the players are sufficiently 
patient. Similarly, the inefficient equilibria are Pareto dominated by 
efficient ones in models of strikes (e.g, Fernandez and Glazer 1991) 
or, more generally, in bargaining models in which the bargainers can 

costs on each other between offers (¢.g., Busch and Wen 1995 
and Muthoo 1999). Analogous results obtain in bargaming games in 
which the players can renege on or retract an accepted offer (Muthoo 
1990, 1999, 194-200) By contrast, Anderlini and Felli (2001) and 
Merio and Wilson (1995) do obtain inefficient, Pareto undominated 
equilibna. 
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This common mechanism affords a more general 
formulation of a type of commitment problem that can 
arise in many different substantive settings. The present 
analysis then formalizes this mechanism as an “ineffi- 
ciency condition” that ensures that all of the equilibria 
of a complete-information stochastic game are ineffi- 
cient. This condition has a natural substantive interpre- 
tation: Large, rapid changes in the bargainers’ relative 
power cause inefficiency. More precisely, the equilibria 
must be inefficient even with complete information if 
at some time (along any efficient path) the expected 
per-period shift in at least one of the actors’ minmax 
payoffs is larger than the bargaining surplus. 

The next section briefly reviews Acemoglu and 
Robinson 2000, 2001, de Figueiredo 2002, Fearon 2003, 
and Powell 1999. This review shows that, broadly speak- 
ing, the central problem confronting the actors in these 
models is deciding how to divide a flow of pies in a 
substantive setting in which (i) the actors cannot com- 
mit to how they will divide the pies in future periods, 
and (ii) the payoffs the actors can lock in through the 
inefficient use of power varies over time.* The review 
also characterizes the common commitment problem 
that can arise in this situation. The subsequent section 
formalizes this commitment problem as an inefficiency 
condition for stochastic games. 


A COMMON COMMITMENT PROBLEM 


Complete-information bargaining breaks down in 
costly coups in Acemoglu and Robinson 2000, 2001, 
in secessionist civil wars in Fearon 2003, in inefficient 
policy insulation in de Figueiredo 2002, and in war in 
Fearon 1995 and Powell 1999 for the same basic reason. 
Resource constraints, the inability to commit to future 
transfers, and a rapidly shifting strategic environment 
create a situation in which every efficient path is dy- 
namically inconsistent. 

In Acemoglu and Robinson’s (2000, 2001) analysis 
of political transitions, a rich elite and a poor majority 
vie for political control of the state and the benefits that 
such control brings. One of these factions is in power at 
the start of any period, and times are either “normal” 
or “bad” (e.g., there is a severe economic downturn), 
with probabilities 1 —s and s. Whether times are nor- 
mal or bad is revealed at the start of each period and is 
common knowledge. 

When the poor are in power, they move first by set- 
ting the tax rate for that period. The rich can then accept 
this policy or initiate a coup. Accepting ends the period 
with the agreed tax policy. Launching a coup (in bad 
times) brings the elite to power but is also inefficient, 


4 This formulation contrasts with Merlo and Wilson (1995). In their 
game, the bargainers negotiate about dividing a pie the size of which 
varies stochastically. If, as they suggest, each period’s pie represents 
the present value of the expected flow of benefits, then thear model 
can be interpreted as one in which the bargainers are dividing a flow 
of pies and can commit to agreements about how they will divide 
the future flow. But the unique subgame perfect equibbnum of their 
game is always efficient if there are two players and transferable 
utility. The mechanism highlighted below can produce ineffiaency 
even ın these circumstances 
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as it destroys a fraction of the economic income from 
that period." The elite then sets the economic policy for 
that period and begins the next period in power. 

When the rich are in power at the start of a period, 
they decide the tax rate and whether or not they want 
to extend the franchise to the poor. (Since the me- 
dian voter is assumed to be poor, this is equivalent to 
turning power over to the poor.) If the rich relinquish 
power, the poor take over, set policy for that period, 
and start the next period in power. If the rich retain 
power, the poor can accept the tax rate or launch a 
costly revolution, which, again, destroys a fraction of 
that period’s income. Accepting ends the period with 
the agreed policy in place and the elite in power at the 
start of the next period. Launching a revolution effec- 
tively ends the game, with the poor assuming power, 
the rich losing everything, and the threat of any future 
coup eliminated. 

When one group is in power it can set a tax policy 
favorable to the other group as a way of trying to buy 
that group off and thereby avoid a coup or revolution. 
However, Acemolgu and Robinson identify conditions 
under which the poor cannot offer the rich enough to 
buy them off. In these circumstances, the rich always 
launch a coup when out of power and times are bad and 
relinquish political control when in power and times are 
bad. Thus, the county oscillates between democratic 
and authoritarian regimes. 

A dynamic commitment problem drives this oscilla- 
tion. When times are bad and the poor are in power, the 
poor would like to buy the rich off and thereby avert a 
coup. To do this, the poor must promise to give the rich 
their certainty equivalent of launching a coup, i.e., how 
much the rich would expect to get were they to mount 
a coup. However, the amount that can be transferred 
in any one period is constrained by that period’s total 
output and the fact that the poor cannot set a tax rate 
below zero. As a result, buying the rich off requires 
that the poor keep taxes low for more than one period. 
But the poor cannot commit to future tax rates, and, 
with probability 1 — s, normal times will return in the 
next period. If they do, the threat of a coup will evap- 
orate, and the poor will have an incentive to renege 
on a promise of lower taxes.° Foreseeing this, the rich 
initiate a costly and therefore inefficient coup. 

Fearon (2003) sees a commitment problem at the 
heart of some secessionist civil wars. His analysis grows 
out of an effort to explain the empirical patterns of 
civil wars. (Fearon and Laitin [2003] also describe some 
of these patterns.) Fearon (2003) begins by rejecting 
asymmetric information as a plausible account of some 
types of prolonged civil wars. 


{I]t strains credulity to imagine that the parties to a war 
that has been going on for many years, and that looks very 
much the same from year to year, can hold any signifi- 
cant private information about their capabilities or resolve. 
Rather, after a few years of war, fighters on both sides of 


5 By assumption, launching a coup in good times is too costly (1.6, is 
dommated by acceptmg the poor’s ideal policy). 

6 Recall that the payoffs are restricted m such a way that mounting 
a coup in normal trmes 1s strictly dominated. 
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an insurgency typically develop accurate understandings of 
the other side’s capabilities, tactics, and resolve. Certainly 
both sides in Sri Lanka (for instance) fight on in the hope 
that by luck and effort they will prevail militarily. But it is 
hard to imagine that they do so because they have some 
private information that makes it reasonable for them to be 
more optimistic about the odds than the other side is. In 
the absence of significant private information, why can’t 
they cut a deal on the basis of a more-or-less common 
understanding of the terms of the military stalemate? (19) 


The answer Fearon (2003) develops is a commitment 
problem arising from “a temporary shock to govern- 
ment capabilities or legitimacy [that] gives coup plot- 
ters or rebels a window of opportunity” (22). In his 
model, a government, G, and a rebel group, R, nego- 
tiate about regional control. At the start of a round 
(see Figure 1, which is based on Fearon’s Figure 2), 
the government is strong or weak with probabilities 
1—e and e, respectively, and whether the government 
is strong or weak is common knowledge. The govern- 
ment then makes a take-it-or-leave-it offer c, € [0, 1] 
that represents the regional control that the govern- 
ment intends to keep for itself during that period. In 
effect, the government proposes a division of the social 
pie. If the government is strong, the rebels’ only alter- 
native is to accept the offer and the period ends with 
the government’s receiving c; and the rebels’ obtaining 
l-e. 

If the government is weak, the rebels can accept the 
government’s offer or fight. Accepting ends the period 
and brings the government and rebels payoffs of c, and 
1 — c, respectively. A new period then begins with the 
government strong or weak with probabilities 1 — s and 
e and the government deciding what to offer. 

If the rebels fight, then the government and rebels 
receive payoffs kg and kr during that period. By as- 
sumption, kg+kr<1 so that fighting is inefficient. 
Fighting ends in one of three ways: With probabil- 
ity a, the government wins and a new period begins, 
with the government strong or weak with probabilities 
1—e and s and the government making a new offer. 
With probability £, the rebels prevail and the game 
ends with secession. Finally, the fighting results in a 
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stalemate with probability y = 1 — a — £, in which case 
the government and rebels have to decide whether 
or not to keep fighting. If either of them chooses to 
fight, they receive the payoffs to fighting, kg and kp, 
and the war continues for another period. As before, 
the government wins with probability œ, the rebels 
win with probability £, and a stalemate occurs with 
probability y. The war continues in this way until 
one side of the other wins or both decide to stop 
fighting. 

Fearon shows that in some circumstances there must 
be fighting in any subgame perfect equilibrium even 
though it is inefficient. To see the basic intuition, con- 
sider a period in which the government is weak. To 
induce the rebels not to fight, the government must 
concede enough to them so that they prefer these con- 
cessions to fighting. However, the rebels’ continuation 
payoff to fighting typically exceeds one. (This payoff 
includes the payoff to fighting in the current period, 
kp, plus the expected payoffs in subsequent periods.) 
This and the fact that the government can transfer no 
more than one to the rebels in any single period (by 
setting c; =0) means that buying the rebels off entails 
a promise to transfer resources to the rebels for more 
than one period. 

The government strictly prefers this transfer to fight- 
ing because the latter is costly. The government there- 
fore would like to be able to commit itself to following 
through on this promise. But it cannot. With proba- 
bility 1 — e, the government will be strong in the next 
period, the threat of rebellion will disappear, and the 
government’s payoff to reneging on its promised trans- 
fer will exceed its payoff to following through on it. 
Anticipating this, the rebel group fights while it has the 
chance. 

A similar complete-information commitment prob- 
lem arises in a very different substantive context. de 
Figueiredo (2002) asks why elected officials might de- 
liberately pursue inefficient policies. He postulates a 
policy environment in which changing circumstances 
mean that a political party, if it were sure that it would 
remain in power, would prefer not to lock in a rigid 
policy so that it could adjust its policy to changing 
circumstances. In other words, locking in a policy is 
inefficient. Nevertheless, de Figueiredo shows that a 
party prefers to lock its policy in when it is unlikely to 
remain in power. 

de Figueiredo’s analysis begins with a “reciprocity” 
game between two political parties, A and B. In this 
infinite game, A is in power with probability y in any 
period and Bis in power with probability 1 — y. During 
any round in which a party is in power, it implements its 
own policy and decides whether or not to overturn the 
other party’s policy (assuming that the other’s policy is 
still in place). A party receives one during any period 
in which its policy is in place and the other party’s 
is not, 6 €(0,1) during any period in which both 
parties’ policies are in place, and zero during any period 
in which its policy is not in place and the other 
party’s is. 

If £ is larger than y and 1— y (and the discount 
factor is sufficiently high), then both parties prefer 
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cooperating by not overturning the other party’s 
policy.’ Moreover, this cooperative outcome can be sus- 
tained in a subgame perfect equilibrium by the threat 
that should one party ever deviate by overturning the 
other party’s policy, then neither party will ever coop- 
erate again. 

To introduce the inefficiency puzzle, suppose that a 
party has an additional option when it assumes office 
for the first time. It can insulate its policy by creating bu- 
reaucratic or political obstacles that make it difficult to 
change. For example, the party in control might create 
an administrative agency whose procedures are subject 
to judicial review. (See de Figueiredo 2003, 2002 and de 
Figueiredo and Vanden Bergh 2001 for additional ex- 
amples and discussion.) Formally, once a party insulates 
its policy, the other party cannot overturn it. Insulation, 
however, is costly. 

If political uncertainty is low (ie, y is far away 
from $), then at least one party engages in inefficient 
insulation.’ Suppose that Ais politically weak and un- 
likely to hold power in general (i.e., y is small) but that 
A happens to be in power in the current period. A can 
lock its policy in place so that it obtains a < 1 during any 
period in which only its policy is in place and af during 
any period in which both parties’ policies are in place. 
(The fact that a <1 ensures that insulation is costly.) 
To forgo the opportunity to lock this payoff in, A must 
believe that B will refrain from overturning A’s policy 
in future periods sufficiently often that 4’s payoff to not 
insulating its policy is higher than its payoff to doing so. 

But the only reason B would refrain from overturn- 
ing 4’s policy is that A would subsequently impose a 
costly punishment on B that outweighs B’s gain from 
overturning A’s policy. However, the only way that A 
can punish B is by overturning B’s policy when A is 
in power. Consequently, a politically weak A will be 
unable to impose much punishment on B because it 
is unlikely to be in power very often. Indeed, if A is 
sufficiently weak, it cannot impose enough punishment 
on Bto deter B from overturning A’s policies whenever 
B is in office. In these circumstances, A prefers to lock 
its policy in because B is very likely to be in power in 
the next period and, if so, to overturn A’s policy if it 
has not been insulated. Once again, inefficiency results 
when one actor must make concessions across multiple 
periods to buy another actor off (ie, B must refrain 
from overturning .A’s policy to induce A not to insu- 
late). But a shifting strategic environment undermines 
the credibility of these promised concessions. 


7 Suppose that each party always overturns the other party’s pol- 
icy. If A is ın power rt obtains one in that period and its expected 
payoff in all future periods is y 1+(1—y)O=y. Hence, A's av- 
erage payoff (in the limit as the discount factor ô goes to ane) is 
(1—8)[1+y/(1—6)] > y, and B's average payoff 1s 1—y. Now 
suppose that the parties cooperate by never overturning the other’s 
policy. Then each party’s average payoff 1s arbitrarily close to b. (Be 
cause the second party to assume power obtams zero until ıt comes 
to power, its average payoff is slighly less than £ and the other party’s 
average payoff is slightly more than £. But these differences go to 
zero as the discount factor goes to one.) Both parties therefore prefer 
cooperation if 6 > max{y, 1 — y}. 

* Pohtical uncertamty 1s low if y is far from 4 because one party 1s 
likely to hold power most of the tme. 
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de Figueiredo’s conclusion that insulation is most 
likely to occur when political uncertainty is low con- 
trasts with the conclusion derived from nongame the- 
oretic work that argues that inefficient insulation is 
most likely to occur when political uncertainty is high 
(e.g., Moe 1990). Moreover, he finds empirical support 
for this claim in his analysis of when states adopt the 
line-item veto (de Figueiredo 2003) or an administra- 
tive procedures act (de Figueiredo and Vanden Bergh 
2003). Both of these are ways for those in control of 
a state’s legislature to lock in or at least insulate their 
policies. 

The strategic environment shifts stochastically in 
the previous examples. Times are good or bad with 
probabilities 1—s and s, the government is strong or 
weak with probabilities 1—¢ and £, and Aor B is in 
power with probabilities y and 1— y. The strategic 
environment shifts deterministically in Powell’s (1999) 
study of preventive war, where, nevertheless, a similar 
complete-information commitment problem can arise. 

In Powell’s model of states’ efforts to cope with shifts 
in the distribution of military power, a declining state 
and a rising state are negotiating about revising the ter- 
titorial status quo q € [0, 1]. The declining state, D, be- 
gins the game by either proposing a revision xp € [0, 1] 
to the status quo or attacking. Attacking ends the game 
in a costly lottery. In this lottery, the rising state, R, 
wins all of the territory and the future benefits from 
that territory with probability pp and D wins every- 
thing with probability 1— pp. R’s payoff to fighting is 
therefore po "79 8/(1—r)+(1— po) È ~0 /(0—r) 
= (po — r)/(1 — 8), where 8 is the states’ common dis- 
count factor and r is R’s cost of fighting. D’s payoff to 
fighting is defined analogously.’ 

If D does not attack and makes an offer instead, then 
Rcan accept, reject, or fight. Accepting ends the round. 
Rand D and receive payoffs xo and 1 — x9, x) becomes 
the new territorial status quo, and D begins the next 
round by either attacking or making a new offer. If R 
rejects D’s initial offer, the status quo remains in place, 
R and D receive payoffs q and 1—4, and D begins 
the next round by either attacking or making a new 
offer. Finally, R’s attacking in response to D’s offer ends 
the game in a costly lottery with the payoffs described 
above. 

To formalize the shifting distribution of power be- 
tween D and R, Powell assumes that the rising state’s 
probability of prevailing starts out at some pp. It then 
increases by A in each of the next T periods (ie., 
Pe = Po +tA for 0<t<T), after which it remains at 
Pr=po+TA. All of this is common knowledge. 

Powell’s primary focus is on bargaining when D is 
uncertain of R’s cost of fighting. But he does show 
that complete-information bargaining breaks down 
in war if there are large and rapid shifts in the 


9 More precisely, de Figueiredo shows that fiscal conservatives who 
are politically weak are significantly more likely to propose a line- 
item veto than are fiscal conservatives who are in a strong political 
position, i e., Hkely to remain in control of the state legislature. 

10 The states are assumed to be risk neutral here for expositional 
case. Powell (1999) allows the states to be risk neutral or averse. 
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distribution of power. To highlight the basic idea, ob- 
serve that the declining state can lock in a payoff 
of (1— p,—d)/(1—8) if it fights at time t. In con- 
trast, D’s payoff to not fighting is bounded above 
by 1 +8[1/(1 — 8) — (p: +A —r)/(1 — 8)] fort < T. The 
first term is the best that D can do in the current 
period. The second term is an upper bound on D’s 
future payoffs. That is, this term is the discounted dif- 
ference between the total flow of benefits, which is all 
there is to be divided between the bargainers, and how 
much of that total R can lock in for itself by fighting in 
the next period. Clearly, R has to get at least this much 
along any efficient path if it is to be induced not to fight. 
Therefore, the difference between all that there is to 
be divided and what R can lock in constitutes an upper 
bound on what D can get in the future. Consequently, 
Dstrictly prefers fighting if what it can lock in by doing 
so is strictly greater than this upper bound on what it 
can get if it does not fight. In symbols, D prefers to fight 
if (1— p,—d)/(1—8) > 1 +8[1/(1— ô) — (p: + A —r)/ 
(1—8)] or, equivalently, if 8A >(1— 8)pı +4d+ ôr. 
Hence, complete-information bargaining is sure to 
break down in inefficient fighting if the per-period shift 
in the distribution of power A is larger than the average 
amount consumed by fighting r +d and if the discount 
factor is close enough to one.!! 

Less formally, the rising state would like to induce 
the declining state not to fight by committing itself to 
abiding by a territorial division that D prefers to fight- 
ing. To do this, R must forebear from fighting. But R’s 
increasing military strength increases its payoff to fight- 
ing and thereby undermines it promise not to attack. 

In sum, the actors in the preceding examples face 
the same broad strategic problem. The bargainers are 
trying to divide a flow of benefits in a setting in which 
they cannot commit to future divisions. Each actor also 
has the option of using some form of power to lock in 
a share of the flow. But the use of power is inefficient 
and destroys some of the flow. Finally, a shifting envi- 
ronment changes the amounts the bargainers can lock 
in. 
Complete-information bargaining breaks down in 
each case for the same basic reason. To avoid the inef- 
ficient use of power, one bargainer must buy off a tem- 
porarily stong adversary (i.e., a bargainer who can lock 
in a high payoff). Resource constraints mean that the 
transfers needed to do this must stretch across a “con- 
cession phase” lasting multiple periods. But during this 
phase, the once-weak bargainer is very likely to become 
strong enough to want to renege on the promised trans- 
fer. This prospect undermines that bargainer’s ability 
to credibly commit to the transfer, and the bargaining 
breaks down. 


AN INEFFICIENCY CONDITION 


This section formalizes the common mechanism at 
work in the previous examples in terms of an ineffi- 
ciency condition. When this condition holds, all of the 


11 Fearon (1995, 402-6) discusses the same kind of commitment 
problem. 
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equilibria of a two-actor stochastic game are inefficient. 
The section also shows that the inefficiency condition 
has three natural substantive interpretations. 

The inefficiency condition applies to stochastic 
games which are a generalization of repeated games. 
In the latter, the strategic environment remains con- 
stant. The actors play the same game over and over. In 
a stochastic game, the strategic environment changes. 
The game the actors play in any period may depend 
on the game they played in the previous period, what 
they did in that period, and additional random factors. 
For example, the game that the rich and poor play in 
Acemoglu and Robinson 2001 depends on which of 
them was in power in the previous period, on whether 
or not the out-of-power actor tried to depose the other 
actor, and on random fluctuations in the economy, e.g., 
whether times are normal or bad. 

To specify some elements of a stochastic game T 
somewhat more formally, let {4}¥_; denote the set of 
states or stage games and q be a transition function.’ 
The states define the various games the actors might 
play in any round. The transition function q(n|k, s) is 
the probability that the next state will be A, given that 
the current state is A, and that the players took actions 
s in Ax. Play begins in I in a given state, and each actor’s 
payoff is the present value of the sum of its stage-game 
payoffs where 6 is the players’ common discount factor. 
When deciding what to do, the actors know the current 
state as well as the entire history of previous states and 
what the actors did in those states. 

To specify the inefficiency condition, let M,(k) be 
j’s minmax payoff for the two-player stochastic game 
starting in state 4x. That is, j can assure itself of 
an expected payoff of M,(k) starting from state k. It 
is important to emphasize that this payoff is not the 
minmax payoff of the stage game A+. M,(k) is the min- 
max payoff of the continuation game starting in state k. 
Consequently, j’s payoff in any subgame perfect equi- 
librium starting from state A, must be at least as large 
as M,(k). 

Now consider an efficient profile e and the path 
p(e) that it traces out. That is, e is a pair of strategies 
¢=(¢1, e2) for players 1 and 2 such that the expected 
payoffs to following these strategies are Pareto opti- 
mal in the stochastic game, and p(e) are the states that 
are reached with positive probability if the actors play 
according to e.!* If either player has an incentive to 
deviate from this path, then e is not an equilibrium. 
And a player is sure to have an incentive to deviate if 
there exists a state along the path at which that player’s 
minmax payoff is strictly greater than its payoff to con- 
tinuing to play according to e. If, moreover, such a state 
exists along every efficient path, then there are no effi- 
cient equilibria. The inefficiency condition ensures that 


12 Abusing the definitions but greatly casing the exposition, I use 
“stage game” and “state” synonymously. See Fredman (1986, 124- 
25) for a complete description of a stochastic game. 

13 Generalizing the inefficiency condition to more than two players 
n straightforward. 

14 More precisely, let x (h | e) be the probability of history A given e. 
Then A is along the path p(e) if and only if a (h|e)>0 
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this is the case by finding an upper bound on a player’s 
payoff to continuing along an efficient path and then 
requiring this upper bound to be strictly less than the 
player’s minmax payoff. 

To specify an upper bound on j’s continuation payoff 
to following e starting at Ax, let B, be the maximum 
expected flow of future benefits starting in A+. Loosely, 
B, is all that there is to be divided between the two 
players starting from Ax. It is the present value of the 
expected flow of pies. (B, is defined formally in the 
Appendix.) 

Observe that the other player, i, can guarantee that 
it will obtain a certain amount of this flow. Let a‘, be 
a lower bound on i’s payoff in A, given that the play- 
ers are following an efficient path. (This payoff and 
the expectation discussed below also are defined for- 
mally in the Appendix.) Player i must get at least this 
much in the current round if play follows e. This ac- 
tor can then assure itself of getting at least its minmax 
payoff M, (n) if the next stage game is A,. Hence, i’s 
expected future payoff starting at A, is bounded be- 
low by the (discounted) value of i’s expected minmax 
payoff E,[M,(n)] where the expectation is based on 
what is known at A,. Thus, i is sure to receive at least 
a, + 8E,[M,(n)] starting at Ay. 

Putting all of this together, j’s continuation payoff 
to following e starting from A, is bounded above by 
By —[a,+6E[Mi(n)]]. Roughly, this is the difference 
between all there is to be divided if the actors play 
efficiently and what i can assure itself. Now define the 
inefficiency condition for actor j to be 


M, (k) > By — [a; +8 [M(n)]]. (1) 


When this holds, j will have an incentive to deviate from 
e at Ax, and consequently, the efficient path e cannot 
be an equilibrium. 

In practice, a closely related condition often turns 
out to be easier to use than (1). An action in state k 
is conditionally dominated if, starting from that state, 
every strategy that puts positive weight on that action 
is strictly dominated. For example, trying to depose the 
faction in power in normal times is conditionally dom- 
inated in Acemoglu and Robinson. By construction, 
launching a coup or revolution in good times is so costly 
that the out-of-power faction is always strictly better 
off if it waits for bad times before acting. Let M (Ak) 
be j’s minmax payoff starting in A, given that the other 
player i does not play conditionally dominated strate- 
gies. Then the analogue of condition (1) is 


M;(k) > Be — [ax + 8 [Mi (n)]]. a’) 
These conditions lead immediately to: 


Proposition 1. A two-actor stochastic game T has no 
efficient Nash equilibria if for each efficient path there 


15 In the examples above and m applied work more generally, a’, 1s 
usually quite easy to determine. A looser lower bound is to normalize 
the players’s utilities so that all of the payoffs m every stage game are 
nonnegative and then take a, =0 This leads to a somewhat looser 
version of the inefficiency condition in (1). 
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exists a state A, at which inefficiency condition (1) or 
(1’) holds for one of the players. 


Proof. See the Appendix. 


To develop substantive interpretations for (1) and 
implicitly (1’), it will be useful to relax these conditions 
slightly. If necessary, normalize the payoffs so that each 
player always receives at least zero in any period along 
any efficient path. This means that a} > 0. (The exam- 
ples above already satisfy this condition.) Clearly, con- 
dition (1) holds and the game has no efficient equilibria 
if M,(k) > Bx —-3&%[M,(n)]. Rewriting this inequality 
yields two expressions that have natural substantive 
interpretations: 


M; (K) +5 ELM, (n)] > Be, (2) 
5 Ex[Mi(n)] — M(k) > Be- [M (k) + MK]. O 


The left side of condition (2) is the amount that j can 
assure itself or “lock in” starting in state k plus the 
(discounted) expected value of what i can lock in the 
next period if the actors follow the efficient path at k. 
Condition (2) simply says that the sum of these lock-ins 
exceeds the total amount there is to be divided. When 
this is the case, it is impossible to satisfy both players’ 
claims on the flow of benefits. 

The second interpretation is more dynamic. The left 
side of (3) is the expected shift in i’s minmax payoff. In 
a rough sense this shift measures how much more pow- 
erful i will become and, implicitly, how much weaker j 
will become. The right side is the size of the bargaining 
surplus, i.e., the difference between what there is to 
be divided, &, less the sum of what each player can 
assure itself. Thus, (3) holds and there are no efficient 
equilibria when the expected shift in one of the player’s 
minmax payoff is larger than the bargaining surplus. 
Less formally, large, rapid changes in the bargainers’ 
relative power (measured by shifts in their minmax 
payoffs) cause inefficiency. 

Shifts of this kind are what drive the inefficiency 
in Acemoglu and Robinson 2000, 2001, Fearon 2003, 
de Figueiredo 2003, and Powell 1999. (The Appendix 
establishes the relationship between these examples 
and conditions (1) and (1’) more formally.) The rich 
launch a costly coup when times are bad and the cost 
of deposing the opposition is low, because normal times 
are likely to return in the next period and the rich will 
be weak. The rebels fight when the government is weak 
in Fearon because, with a high probability, the govern- 
ment will be strong in the next period. A weak party that 
happens to find itself in office in de Figueiredo insulates 
its policies because it is likely to be out of office in next 
period. And a declining state in Powell fights if it will be 
much weaker in the next period. These changes alter 
the players’ minmax payoffs and result in the inefficient 
use of power. 

As discussed in the previous section, these shifts un- 
dermine an actor’s ability to buy the other actor off by 
limiting the amount that the former can credibly com- 
mit to transfering to the latter. For example, the poor in 
Acemoglu and Robinson 2001 cannot credibly commit 
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to lower taxes long enough to dissuade the rich from 
mounting a coup. A third substantive interpretation of 
condition (1) highlights the role that the actors’ lim- 
ited ability to make transfers plays in the commitment 
problem. 

To develop this interpretation, assume that maximiz- 
ing the sum of the current and future benefits also maxi- 
mizes the current benefits. That is, maximizing the total 
flow of pies does not require accepting a smaller pie in 
the current period. (All of the models discussed above 
satisfy this “separability” condition, which is formal- 
ized in the Appendix.) This means that the maximum 
flow of current and future benefits B equals the maxi- 
mum benefits there are to be had in the current period, 
Cx, plus the discounted maximum flow of future bene- 
fits Fy, i.e., Bk = Cp + ô Fk. 

Corollary 1 below shows that condition (1) implies 
that 


M, (k) > a + 6[ Fk — Ex[M,(n)]], (4) 


where 2} is the maximum payoff j can achieve in state 
Ax along any efficient path. Put another way, this is the 
most that i can transfer to j in state A, on an efficient 
path. The term in brackets is the difference between 
the future flow of benefits and what i can assure itself in 
the future. This difference is therefore an upper bound 
on what i can credibly promise to transfer to j in the 
future. Were i to try to transfer more than this, then 
i’s future payoff in some state n would be less than its 
minmax payoff M,(n), thus giving i an incentive to re- 
nege. Hence, condition (4) can be interpreted as saying 
that j’s minmax payoff in k is larger than the amount 
that i can transfer to j in the current period plus what it 
can credibly promise to transfer in the future given the 
expected shift in i’s power as measured by its minmax 
payoff 


Corollary 1. If the maximum flow of benefits is sepa- 
rable as described informally above and formally below, 
then condition (1) implies condition (4). 


Proof. See the Appendix. 


Finally, it is interesting to consider the efficient 
equilibria in infinitely repeated games in light of the 
inefficiency conditions As the folk theorem shows 
(Fudenberg and Maskin 1986), there are always effi- 
cient equilibria in an infinitely repeated game if the 
actors do not discount the future too much. The ex- 
istence of these equilibria means that the inefficiency 
conditions must not hold in an infinitely repeated game. 
Why not? 

In a repeated game, there is a single stage game, say 
A, which is repeated infinitely often. This means that 
the strategic environment is completely stable in the 
sense that the continuation game is always the same, 
namely, an infinite repetition of A There are no large, 
rapid shifts to undermine efficiency in repeated games. 

To put this point more formally, let m; be j’s minmax 
payoff in the two-actor stage game A where the pay- 
offs have been normalized so that they are all nonneg- 
ative. Then j’s minmax payoff in the continuation game 
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is the present value of having m, in every period. This 
means that M, =m,/(1—8). Moreaver j’s expected 
minmax payoff in the continuation game starting in 
the next period is also M,, because the continuation 
game never changes. As a result, condition (3) becomes 
—(1—8)M, > B—(M,+M,), where Bis the maximum 
flow of future benefits But the fact that each player 
must get at least as much as its minmax payoff in equi- 
librium implies that the maximum flow of benefits must 
be at least as large as the sum of the minmax payoffs, i.e., 
B> M, + M,. Hence, (3) never holds in the unchang- 
ing strategic environment of an infinitely repeated 
game. 


CONCLUSION 


A common mechanism is at work across a very wide 
range of recent work in American, comparative, and 
international politics. In each case, a temporarily weak 
actor wants to induce its adversary to refrain from an 
inefficient use of power, e.g., launching a coup, starting 
a civil war, insulating its policy, or going to war. Because 
the use of power consumes resources, avoiding its use 
saves resources and means that there is enough for the 
weaker actor to buy its adversary off. But resource con- 
straints mean that the transfers needed to accomplish 
this will take several periods to complete. However, 
the strategic environment is shifting sufficiently rapidly 
that the temporarily weak actor is likely to become 
strong enough during this concession phase that it will 
renege on its promised transfers. This undermines the 
credibility of these transfers and leads to the inefficient 
use of power. In short, large, rapid shifts in relative 
bargaining power can lead to bargaining breakdowns 
even if there is complete information. 

This common mechanism provides a unifying per- 
spective on these analyses and a more general formula- 
tion of a fundamental strategic problem that can cause 
breakdowns and the inefficient use of power in very 
diverse substantive settings. Seeing the basic mecha- 
nism driving this inefficiency more clearly also poses a 
challenge for future work. The models discussed above 
and the more general condition (1) “black box” these 
shifts, e.g., the government is either strong or weak with 
probabilities 1— s and s in Fearon 2002. Opening up 
this black box and specifying the microfoundations for 
these changes is an important task for future work. 

It is, for example, relatively easy to see how the dis- 
tribution of power can shift quickly and dramatically in 
the context of legislative bodies and a two-party system. 
Electorally weak parties that happen to capture a ma- 
jority of seats acquire the significantly greater powers 
of the majority party. Insofar as these parties are likely 
to lose the next election and be in the minority for a sub- 
stantial period, there will be a large and rapid shift in the 
expected distribution of power. In addition to its being 
easier to see how the distribution of power can shift 
rapidly in this context, the direct empirical evidence in 
support of this mechanism is strongest in this substan- 
tive setting (e.g., de Figueiredo 2003, de Figueiredo and 
Vanden Bergh 2001). 
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It it is less clear how the distribution of power can 
change so rapidly in other contexts. In Powell’s model, 
for example, rapid shifts in the distribution of military 
power lead to war. But he suggests that the shifts in the 
distribution of military power due to differential rates 
of economic growth are empirically too small to ac- 
count for war through this mechanism, although he also 
recognizes that it is extremely hard to identify plausible 
parameter values in such spare models (1999, 133). 

The mechanism defined in condition (1) is a formal 
result. It shows that complete-information bargaining 
breaks down for the same fundamental reason in a 
number of seemingly unrelated games that have been 
used to study an important and diverse set of substan- 
tive issues. But even if a specific model satisfies condi- 
tion (1), whether this mechanism is really at work in 
actual cases depends on how well the model represents 
the cases. One way to advance our understanding of 
these cases and of the general mechanism is to begin to 
elaborate the microfoundations underlying these shifts 
in order to compare them to what appears to be hap- 


pening on the ground. 


APPENDIX 


This appendix proves Proposition 1 and Corollary 1. It then 
formalizes the relation between conditions (1) and (1’) and 
the inefficiency conditions derived in the equilibrium anal- 
yses in Acemoglu and Robinson 2000, 2001, Fearon 2002, 
de Figueiredo 2003, and Powell 1999.16 

The first step in proving Proposition 1 is to define the 
bounds B, and a, and the expected minmax payoff F| M, (n)]. 
Let & be the set of strategy profiles of the stochastic 
game T and V!(o|A,) be j’s continuation payoff if play 
follows ce £ and starts from state A, after history hx. 
Then define the expected flow of benefits to be the max- 
imum of the sum of the two players’ continuation values: 
By = max{V!(o | Ay) + V (0 | Ag): € £}, where this maxi- 
mum clearly depends only on A; and not on the history lead- 
ing up to that state. If, moreover, the game has transferable 
utility, then the sum of the bargainers’ utilities in each period 
is the size of that period’s pie. Bẹ in this case is simply the 
present value of the expected flow of pies. 

As for aj, let a,(s) be 1’s payoff in A; if the players take 
actions s. Take E to be the set of efficient profiles in r. 
Also, let hy be any history leading to state A, A be the 
set of all such histories, and 1(hy|o) be the probability of 
hy given that play follows o € £. Then, a 1s the minimum 
payoff i receives at A, along any efficient path. In symbols, 

= min{aj(e(Ay)):e €E, hy € Hy, and z(h |e) > 0}, where 
eth) denotes the actions the players take at A, following A, 
given that they are playing according to e. 

To specify K[M(n)], observe that i’s expected min- 
max payoff starting ın the next penod given what 
is known in the current period is simply [M (n)]= 
Lank, e(hy))M,(n). The proof follows directly from 
these definitions. 


Proof of Proposition! Player ) will have an incentive to 


deviate from the path P(e) any e€ € if there exists a state 
A, such that M,(4)> V (elh) for some Ay such that 


16 Space limitations make rt impossible to repeat these analyses here, 
so the present discussion presumes that readers can refer to them. 
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x(hą|e)>0. Condition (1) ensures that this is the case. 
Leto* be a strategy profile that maximizes the flow of benefits 
starting from A; € P(e), ie, o* satisfies B= V! (o* | A) + 
Vi(o*| Ay). Because o* maximizes V!(o | Ay) + V (0 | Ax), 
it follows that V/’(o*| Ay)+V'i(o*| Ad) > V! (elh) + 
V'(e| hi). This, along with V'(e | he) > a, + 3 24[M,(A,)] and 
condition (1), implies that 


M,(k) > By — [a +8 ELM (n)]] 
> V (e |h) + V (e | Ae) — [a + 8 IM (n)]] 
> Vi(e| hy). 


Hence, there will be no efficient equilibria if (1) holds for 
all ee €. An analogous argument shows that there are no 
efficient equilibria if (1’) holds. E 


To formalıze the separability condition needed in Corol- 
lary 1, let Cy, be the maximum of the sum of the actors’ 
payoffs in state Ay: Cy m max{a(s) +a,(s):5 € S}, where Sk 
is the set of action profiles in 4,.!’ Fis smply the maximum 
of the expected sum of the actors’ future payoffs given 
that the present state is A,: im max{$ 3 q(n|ko) 
[V (0 | An) + V (0 | A,)]:0 € E}. Clearly, Be < Cy + 8 Fy. The 
separabılity condition requires that this condition hold with 
equality, 1e., By = Ce + 3 Fy. This yields: 


Proof of Corollary 1. Define the upper bound on j’s 
payoff at A, along any efficient path p(e) to be äj m 
max{aj(e(hy)):¢€ E, hy € Hy, and x (h; | e) > 0}. Substituting 
for By in condition (1) then gives M, (k) > Cr +8 Fx — [a4 + 
ô E[M (n)]]. 

The proof is complete if C: — ai > a}. To see that this is 
80, let p(e*) be an efficient path that gives J its maximum dj 
in state A,. Then the maximum sum of the players’ payoffs 
in state A, is at least as large as what e* gives them whenever 
they are in Ay. In symbols, Cy > af (e*(Ag)) + ay (e*(he)) =2j + 
a,(e*(Ay)), given that x (h4 |e*)>0. But a; is the minimum 
that i obtains at A, along any efficient path. So, a,(e*(Ay)) > a 
as long as x(hą|e*)>0. Hence, Cy>aj +a, or G= 
æ > a. E 


Turning to the equilibrium analyses in Acemoglu and 
Robinson 2000, 2001, Fearon 2003, de Figueiredo 2003, and 
Powell 1999, conditions (1) and (1’) are based on minmax 
payoffs. Consequently, they might be much looser than or 
not very closely related to the equilibrium conditions in the 
preceding articles. Were this the case, it would indicate that 
the mechanism described in (1) and (1’) was not the source 
of the inefficiency in those examples. If, in contrast, there is 
little or no slack between the equilibrium conditions and (1) 
or (1’), then this mechanism 1s capturing the source of the 
inefficiency in those examples. 


17 Tn some cases a technicalrty must be overcome. In Fearon 2003 
and Powell 1999, for example, there are game-ending moves in some 
of the stage games, e g , the rebels win or one of the countnes goes to 
war. The payoffs to these moves mclude the payoffs obtained in the 
current period plus the flow payoffs from unmodeled future periods 
This future flow may be large compared to the players’ per-period 
payoffs from other actions in the stage game. Hence, maximizing the 
sum to the actors’ payoffs m this stage game does not correspond 
to maximuzing the sum of the actors’ “current” payoffs, which Cy is 
intended to represent To finesse this issue, cach stage game Ay with 
game-ending action profiles {5} a should be replaced with a stage 
game and a set of null games G), .. , Gr such that playing s4 in 
AL ts in a null state G,. Each player has only one move in G,, 
the state remains G, thereafter, and the payoff to playmg sg in A, 
and G, is the average of the payoff to playing x} in Ax. 
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The latter turns out to be the case. The slack between 
condition (1’) and the equilibrium conditions in Acemoglu 
and Robinson 2001 1s due solely to their assumption that 
taxation induces a deadweight loss. Were there no such loses, 
Acemoglu and Robinson’s “general results would not be al- 
tered” (2001, 941) and there would be no slack. Moreover, 
condition (1) is identical to the equilibrium condition needed 
to guarantee inefficiency in Fearon 2003 and, in the limit as 
the discount factor goes to one, in Powell 1999. The slack 
between (1) and the equilibrium condition in de Mgueiredo’s 
game is due to the nonlinearity of the Pareto frontier and 
disappears as this frontier becomes linear. 

The inefficiency in Acemoglu and Robinson 2001 arises 
when the poor are in power (ie., the regime is democratic) 
and times are bad. In these circumstances, the poor may or 
may not be able to buy off the rich through lower taxes and 
thereby prevent an inefficient coup. Whether or not the poor 
can prevent a coup depends on the cost of launching a coup 
as well as the values of other parameters. To simplify the 
analysis, Acemoglu and Robinson focus on Markov perfect 
equilibria and derive a condition sufficient to ensure that 
there will be costly coups in any Markov perfect equilibrium. 
However, this condition does not guarantee that there will 
always be coups in non-Markov equilibria, for the rich can 
punish the poor more severely in a non-Markov equilibrium if 
the poor renege on a promised level of taxation. This harsher 
punishment means that the poor can credibly commit to lower 
taxes in a non-Markov setting, and this makes it easier to 
sustain efficient equilibria. Hence, the first step in compar- 
ing condition (1’) to the sufficient equilibrium condition in 
Acemoglu and Robinson’s game is to extend their analysis 
to the non-Markov case. To ease the analysis, we also focus 
on the case in which there is no deadweight loss to taxation. 
(Acemoglu and Robinson [2001, 914] assume a deadweight 
logs to taxation as a matter of convenience in order to avoid 
corner solutions.) 

The first step in the analysis 13 to identify the worst equi- 
librium for each player, i.e., the equilibrium that gives each 
player its lowest equilibrium payoff. Because this is the worst 
equilibrium, an efficient allocation can be supported in equi- 
librium if and only if both players are deterred from deviating 
from this allocation by the threat to revert to this worst equi- 
librium. Hence, a condition sufficient to ensure inefficiency 
in the non-Markov case is that at least one of the players 
prefers to deviate from every efficient allocation even when 
threatened ın this way. 

To construct the worst equilibrium, observe first that the 
rich and poor actors in Acemoglu and Robinson’s game are 
really aggregates of identically behaved individuals. Let R 
and P denote these aggregate actors and r and p denote rich 
and poor individuals. Now consider the following strategies: 
The rich, R, set a tax rate of zero whenever they are in power 
and mount a coup whenever they are out of power and times 
are bad. The poor, P, set their optimal tax rate r™ whenever 
they are in power and launch a revolution whenever they are 
out of power and the times are bad. Absent deadweight losses, 
this optimal rate is 100%. (By assumption, the tax revenues 
are redistributed through transfers the size of which cannot 


18 This close relationship between the inefficiency condition based 
on minmax payoffs and the equilibrium conditions is actually not 
very surprising. In applied work, one often simphfies the analysis by 
constructing models in which the most severe punishments one actor 
can impose on another are part of a subgame perfect equilibrium, 
e.g. defecting in every round in a prisoner’s dilemma. When this 1s 
done, little will be lost in looking at minmax payoffs of continuation 
games rather than the incentive compatiblity constraints on equilib- 
num payoffs m continuation games. 
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depend on whether an individual is nch or poor. The poor, 
therefore, maximize their net income by taxing everything 
away and redistributing it evenly across the population.) 

These strategies constitute a subgame perfect equilibrium. 
It is also clear that this is the worst equilibrium for both play- 
ers. Call this worst equilibrium Z, and let Z’ (ø) for j = porr 
denote individual j’s payoff in the continuation game starting 
in state o given that Rand P play according to Z. 

Now consider a path along which P sets a tax rate of 
t(o |x), where o is the current economic state, i.e., whether 
times are normal or bad, and x 1s the history of economic 
states leading up to o. This path can be supported in equi- 
librium if and only if the following strategies are subgame 
perfect: P offers t(o | sr) and R does not mount a coup if the 
current state is ø and the history of previous states is x. If 
either player ever deviates from these actions in any period, 
P and Rstart playing according to Z. Let V/(o |) for j=r 
or p be j’s continuation payoff to playing according to these 
strategies starting in state o after history 7. 

Following Acemoglu and Robinson’s notation, take A, to 
be individual j’s capital stock. The fraction of the popu- 
lation that 1s poor is A, so the total capital stock, h, satisfies 
h=(1—A)h, +Ah,. Times are normal and bad with proba- 
bilities 1—s and s, respectively, and a unit of capital yields 
an income of one in normal times and an income of a<1 
ın bad times. Consequently, setting a tax rate of r in normal 
times yields a revenue of r[(1 — A)h, + AA,] and a net transfer 
to a poor person of A?(t,n)=t[(1—A)h, + AA] — thp = 
t(1—A)(h, —A,). The net transfers A(t, n), A?(r, b), and 
A' (t, b) are defined analogously where the balanced budget 
requirement implies that (1—A)A’(t,0) +AA?(t,0)=0 in 
state o. 

This implies that the poor’s payoff starting in a bad state 
following history x is 


V?(b| x) = ah, + A?(r(b| 7), b) 
+I- s)V” (nir) +sV?(b|x')), 
where x’ = {x, b} and £ 1s the common discount factor. The 
first two terms on the right side of the equation are p’s payoff 
in the current period and the third term is its discounted ex- 


pected continuation payoff Thus, the continuation value of 
the aggregate actor P is 


V?(b|x) = A[ah, + A?(r(b| 7), b)] 
+B[1—s)V?(n[x)+sV"(b[ x], 


where individual and aggregate payoffs are related by 
V?(o |x) mAV?(c |x) for state o. 

Because the nch and poor simply divide each period’s in- 
come as long as there is no coup, the aggregate income of the 
poor plus the aggregate income of the rich equals the present 
value of the expected flow of income: 


BI —s)h+ sah] 

1-8 i 
Combining the previous expressions and using h = (1 — à )h, 
+Ah, and (1—A)A’(r, b) +AA?(t, b) =0 give 

V¥(b| mr) = (1—A)ah, +(1—1)A" (t (x)) 


p1 -—s+sa)h 
+ POO" Sia- s)V" a1!) 
+sV?(b| x’). 
The net transfers to the rich are negative if the 
poor set a positive tax rate, so A’(t(b|7)) is bounded 


VP (b| x) + VR(b| x) =ah+ 
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above by zero. Incentive compatibility at the aggregate 
level (which is equivalent to the individual level) also 
means that V?(n|x’) > Z?(n) and V?(b| x’) > Z?(b), where 
Z?(o) mi 2Z?(c). Hence, 


VMbIn) < (~A)ah, + PEDA 


— BI(1 - s) Z?” (n) +s Z”(b)]. 


But incentive compatibility also requires the aggregate payoff 
of the rich to be at least as large as their  payott to deviating 
and getting Z instead, i.e., V8(b| x) > Z*(b). So a condition 
sufficient to ensure that there are no efficient equilibria is 


BA -s+sa)h 
~ 1-B. , 
— B[(1—s)Z?(n) +s Z” (b). A1) 


This is the condition needed to ensure inefficiency based 
on an equilibrium analysis of the game. It is also identical 
to condition (1’). The left side of (1’) is R’s minmax payoff 
in conditionally undominated strategies. This excludes the 
possibility that either player would try to depose the other 
in normal times, as these are conditionally dominated. P, 
therefore, minmaxes R (ın conditionally undominated strate- 
gies) by setting a tax rate of 100% whenever the poor are in 
power and launching a revolution whenever the poor are out 
of power and times are bad. This gives a rich individual r.and 
the mch R minmax payoffs of Z’ (b) and Z¥(b) starting from 
bad times. It also means that the left side of (1’) is Z¥(b). 

As for the right side of (1’), the total flow of benefits start- 
ing from bad times is just B =ah +$[(1 — s}h + sah]/(1 — B). 
The worst that the poor can do ın bad tımes along an effi- 
cient path is what they would receive if they paid no taxes 
and therefore received no transfers, i.e., ìAahp. As for their 
expected minmax payoff starting in the next period, R min- 
maxes P (in conditionally undominated strategies) by setting 
a tax rate of zero whenever the rich are in power and launch- 
ing a coup whenever the rich are out of power and times are 
bad. This means that P’s expected mınmax payoff before the 
state is revealed is (1 — s) Z7 (n) +s Z? (b). Condition (1’) then 


Z¥(b) > (1—A)ah, + 


gives 
Z¥(b) > ah + ae dah, 
— B[(1—s)Z?(m) +5Z°(b)], 
which reduces to (A1). 


Turning to the relation between condition (1) and Fearon’s 
(2003) equilibrium conditions, his Proposition 3 establishes 
equilibrium conditions that ensure that there exists no effi- 
cient subgame perfect equilibrium. Let V2 be the govern- 
ment’s expected continuation payoff going into a peace pe- 
riod given that the government never offers anything and 
the rebels fight at every opportunity. Sumularly, let Vg be 
the rebels’ continuation payoff but starting from a state in 
which the government 1s weak. (See Fearon 2003 for a more 
detailed specification of Vf and V’. The notation used here 
is consistent with his.) Then there are no efficient equilibria 
in Fearon’s model if and only if VY + 8VF >1/(1—8). 

This requirement is identical to condition 1 above. The 
rebels’ minmax payoff starting in a period in which the gov- 
ernment is weak is what they obtain by fighting in every 
period and is V¥’. ing this is the notation used in 
condition (1) gives Mp(weak) = V;’. Since the pie to be di- 
vided in each period is one, R=1/(1—4). The minimum 
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R can get in any period in which it is weak along any ef- 
fiaent path is zero. Finally, if the government and rebels 
reach an efficient allocation when the government is weak, 
the next period will be what Fearon calls a peace period and 
the government’s expected minmax payoff entering that pe- 
riod is VÈ. This leaves E,(Mc) = V2. Condition 1 then gives 
Ve > /a —8)—0-— VË, which is identical to the equilib- 
num condition. 

Inefficiency condition (1) and the equilibrium condition 
needed to ensure inefficiency in de Figueiredo 2003 are iden- 
tical if the Pareto frontier is linear, i.e., 1f 6 = 4. To establish 
this, take 8 = 4 and assume without loss of generality that A 
is the weaker party (i.¢., y < $). To specify the equilibrium 
conditions leading to inefficient insulation, recall that a party 
only has the option of insulating the first time it comes to 
power and consider the subgame in which B comes to power 
in the first round and, therefore, before A This subgame 15 
outlined in Figure 2. 

Evaluating Æs decisions, suppose that A is deciding 
whether to insulate when it first comes to power, ie., 
at (1i) in Figure 2.% If A insulates, it obtains œ in the 
current period plus an expected continuation payoff of 
8[ay + $a(1—y)]/(1 — 8). If A does not insulate, B’s unique 
best response is to overturn A’s policies whenever possible. 
(With £ =}, there are no gains to cooperating on not over- 
turning each other’s policies.) This leaves A with a payoff of 
one whenever it is in power and zero whenever it 1s out of 
power. This yields 1 + dy /(1 — 8). A, therefore, weakly prefers 
to insulate at (i1) if 


of 14 EA] 


1-6 1-3 (2) 





or, equivalently, if a>a,4;ym(1—801—y)]/[1-80—-y) 
(1 — f)]. Similarly, A weakly prefers to insulate at (ii) of 
a>a47=1-8(1—y). Clearly, a4) 1 < XAJ N- 

Now consider B’s decision at (i) if A overturns B’s policy 
whenever possible. Assume further that a4);<a<ay,)y 
which means that A only insulates if B did. Then B weakly 
prefers to insulate uf æ > ap = [1 — 8(1 — yJ — 8y )]/[1 — 8 + 
18(1 — y)]. Algebra then shows that a, > a4; x. B, therefore, 


- 





19 de Figueredo’s equhbrum analysis assumes that A either 1n- 
sulates or not regardless of whether B has ever been in office be- 
fore But A knows whether B has held power before and, therefore, 
whether B will have the option of insulating when it next comes to 
power. Accordingly, Acan condition its decision on this information 
This affects the cut points that define the equilibrium conditions at 
which a party insulates but does not affect de Figueiredo’s general 
conclusions. 
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prefers not to insulate if æ € (a417, «a y). This ımplies that all 
the equilibria in subgame (i) are inefficient only 1f æ > a4; x- 

A similar analysis of the subgame in which A comes to 
power in the first round demonstrates that there will be in- 
sulation here only if a > ag)n. The fact that y < 4 then gives 
aan <an y. Hence, the equilibrium condition needed to en- 
sure inefficiency is that æ > aay or, equivalently, that A2 
holds strictly. 

To compare this to (1), consider node (ii) where A has 
come to power for the first time and B did not insulate when 
it had the chance. Let M, denote A’s minmax payoff starting 
from this state. As for the upper bound on 4’s continuation 
payoff defined by the right side of (1), observe that, because 
B’s policy is in place when Acomes to power at (ii), the max- 
imum of the sum of the two players’ continuation payoffs is 
one in each period. This gives By =1/(1 — 8). Moreover, the 
minimum B can get at (ii) if A plays efficiently (Le., does not 
insulate) is zero. And B’s expected minmax payoff starting 
in the next period is what it obtains if each player always 
overturns the other’s policy: (1— y)/(1— 8). Condition (1) 
then becomes 


1 l-y 
Ma > aT fa IT 
by 
ipn. A3 
> aos (A3) 


Proposition 1 shows that there are no efficient equilibria 
whenever A3 holds. Consequently, the only difference be- 
tween A2 and A3 can occur if there are no efficient equilibria 
(aan <a) and if A3 still does not hold. But a > a4) 7 and A2 
imply that A's best reply to being minmaxed is to insulate. So, 
Ma=a[l + 8[y + $(1— y)]/(1 — 8)]]. Thus, A3 is identical to 
A2 if f =}, and the slack between these conditions is due to 
the nonlinearity of the Pareto frontier that arises if > 4. 

As shown above, bargaining breaks down in war in Powell’s 
model if 


l-p,-d 


1 Bt+A-r 
1-8 i 


>1+6 | I3 I3 

This inequality 1s equivalent to condition (1): The left side is 
the declining state’s minmax payoff which it can obtain by 
attacking. Since the size of the pie is one in every period, 
B,=1/(1 — 8). The rising state’s minimum payoff ın any pe- 
riod along any efficient path 1s zero. And the rising state’s 
minmax payoff in the next period is what it can obtain by 
fighting at that time: (p, + A—r)/(1—8). 

Turning to the equilibrium condition, there is a status quo 
distribution of territory q in Powell’s model, 1.e., the rising 
state controls q € [0, 1] of the territory at the outset of the 
game. This means that the declining state obtains a payoff of 
1—4 if does not fight in the last period, say t, prior to the 
msing state’s having a credible threat. Thereafter the declin- 
ing state keeps the rising state indifferent between fighting 
and accepting the rising state’s offer in each period. Thus, 
the declining state’s continuation payoff if it does not fight 
at time t is 1 — q + 8[1/(1 +8) — (p + A —r)/(1 — 8)], where 
(p, + A —r)/(1 — 8) is the rising state’s payoff to fighting ın 
the next period when ıts probability of prevailing has in- 
creased to p, + A. This implies that the equilibrium condition 


2 If p > 4, there are gams from cooperation and equilibria in which 
the parties do not always overturn each others’ policies. This mcreases 
As potential payoff if Adoes not insulate at (u), and this raises a4) N 
The maximum of the sum of the two players’ contmuaton payoffs 
also increases to Bk =28/(1 — 5). This makes comparing (1) and the 
equilibrium conditions with £ > $ very complicated and tedious 
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that yields fighting is 
1+p—d 1 Pt+A-r 
1-3 1-448] 1-3 


The slack between this equilibrium condition and (1) is due 
solely to payoffs in the initial period and this difference goes 
to zero as the discount factor goes to one. 
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retrenchment, the dominant thrust of which is that remarkably few welfare states have experienced 


O ver the last decade, students of the welfare state have produced an impressive body of research on 


shifts. This article questions this now-conventional wisdom by reconsidering the 
post-1970s trajectory of the Amencan welfare state, long considered the quintessential case of social policy 
stability. I demonstrate that although most programs have indeed resisted retrenchment, U.S. social policy 
has also offered increasingly incomplete risk protection in an era of dramatic social change. Although 
some of this disjuncture is inadvertent—an unintended consequence of the very political stickiness that 
has stymied retrenchment—I argue that the declining scope of risk protection also reflects deliberate and 
theoretically explicable strategies of reform adopted by welfare state opponents in the face of popular and 
change-resistant policies, a finding that has significant implications for the study of institutional change 


more broadly. 


as the welfare state continued to provide the 

inclusive social protection that defined its goals 

and operations in the immediate decades after 
World War II? According to much received scholarly 
wisdom, the answer is yes. As Paul Pierson writes in 
one of the earliest and most influential assessments, 
“Economic, political, and social pressures have fos- 
tered an image of welfare states under siege. Yet if one 
turns from abstract discussions of social transformation 
to an examination of actual policy, it becomes diffi- 
cult to sustain the proposition that these strains have 
generated fundamental shifts” (Pierson 1996, 173). A 
wave of research, relying on both large-scale statistical 
modeling and detailed historical analysis, has largely 
ratified this evaluation (see, e.g., Bonoli, George, and 
Taylor-Gooby 2000; Esping-Andersen 1999; Huber and 
Stephens 2001; Pierson 1994, 2001; and Weaver 1998). 
In this now-conventional view, welfare states are under 
strain, cuts have occurred, but social policy frameworks 
remain secure, anchored by their enduring popular- 
ity, their powerful constituencies, and their centrality 
within the postwar order. 

This article challenges this conventional wisdom and 
presents an alternative interpretation based on a com- 
paratively informed historical analysis of the post-1970s 
trajectory of the American welfare state—long consid- 
ered the quintessential example of welfare state stabil- 
ity in the face of fiscal and political challenge (see, e.g., 
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Huber and Stephens 2001 and Pierson 1996). This al- 
ternative account rests not simply on a reconsideration 
of the evidence. It rests, too, on a new perspective on 
social policy reform that broadens the range of policies 
and forms of change under consideration. In enlarging 
and shifting the focus of analysis—from formal rules to 
their social consequences, from the welfare state nar- 
rowly defined to the broader public-private economy 
of welfare, and from the highly visible politics of large- 
scale reform to the subterranean political processes 
that shape ground-level policy effects—this concep- 
tual framework illuminates and clarifies the sometimes- 
covert strategies that political actors adopt when trying 
to transform embedded policy commitments. In short, 
this article not only presents a new interpretation and 
explanation of the specific trajectory of the American 
welfare state, but also offers a new conceptual lens that 
lays bare the “hidden” means by which policies can 
be changed by actors employing strategies of stealth, 
obstruction, and indirection. 

Above all, however, the evidence and arguments pre- 
sented in this article give cause for questioning the 
conventional story about welfare state resilience in 
the United States. Although most U.S. public social 
programs have indeed resisted radical retrenchment, 
the American social welfare framework has also, in 
crucial areas, offered increasingly incomplete protec- 
tion against the key social risks that Americans con- 
front. One reason for this, as suggested by the 1996 
overhaul of the Aid to Families with Dependent Chil- 
dren program (commonly known as “welfare”), is that 
some policies have experienced major formal revision. 
But this, I contend, is only relatively small part of the 
larger story. More crucial are two less visible sources of 
change, both of which have occurred without significant 
formal alterations in policy. First, in policy areas that 
rely substantially on public-private or intergovernmen- 
tal cooperation, the shifting aims of benefit sponsors 
and administrators has transformed the ground-level 
operation of formally stable policies, at times quite radi- 
cally. Second, and perhaps even more important, recent 
decades have witnessed an accelerating process that I 
call “risk privatization,” in which stable social policies 
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have come to cover a declining portion of the salient 
risks faced by citizens, As a result of this process, many 
of the most potent threats to income are increasingly 
faced by families and individuals on their own, rather 
than by collective intermediaries. 

For those familiar with comparative research on the 
welfare state, this last point will resonate with the com- 
mon observation that advanced industrial societies are 
marked by a growing mismatch between traditional so- 
cial policies and the new social risks that citizens face. 
As Gøsta Esping-Andersen (1999, 5), the dean of wel- 
fare state scholars, puts it, “The real ‘crisis’ of contem- 
porary welfare regimes lies in the disjuncture between 
the existing institutional configuration and exogenous 
change. Contemporary welfare states . . . have their ori- 
gins in, and mirror, a society that no longer obtains.” 
And yet, contrary to the normal framing of this disjunc- 
ture as a result of exogenous shocks to stable systems, 
I argue that many of the most glaring mismatches that 
have arisen in the United States should be seen instead 
as a direct outgrowth of political struggle—a manifes- 
tation of an important but often hidden “second face” 
(Bachrach and Baratz 1962) of welfare state debate. No 
less important, I emphasize that crucial policy changes 
have in fact taken place over the past three decades, 
despite general stability in formal policies. Their key 
source, however, is not large-scale legislative reform, 
but a set of decentralized and semiautonomous pro- 
cesses of alteration within existing policy bounds. Thus, 
in focusing on active changes in policy rules, analysts 
have missed fundamental ways in which the welfare 
state’s role and effects are changing. 

The implications of this argument therefore extend 
well beyond social policy reform, intersecting with in- 
creasingly prominent questions in institutional theory 
about the causes and character of institutional change 
(see, e.g., Pierson 2004 and Thelen 2003). The central 
implication is that there is not one single pattern of 
institutional change, whether it be the “big bangs” of 
sudden transformation or the “silent revolutions” of 
incremental adjustment. Rather, institutional change 
takes multiple forms, and strategies for institutional 
change systematically differ according to the character 
of institutions and the political settings in which they 
are situated. By exploring these sources of variation, 
I show that actors who wish to change popular and 
embedded institutions in political environments that 
militate against authoritative reform may find it pru- 
dent not to attack such institutions directly. Instead, 
they may seek to shift those institutions’ ground-level 
operation, prevent their adaptation to shifting external 
circumstances, or build new institutions on top of them. 
These are strategies for change that are little studied 
and even less well understood. They are also strategies, 
I shall demonstrate, that critics of the welfare state— 
rebuffed in their direct assaults on social programs— 
have increasingly attempted to pursue, sometimes with 
considerable success. 

The choice of the United States as the focus of these 
claims may appear unconventional. Analysts who dis- 
agree on much typically view the American welfare 
state as lying on a wholly different plane from other 
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regimes, or at least on the outer frontier of the “lib- 
eral” world of meager, market-oriented welfare states 
(Esping-Andersen 1990). Nor can it be denied that the 
recent American experience is distinctive in a number 
of crucial respects (Smeeding 2002). At the same time, 
however, the United States, with its multiple institu- 
tional “veto players” (Tsebelis 1995), has long been 
treated as the quintessential example of welfare state 
resilience, indeed, as the principal validating case of the 
leading approach to retrenchment, the “new politics of 
the welfare state” perspective associated with the work 
of Pierson (1994, 1996). If, as I argue, the surface stabil- 
ity of U.S. social programs has in fact masked major de- 
clines in collective protection, then a strong case can be 
made that prevailing analytic perspectives have over- 
looked critical dimensions of policy change—abroad as 
well as in the United States. Moreover, certain unusual 
aspects of the U.S. framework are becoming increas- 
ingly common elsewhere, making the American expe- 
rience a guide to the long-term effects of these nascent 
but powerful trends! 


THE ANALYSIS OF RETRENCHMENT 


The beginning of the recent wave of interest in re- 
trenchment can be conveniently dated to Pierson’s 
(1994) pathbreaking book on welfare state reform in 
Britain and the United States, Dismantling the Welfare 
State? A chief reason for the influence of the book is 
its precision about the dependent variable. “Retrench- 
ment,” notes Pierson, “is one of those cases in which 
identifying what is to be explained is almost as difficult 
as formulating persuasive explanations for it.” Spend- 
ing cuts alone do not exhaust the definition; analysts 
need also to consider structural reforms that move the 
welfare state toward a more “residual” role, in which 
government does little to shift the distribution of in- 
come and services in a progressive direction. Retrench- 
ment thus describes “policy changes that either cut 
social expenditure, restructure welfare state programs 
to conform more closely to the residual welfare state 
model, or alter the political environment in ways that 
enhance the probability of such outcomes in the fu- 
ture” (Pierson 1994, 17). The last of these—long-term 
changes in the political environment—Pierson labels 

“systemic retrenchment,” to distinguish it from imme- 
diate changes in programs, which he terms “program- 
matic retrenchment.”* 

Having defined retrenchment, Pierson goes on to 
evaluate the success of British and U.S. conservatives 
in pursuing it. He concludes that “the fundamental 
structure of social policy remains comparatively sta- 
ble” (Pierson 1994, 182). Expanding the welfare state 


1 On the increasing role of private benefits in rich nations, see Adema 
and Hinerhand 1998. 

2 In some respects, then, this article ıs as an attempt to revimt 
Pierson’s arguments about “systemic retrenchment.” For the most 
part, however, the changes I describe fall between systemic and 
programmatic retrenchment, involving the creation of new policies, 
internal changes that occur without formal revision, and erosion of 
programs in the face of external change. 
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involved imposing diffuse costs in return for concen- 
trated benefits. Cutting social programs, by contrast, 
entails imposing concentrated costs in return for dif- 
fuse gains—a far more difficult political prospect. More 
important, social programs are popular, and they have 
created powerful constituencies well positioned to fight 
retrenchment. In short, the chances for retrenchment 
are—to use a phrase Pierson deploys in more recent 
writings—highly “path dependent” (Pierson 2000). 
Past social policy choices create strong vested interests 
and expectations, which are extremely difficult to undo 
even in the present era. 

Pierson’s argument is logical, and it carries a straight- 
forward prescription—namely, that analysts should 
study efforts to introduce residualizing reforms into 
existing programs. A large body of writing has followed 
this prescription and, in doing so, made major advances 
in our understanding of welfare state reform. Indeed, 
even predominantly quantitative work now routinely 
concedes that analysis of retrenchment requires care- 
ful probing of political decision-making to verify that 
spending trends actually reflect collective choices that 
alter public programs (e.g., Huber and Stephens 2001). 

For all its virtues, however, Pierson’s approach also 
has real limits? The first and simplest is its empha- 
sis on authoritative changes in existing social welfare 
programs. Although this may seem an obvious focus— 
after all, changing formal policies is a central means of 
changing the distribution of social benefits—it excludes 
from consideration a host of “subterranean” (Hacker 
2002, 43) means of policy adjustment that can occur 
without large-scale policy change: from “bureaucratic 
disentitlement” (Lipsky 1984) caused by the decisions 
of front-line administrators to decentralized cutbacks 
in social welfare benefits caused by the actions of non- 
governmental benefit sponsors and providers. Perhaps 
more important, in emphasizing affirmative decisions, 
Pierson also excludes from consideration a wide range 
of agenda-setting and blocking activities that may well 
be quite crucial in shaping the welfare state’s long-term 
evolution. Like the pluralists of the 1950s and 1960s, 
retrenchment scholars have assessed power mainly by 
tracing observable decisions. The influential critique 
made against pluralism thus carries weight here too: 
By looking only at affirmative choices on predefined 
issues, retrenchment analyses tend to downplay the im- 
portant ways in which actors may shape and restrict the 
agenda of debate and prevent some kinds of collective 
decisions altogether. 

Most critical in this regard are deliberate attempts 
to prevent the updating of policies to reflect changing 
social circumstance. In the struggle over health care 
reform in the early 1990s, for example, advocates of 
expanded government responsibility embarked on an 
ambitious campaign to extend health coverage to coun- 
teract the declining reach of private health benefits 
(Hacker 1997; Skocpol 1996). Their efforts, in turn, fell 
victim to a concerted counter-mobilization among af- 





3 To be fair to Pierson, he has acknowledged some of these limits 
(Pierson 2001, 2002) and conceded that he underestimated the extent 
of retrenchment in Britain durmg the 1980s 


fected interests and political conservatives, who denied 
that government should step in to deal with the increas- 
ing hardships caused by skyrocketing costs and dwin- 
dling protections. Whether these efforts were necessary 
or unnecessary, poorly executed or simply doomed to 
fail, their defeat had enormous implications for the 
scope and character of U.S. social policy, as well as for 
judgments about the relative influence of pro- and anti- 
welfare-state forces in American politics. Yet from the 
standpoint of the conventional approach to retrench- 
ment, the failure of health care reform is a nonevent. 

This example only hints at the broad range of political 
processes and policy outcomes occluded by a single- 
minded focus on formal policy change. Historically, 
welfare states have been directed not just toward en- 
suring social protection against medical costs, but also 
toward providing security against a number of major 
life risks: unemployment, death of a spouse, retirement, 
disability, childbirth, poverty. Yet the incidence and ex- 
tent of many of these risks have changed dramatically 
over the past three decades, leading to potentially sig- 
nificant transformations in the consequences of social 
policy interventions, even without formal changes in 
public social programs. To be sure, we should not as- 
sume that the welfare state should naturally adjust to 
deal with changing risk profiles, or that gaps between 
risks and benefits are always deliberate—as they clearly 
are, for example, in the case of active attempts to pre- 
vent policies from being updated to achieve their histor- 
ical goals in the face of demands to upgrade them And 
yet, we cannot ignore these disjunctures either. Welfare 
states, after all, constitute institutionalized aims as well 
as an arsenal of policy means for achieving them, and 
their development over time must be assessed in that 
dual light. 

In fact, even within the relatively narrow concep- 
tion of the welfare state that Pierson adopts, there 
are important policies he largely overlooks. Notable 
here are two overlapping policy realms central to the 
U.S. social policy framework: tax expenditures with so- 
cial welfare purposes and regulatory and tax policies 
governing privately provided social welfare benefits 
(Hacker 2002; Howard 1997).* Recent OECD research 
shows that the United States has an extremely large 
employment-based private benefit system that is ex- 
tensively buttressed and shaped by government pol- 
icy (Adema 1999; Adema and Einerhand 1998; Adema 
et al. 1996): Controlling for tax burdens, for example, 
such “publicly subsidized and regulated private bene- 
fits” (Hacker 2002, 11) constituted more than a third 
of U.S. social spending in 1995. Furthermore, the distri- 
bution and character of private benefits have changed 
dramatically in recent decades, with rates of coverage 
plummeting among lower-income workers and benefit 
plans providing increasingly insecure income guaran- 
tees. Leaving policies that govern private social benefits 
out of the analysis entirely, as nearly all retrenchment 


4 These policy areas correspond nicely with Richard Titmuss’s (1976) 
categories of “fiscal welfare” and “occupational welfare ” For an ar- 
gument along smilar lines about the distinctive regulatory basis of 
Australia’s welfare state, see Castles and Mitchell 1993. 
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studies do, thus misses a critically important dimension 
of social policy change, particularly in the United States. 


EVERYDAY FORMS OF RETRENCHMENT: 
DRIFT, CONVERSION, AND LAYERING 


The changes that have taken place within the world of 
private benefits are an example of what I term “drift”— 
changes in the operation or effect of policies that occur 
without significant changes in those policies’ structure.’ 
The major cause of drift in the social welfare field is a 
shift in the social context of policies, such as the rise of 
new or newly intensified social risks with which existing 
programs are poorly equipped to grapple. The hallmark 
of change of this sort is that it occurs largely outside 
the immediate control of policymakers, thus appearing 
natural or inadvertent. The question for policymakers 
becomes whether and how to respond to the growing 
gap between the original aims of a policy and the new 
realities that shifting social conditions have fostered. 

Esping-Andersen (1999) and others who discuss this 
type of change imply that it is largely an apolitical 
process. To the extent that arguments in this vein con- 
cern the politics of reform, their ambition is limited to 
explaining welfare state responses to the disjunction 
between risks and benefits once that disjunction has 
become apparent. Yet the emergence of risk—benefit 
mismatches should itself be seen as a process that is 
highly mediated by politics. In an environment of new 
or worsening social risks, opponents of expanded state 
responsibility do not have to enact major policy reforms 
to move policy toward their favored ends. Merely by 
blocking compensatory interventions designed to ame- 
liorate intensified risks, they can gradually transform 
the orientation of programs. Of course, social policy 
drift may sometimes be wholly inadvertent. But much 
of it is quite clearly mediated by politics, a result not 
of failures of foresight or perception, but of deliberate 
efforts by political actors to prevent the recalibration 
of social programs. 

An example from the post-1970s American experi- 
ence will help to clarify the point. In 1974, Congress 
passed the Employee Retirement Income Security Act 
(ERISA) to regulate employment-based fringe ben- 
efits, especially pensions. Virtually unnoticed at the 
time was a seemingly minor clause that exempted from 
state-government regulation all health plans directly fi- 
nanced by employers (commonly called “self-insured” 
plans). Prior to ERISA, states had sole authority to reg- 
ulate private health insurance, and most health plans 
were independently run by insurers and covered mul- 
tiple workplaces, pooling medical risks across many 
firms. Yet in the wake of the sweeping law, as the 
states became by default the primary locus of health 
coverage expansions, ERISA’s “preemption” of state 
regulation increasingly thwarted efforts to stem the ris- 
ing tide of medically uninsured Americans (Gottschalk 
2000). Seeing an opportunity to escape regulation and 
limit their sharing of risk with other firms, corpora- 
tions rushed to set up self-insured plans. And, crucially, 


5 See the discussion of “utility drift” in Rae 1975. 
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once they did so, they needed only to prevent revision 
of ERISA’s preemption clause to hold new govern- 
ment interventions at bay—an aim they relatively easily 
achieved, given their lobbying strength, the complex- 
ity of the issue, and the status-quo bias of American 
political institutions. The ability of employers and their 
allies to block a government response to the continued 
decline of risk-pooling in American health insurance is 
thus a textbook example of politically mediated policy 
drift. 

Drift is not, however, the only means by which poli- 
cies may change without formal revision. In addition, 
what Kathleen Thelen (2003) terms “conversion” may 
also cause ground-level change. According to The- 
len, conversion occurs when “existing institutions are 
redirected to new purposes, driving changes in the 
role they perform and/or the functions they serve.” 
Although Thelen does not put the issue this way, adap- 
tation of policies through conversion reflects the reality 
that most institutions or policies allow actors working 
within their constraints to pursue multiple ends. This is 
one reason why institutions are not simply, as William 
Riker (1980) has put it, “congealed tastes” for favored 
states of the world—identical, at root, to any other col- 
lective choice. Instead of single-use tools, institutions 
are usually versatile multitaskers (Schickler 2001), and 
this versatility is itself a crucial variable shaping the 
strategies of actors who wish to change them. Although 
mutability of this sort is particularly characteristic of 
political institutions, it is also true of many large-scale 
public policies—which, as institutional frameworks for 
the achievement of complex ends, frequently grant sub- 
stantial flexibility to those carrying out their mandates. 

Consider for a moment a highly simplified model of 
the options open to political actors who wish to change 
an existing policy. In the starkest calculation, they must 
decide whether to “work within” this extant policy 
framework to achieve their ends or “work outside” it by 
revising or eliminating it. Seen this way, it immediately 
becomes clear that two questions loom large. First, how 
easily can these actors achieve their aims through the 
existing framework? And, second, how costly would it 
be to replace it with a policy more closely tailored to 
the ends they desire? If the answer to the first question 
is “very easily,” then the actors may pass up challeng- 
ing even a policy that would be relatively costless to 
change. If the answer to the second question is “very 
costly,” then they may try to work within even a policy 
framework that is heavily biased against the ends they 
seek. 

The place to begin, then, is to distinguish between 
“internal” policy changes that occur without formal re- 
vision, on the one hand, and formal policy changes, on 
the other. The ability to alter a policy internally is influ- 
enced primarily by a policy’s specific characteristics: its 
structure, its goals, its distinctive “feedback” (Pierson 
1993) effects. Some policies, for example, have clear and 
consistent goals; others do not. Some have procedures 


6 Large-scale policies are not usually treated as institutions. Yet, as 
relatrvely enduring sets of rules that shape and constram behavior, 
they are in fact consistent with most definitions 
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that are clearly specified and understood; others do not. 
Some give central leaders strong tools for controlling 
front-line agents; others do not. At one extreme, then, 
are policies whose dictates are unambiguous and whose 
front-line agents have little discretion. On the other 
are policies whose rules are opaque and contested, and 
whose implementation by front-line agents is highly 
variable. In general, the conversion of a policy should 
be easier when it delegates administration or lacks clear 
overarching rules or aims, as in decentralized federal- 
state programs or subsidy arrangements that shape vol- 
untary private benefits. 

In the realm of social policy, public retirement pro- 
grams provide perhaps the best example of the first pole 
of the continuum, while tax breaks for voluntarily pro- 
vided workplace benefits exemplify the second (Hacker 
2002; Howard 1997). Traditional retirement programs 
base benefits on minutely specified formulae that ac- 
count for nearly every aspect of workers’ careers and 
earnings: Once in place, public pensions virtually run 
themselves. In contrast, tax breaks for voluntary work- 
place benefits usually allow employers quite wide dis- 
cretion in the structure and level of benefits, who they 
are (and are not) offered to, and, ultimately, whether 
they are offered at all. Indeed, within the typically loose 
constraints of the tax law, employers are free to use 
pensions and health benefits—or to not use them—for 
whatever goals they please: employee goodwill, human 
capital formation, asset accumulation, union thwarting, 
and a host of other ends (Hacker 2002). 

Nearly all retrenchment studies, however, restrict 
their scope to policies with explicit and elaborate rules 
governing eligibility and benefits—such as pensions, 
unemployment insurance, and sick pay. These are poli- 
cies for which it makes sense to begin by focusing nar- 
rowly on policy rules and attempts to change them. And 
yet there are many key realms of social policy in which 
the link between policies and effects is much weaker. 
To take an example just mentioned, regulatory and tax 
policies governing private benefits leave virtually un- 
fettered discretion to employers, allowing companies to 
change what they do within these guidelines fundamen- 
tally. This is an extreme but not unique example: Many 
social policies divide authority between units of govern- 
ment or between government and private actors, such 
as providers, unions, and employers. And even pro- 
grams run entirely by public organizations may allow 
significant “street-level bureaucracy” (Lipsky 1980), 
making problematic the assumption that what a policy 
dictates is what is actually done. Moreover, such decen- 
tralized arrangements are, it appears, becoming more 
prevalent (Clayton and Pontusson 1998; Gilbert 2002; 
Rein and Wandensjö 1997). If this is so, it may become 
increasingly difficult to judge policy effects simply by 
reading statute books or examining disputes over policy 
rules. We will need to look at what really happens on 
the ground. 

The architecture of a policy, however, is not all that 
matters. A less studied but no less important force shap- 
ing the internal mutability of policies is the degree to 
which a policy gives rise to self-reinforcing “policy feed- 
backs” that cement in place stable constituencies, op- 


erating procedures, and definitions of mission (Mettler 
and Soss n.d.; Pierson 1993). Although research on 
the feedback effects of policies is still in its youth, it 
is clear that social policies do differ markedly in the 
extent to which they give rise to politically efficacious 
support coalitions. Social Security, for example, pro- 
motes widespread mobilization among the aged, who 
are well poised to fight cuts (Campbell 2003). Cash 
assistance for the poor, by contrast, gives rise to an 
extremely weak, fragmented, and politically demobi- 
lized constituency, which was unable to present an ef- 
fective and united front against the 1996 welfare re- 
form law (Soss 2000; Weaver 2000). In general, policies 
are more durable if they create or encourage the cre- 
ation of large-scale organizations with substantial setup 
costs, directly or indirectly benefit sizable organized 
groups or constituencies, and embody long-lived com- 
mitments upon which beneficiaries and those around 
them premise crucial life and organizational decisions 
(Hacker 2002, 55). 

In contrast to policy conversion, which hinges prin- 
cipally on policy-specific factors, the ability to formally 
alter a policy is mainly determined by the basic decision 
rules and partisan balance that characterize a political 
system. As institutionalists have long argued, opportu- 
nities for policy change are systematically shaped by the 
distribution of decision-makers’ preferences regarding 
the status quo and alternatives to it, as well as by key in- 
stitutional features of political systems, particularly the 
degree to which procedural rules create a status-quo 
bias (Immergut 1992; Krehbiel 1998; Tsebelis 1995). 
According to George Tsebelis’s (1995) “veto players” 
framework, for example, policy stability increases when 
more actors or decision-making bodies must assent to 
change, when the ideological distance between them is 
greater, and when they are more internally cohesive. 
All this suggests that the American political context of 
the 1980s and 1990s—with bicameral and presidential 
divisions, frequent periods of split party control, and 
increasingly polarized parties—was particularly inhos- 
pitable to large-scale legislative change. 

In sum, although the prospects for internal policy 
change are shaped by a policy’s specific characteristics, 
formal policy change depends principally on whether 
the basic political structure and partisan context privi- 
leges the status quo. When it does, pragmatic advocates 
of change may find it more attractive to adapt existing 
policies to their ends than to wage a frontal assault. 
For this reason, political settings that militate against 
authoritative change encourage reformers to seek the 
conversion or erosion of existing policies. In these con- 
texts, not only do reformers find it difficult to establish 
new policies or replace existing policies, but they are 
also better able to block efforts to close gaps between 
a policy’s original aims and its actual effects. 

Figure 1 sums up the argument. As the bottom-right 
quadrant indicates, when a policy is both easily con- 
vertible and situated in a change-conducive political- 
institutional setting, it is highly vulnerable to formal 
revision, whether through reform, replacement, or 
elimination. This is the type of change with which vir- 
tually all institutional and choice-theoretic models of 
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policy formation are concerned. It is also, quite obvi- 
ously, not the normal state of affairs in welfare state 
politics. 

The most illuminating possibilities for the study of 
retrenchment, therefore, lie in the other three quad- 
rants. When existing policies resist conversion but the 
politicalinstitutional context permits the creation of 
new policies, the dominant pattern of change is likely 
to be what Eric Schickler (2001, 13) terms “layering,” 
in which proponents of change work around institu- 
tions that have fostered vested interests and long-term 
expectations “by adding new institutions rather than 
dismantling the old.” When the political—institutional 
context poses formidable barriers to authoritative re- 
form but a policy is highly mutable, by contrast, the 
dominant pattern is instead likely to be “conversion” 
(Thelen 2003), in which policies are adapted over time 
rather than replaced or eliminated. Drift, for its part, is 
most likely when a policy poses high hurdles to internal 
conversion (meaning it is hard to shift it to new ends) 
and when the status-quo bias of the external political 
context is also high (meaning it is hard to eliminate 
or supplant existing policies). Drift, as noted, may be 
inadvertent. Or it may be the result of active attempts 
to block adaptation of institutions to changing circum- 
stances, Finally, all these forms of change, if successful 
in undermining support coalitions or the ability of poli- 
cies to achieve their goals, should increase the ability 
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to convert, alter, or eliminate existing policies in the 
future. 

As we shall see, each of these forms of retrenchment 
was on display in the 1980s and 1990s. Drift was the most 
pervasive dynamic, as critics of the welfare state grew 
increasingly adept at using the famously fragmented 
American political system to block legislative reforms 
that would close the growing gulf between social risks 
and benefits. It was not, however, the only pattern of 
the period. When policies posed opportunities for de- 
centralized cutbacks, either because support coalitions 
were weak or policies relied on public-private or inter- 
governmental partnerships, opponents of the welfare 
state were quick to seize them. And when insurgents 
gained sufficient leverage to enact legislative reforms 
(yet not to dismantle existing policies outright), the em- 
phasis correspondingly shifted away from conversion 
and the encouragement of drift and toward the layering 
of new policies onto old. Layering in fact aptly describes 
conservatives’ use of openings in the early 1980s (due to 
Reagan’s election), the late 1990s (due to the GOP cap- 
ture of Congress), and the early 2000s (due to unified 
Republican control of Congress and the White House) 
to create tax breaks encouraging individualized private 
benefits that compete with public programs. 

Because these changes largely occurred without for- 
mal revision, examining them call for an analysis at- 
tuned to the internal reworking of otherwise stable 
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policies and the shifting interaction of policies and their 
environment. This is, of course, a formidable challenge. 
We are a long way from having good data on what has 
happened to benefit rules (but see the fledgling efforts 
by Korpi and Palme 2003; and Scruggs and Allan 2003), 
much less on how these rules are implemented or actu- 
ally affect citizens. But the claim that drift, conversion, 
and layering are crucial does carry with it prescrip- 
tions that run counter to the methodological thrust of 
much previous work on retrenchment. First, and most 
straightforward, it suggests that we should be interested 
not only in the structure of policies, but also in their 
effects—not only in rules governing benefits or eligibil- 
ity, that is, but also in the outcomes that those rules pro- 
duce as they are actually carried out by front-line policy 
actors in the context of other sources of social protec- 
tion and shifting social conditions. Second, and no less 
important, it indicates that our explanations must take 
seriously the prospect that policy reformers will seek to 
change policies without formal revision, employing in- 
stead less visible means of change. In all these inquiries, 
however, one question should be central: Have welfare 
states continued to provide the inclusive risk protection 
that once defined their structure and goals? 


NEW SOCIAL RISKS, OLD SOCIAL POLICIES 


Despite many observations about the “new social risks” 
and welfare state rigidities in coping with them, the 
changing ability of social policies to deal with major life 
contingencies has not been intensively studied. This ne- 
glect reflects a larger blind spot in the vast literature on 
the welfare state. While everyone knows that welfare 
states serves vital insurance functions, most commen- 
tary assumes rather reflexively that income redistribu- 
tion is, if not the defining goal of social programs, at least 
the strongest indicator of their performance.’ Yet the 
reasons for making risk protection a key independent 
topic of concern are compelling. Not only are the largest 
social programs—pensions, health insurance, unem- 
ployment compensation, survivors’ benefits—centrally 
about insuring against risks to income, but also many as- 
pects of the welfare state that we do not typically think 
of as risk protection (such as child care and worker re- 
training) contain important insurance elements insofar 
that they cushion families against the income shock of 
major life events. 

Risk protection and income redistribution are re- 
lated but distinct. Although social insurance does re- 
distribute income, its principal goal is to “moderate the 
risks of current income loss or inadequacy by providing 
secure cash or near-cash entitlements on the occurrence 
of defined risks” (Graetz and Mashaw 1999, 65). The 





7 Thus Huber and Stephens 2001 Innit their definition of retrench- 
ment to policy changes that decrease the degree of redistribution 
from rich to poor (and from men to women) 

8 On the central place of social insurance in the welfare state, see, in 
particular, Baldwin 1990, Barr 1998, Iversen and Soskice 2001, and 
Moene and Wallerstein 2001 For more general discussions of risk 
and risk protection, see Baker and Simon 2002, Beck 1992, and Moss 
2002 


bounds of social insurance thus delimit the scope of 
shared risk—the degree to which potent threats to in- 
come are spread across citizens of varied circumstances 
(risk socialization) or left to individuals or families to 
cope with on their own (risk privatization). To “priva- 
tize” risk, in this parlance, is thus to fragment and un- 
dermine collective insurance pools that offer reduced- 
cost protection to higher-risk and lower-income citizens 
in favor of arrangements that leave individuals and fam- 
ilies responsible for coping with social risks largely on 
their own. 

Intuitively, the boundaries of such collective risk 
pools can be changed in three ways. The first is explicit 
alterations of rules governing eligibility or benefits— 
the subject of most retrenchment analyses. The second 
and more subtle means is a change in those rules’ imple- 
mentation. Do all those eligible receive legally specified 
benefits? Do policies permit discretion on the part of 
administrators or providers? The final source of change 
is a shift in the constellation of risks itself. Risks may 
become more severe, leading to an effective decline 
in protection, or new risks can arise that fall outside 
the universe of shared responsibility. As already noted, 
neither of the latter two forms of change is likely to be 
picked up by the conventional focus on active reform. 
Nor, it should be said, are these forms of change likely 
to be captured fully by data on redistribution, which can 
tell us whether more or less is redistributed at any time, 
but not how well policies protect citizens over time.’ 

Aboutone point there can be little question: The con- 
stellation of risks that citizens face has changed signif- 
icantly in the past three decades due to linked changes 
in work and family (Esping-Andersen 1999; Katz 2001; 
Skocpol 2000). In the employment sector, the shifts in- 
clude rising levels of earnings inequality, growing in- 
stability of income over time, increased employment 
in services and in part-time and contingent work, and 
increased structural (rather than cyclical) unemploy- 
ment. In the realm of family relations, the changes in- 
clude rising rates of divorce and separation, declining 
fertility (a root cause of population aging), and the in- 
creasing prevalence of single-parent families. Connect- 
ing the two domains is perhaps the most important and 
fundamental shift in the world of work and family—the 
dramatic movement of women into paid employment. 
Each of these changes has placed new strain on social 
protections constructed during an era in which the risks 
that families faced flowed almost entirely out of the em- 
ployment status of the male breadwinner. In the brave 
new world of work and family, even stable full-time 
employment of household heads is not a guarantee of 
economic security, and citizens are barraged by a host 
of risks emanating from families themselves. 

Foremost among the economic changes is a major 
transformation in the employment opportunities and 





9 In response, some scholars have turned to panel studies of mcome 
(Burkhauser and Duncan 1991, DiPrete and McManus 2000, and 
Goodin et al. 1999) But although this research 1s longitudinal, it 
does not currently permit assessments of the extent to which family 
income dynamics have changed over time (the only exception is the 
preliummary findings reported shortly). 
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earnings of less skilled male workers that began in the 
1970s. In a startling break with the past, “the earnings of 
less skilled American men began dropping after 1973 
and fell precipitously during the 1980s” (Blackburn, 
Bloom, and Freeman 1990, 31). Moreover, average 
rates of unemployment among these workers escalated 
dramatically, and the nature of unemployment also 
changed, shifting from cyclical layoffs during economic 
downturns toward permanent job losses (Farber 2003). 
At the same time, employment in the (often low-wage) 
service sector and in part-time and contingent positions 
that offered relatively low pay and few or no benefits 
increased, and the median length of tenure of male 
workers dropped significantly. 

The most easily tracked manifestation of these trends 
is a marked increase in economic inequality. Between 
1979 and 2000, for example, the post-tax and -transfer 
income of the top 1% of American households on the 
income scale increased by 201% in real terms, and that 
of the top fifth by more than 68%. By contrast, the post- 
tax and -transfer income of the bottom fifth of house- 
holds increased by just 8.7%, while that of the second 
fifth and middle fifth rose by 13.3% and 15.1%, respec- 
tively (Greenstein and Shapiro 2003). The growth in 
inequality of wealth during this period was even more 
dramatic (Wolff 2002). 

This is, to many, the story of the post-1970s Ameri- 
can experience: the reversal of longstanding expecta- 
tions about rapid across-the-board rises in standards of 
living. Yet simultaneously, and in many ways in con- 
cert, the 1970s ushered in equally profound changes in 
American families. Most striking by far was the contin- 
ued entry of women into the paid workforce, a trend 
that by 2000 had made two-earner families, once an 
exotic species, the majority of married couples. The 
increasing prevalence of two-earner families must be 
seen in part as a private response to the economic pres- 
sures families face—a form of intrafamily risk sharing 
that decreases vulnerability to interruptions of earnings 
or the high cost of services that housewives once pro- 
vided yet, at the same time, increases the probability 
that a breadwinner will be subject to earnings losses 
(Warren and Tyagi 2003). 

But if two-earner families became more common, 
marriages did not become more durable. Rates of 
divorce and single parenthood (in most cases, mother- 
hood) increased dramatically. Lone mothers are dispro- 
portionately less educated women, who have increas- 
ingly delayed marriage but not child-bearing, in part 
because the men they are most likely to marry have 
suffered economically. More educated women, by con- 
trast, are delaying child-bearing but not marriage and 
having fewer children, in part because the opportunity 
costs of child-bearing have risen (Ellwood and Jencks 
2001). 

Whatever their causes, these changes in family struc- 
ture are clearly a significant contributor to inequality 
and hardship. The rise of two-earner families exacer- 
bated family income inequality because high-earning 
women tend to marry high-earning men. On the other 
side of the coin, single-parent families are, unsurpris- 
ingly, much more likely to have low incomes than two- 
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parent families. And with dual paychecks now a pre- 
requisite for middle-class life, divorce and separation 
have come to represent potent risks to family well- 
being. A partial glimpse of these effects can be gleaned 
from statistics concerning the characteristics of people 
in poverty. Although poverty rates dipped in the strong 
economy of the late 1990s, they rose over the 1970s and 
1980s and are rising again. But no less striking than the 
overall rise is the change in the characteristics of those 
affected: Poverty among the elderly fell sharply in the 
1970s and has remained relatively low since, while a siz- 
able and increasing portion of the poverty population 
is made up of parents with young children. 

A similar, but in many ways more nuanced, por- 
trait is provided by evidence on the number and char- 
acteristics of Americans filing for bankruptcy. As is 
well known, personal bankruptcy has risen dramati- 
cally, with filings increasing fivefold between 1980 and 
2002, to more than 1.5 million (White 2003, 1). Less 
well known is that the characteristics of filers have also 
changed. Elizabeth Warren (2003) reports, for example, 
that women have emerged as the largest single group 
of filers, their share of filings rising eightfold between 
1981 and 2001. Revealingly, half of filers cite health 
problems, childbirth, a death in the family, or substan- 
tial medical bills as a prime reason for filing. By com- 
parison, a 1970s study found just 11% of filers citing 
one or more of these reasons in 1964 (cited in Jacoby, 
Sullivan, and Warren 2001). 

The rise in economic inequality and the changing 
character of the poor and bankrupt are each strongly 
suggestive of the changing composition of social risks 
that citizens face. Yet perhaps the most powerful evi- 
dence of increased risks to family income is the growing 
instability of income over the past two decades. Robert 
Moffitt and Peter Gottschalk (2002), for example, have 
documented a marked increase in the variability of 
male wages during the 1970s and 1980s—an increase 
driven more by instability of wages than by instability 
of employment. Looking specifically at family income, 
I and Nigar Nargis of the University of Dhaka have 
recently traced changes in volatility over the past three 
decades using the Panel Study of Income Dynamics 
(see Hacker 2003). The analysis confirms that income 
inequality across families increased dramatically over 
this period. Our results also show, however, that the 
over-time variance of family income more than doubled 
between 1974 and 1998, even when controlling for fam- 
ily size and factoring out the secular increase in mean 
income. Indeed, at its most recent peak in the mid- 
1990s, family income was roughly five times as unstable 
as it was in the early 1970s. This is a potent indication 
of the increased risks to income that American families 
confront. 

These trends have exposed serious gaps in the 
American framework of social protection—which, 
although widely criticized, is also widely misunder- 
stood (Marmor, Mashaw, and Harvey 1990). Compar- 
ative researchers, for example, commonly describe the 
American welfare regime as one in which “bene- 
fits cater mainly to a clientele of low-income, usually 
working-class, state dependents” (Esping-Andersen 
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1990, 26). But although public social spending is lower 
in the United States than in other affluent democra- 
cies, public cash assistance for the poor represents only 
a tiny fraction of the total, and means-tested benefits 
as a whole make up less than a third. This picture is 
considerably reinforced when we consider tax expen- 
ditures and private social benefits, both of which pri- 
marily benefit upper-income Americans (Hacker 2002; 
Howard 1997). 

The bulk of public and private social spending in the 
United States, as in other rich democracies, is devoted 
to major areas of social insurance—particularly health 
insurance and pensions. In part because the United 
States is the only nation in which contributory public 
health insurance is limited to the aged, public spending 
is highly concentrated on the elderly (Lynch 2000). By 
contrast, public and private support for working adults 
and families with children is comparatively anemic. 
The United States lacks universal government health 
insurance and family allowances, benefit levels under 
casb-assistance programs that aid families are low and 
falling, public and private support for child care is ex- 
tremely modest, and employers have been reluctant 
to provide paid family leave even as they have cut 
back other benefits for spouses and children. Unlike 
Germany, Japan, and the Nordic countries, the United 
States also lacks universal long-term health care for the 
elderly. In some of the key areas affected by the new and 
newly intensified risks just examined, then, U.S. social 
policy was already comparatively meager at the outset 
of the period under study. If anything, as we shall see, 
that comparative meagerness has only become more 
glaring over the past three decades. 


THE ANEMIC AMERICAN RESPONSE 


In principle, U.S. social policy could have adapted to 
changing realities. As the pathbreaking feminist writ- 
ings on the welfare state show (e.g., Orloff 1993 and 
Stetson and Mazur 1995), some nations—most strik- 
ingly, the Nordic welfare states—have dramatically in- 
creased their provision of services that help families 
balance work and child-rearing. Many of these same 
nations have also tackled the new realities of the labor 
market with active employment and training polices 
(J. Levy 1999). Putting aside some modest exceptions, 
however, the United States clearly did not follow this 
path. Increases in the Earned Income Tax Credit for 
low-wage workers (Howard 1997), shifts of money from 
cash assistance to child care and job retraining, and 
new family leave legislation were all steps toward a 
response. But low-wage workers continued to receive 
only meager public supports. Family leave rules did 
not apply to small employers and did not provide in- 
come support to leave-takers. Government assistance 
for child care remained scant and frequently unavail- 
able even for eligible families (D. U. Levy and Michel 
2002). Despite newly intense job insecurity, unemploy- 
ment insurance contracted for lower-income and inter- 
mittent workers (GAO 2000). And although failing to 
uphold the direst predictions, the welfare reform legis- 
lation of 1996 removed important elements of the safety 


net for the most disadvantaged (for a comprehensive 
analysis, see Weaver 2000). Perhaps most striking was a 
massive decline in employment-based health and pen- 
sion protections among lower-wage workers—which 
was only weakly offset by public coverage expansions. 

Suzanne Mettler and Andrew Milstein (2003) pro- 
vide concrete dollar figures for some of these changes. 
The inflation-adjusted value of the minimum wage, 
unemployment benefits, and benefits under the Food 
Stamps and Aid to Families with Dependent Children 
programs all declined during this period, while union- 
ization rates plummeted in the face of aggressive anti- 
union policies. Although, as Pierson (1994) argues, de- 
clining unionization does not necessarily imperil public 
programs that enjoy strong support, it is difficult to 
deny that it has weakened the leverage of those who 
wish to reorient social policy toward new risks or that 
it has strengthened the political standing of employers, 
particularly in negotiations over private benefits. 

A further glimpse into these trends is provided by the 
cross-national measures of redistribution provided by 
the Luxembourg Income Study (LIS). The LIS statis- 
tics show that inequality before taxes and transfers rose 
sharply during the 1980s in the United States, which has 
the highest level of inequality among wealthy nations. 
Yet compared with other countries, the United States 
appears to have done considerably less to offset the 
global rise in inequality during this period. Averaging 
across the 12 other nations for which LIS data exist, 
for example, the reduction in inequality created by 
taxes and transfers increased 10% between the first and 
the last observations. In the United States, by contrast, 
taxes and transfers reduced inequality slightly less by 
the end of the.series (1997) than at the outset (1986). 
In short, income inequality increased dramatically in 
the United States, but income redistribution actually 
declined. This pattern stands in stark contrast to the 
experience of nearly every other advanced industrial 
democracy. 

It is important to emphasize that these were not un- 
contested issues. There were, most obviously, major 
attempts to scale back public social programs in the 
early 1980s and then after the ascendance of the GOP 
in Congress in 1994. Although these efforts had only 
limited success, they were not without effect. Perhaps 
more important, these struggles unquestionably helped 
produce a major shift in policy discourse, immortalized 
in President Clinton’s 1995 declaration that the “era of 
big government is over.” Although in both periods con- 
servatives quickly moved to protect themselves against 
charges that they were hostile to popular programs, the 
larger drift was clearly toward the conservative pole of 
the debate. Proposals for major structural reform of 
public programs gained ground, liberals found them- 
selves vying with conservatives over the depth of their 
commitment to make welfare recipients work, tax cuts 
that threatened future social spending passed into law, 
and calls for the creation of new social interventions 
all but vanished from public debate. This new climate 
has shaped the orientation and structure of the few new 
policy innovations that have been put in place, leading 
to an increased emphasis on tax expenditures, market 
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incentives, and private provision. In more decentral- 
ized and discretionary programs, it has also shaped the 
character of front-line administration and even, some 
evidence suggests, the degree to which citizens take 
advantage of benefits for which they formally qualify 
(D. U. Levy and Michel 2002; Zedlewski 2002). 

In addition, although few big new policy departures 
took place, a series of often-unnoticed incremental 
changes have produced, or seem likely to produce, sig- 
nificant longer-term effects. Most notable here are a de- 
liberate expansion of tax-favored investment accounts 
for retirement—sold as an alternative to both older 
company pension plans and Social Security—the cre- 
ation and expansion of opportunities for private health 
plans to contract with Medicare and Medicaid, and a 
significant loosening through both legislative changes 
and administrative processes (such as waivers) of fed- 
eral restrictions on state and local social welfare ac- 
tivities. Waivers, in fact, were deliberately used by the 
Republican-led executive branch in the late 1980s be- 
cause the “left was strong enough to veto certain poli- 
cies in the legislative context that it has been unable to 
stop when pursued through the waiver process” (Teles 
1998, 141)—a telling example of strategic adaptation to 
a political context preventing legislated policy reform. 
Moreover, all of these more subterranean changes, 
whether through drift, conversion, or layering, have 
been aided by the inherent difficulty in a fragmented 
polity of closing gaps that have opened between origi- 
nal policy aims and ground-level policy effects. 

Indeed, overshadowing and dominating these other 
events were active campaigns to block legislation that 
might extend social protections to new risks or limit 
the weakening of existing protections. The Family and 
Medical Leave Act, for example, passed in 1993 only 
after it was whittled down for more than a decade—and 
vetoed twice by President George H. W. Bush. But this 
was a (marginally) successful example: Most proposals 
to close the growing gap between social risks and ben- 
efits ended up in the political graveyard, stymied by 
fiscal constraints, actual or threatened filibusters and 
vetoes, and formidable conservative resistance. The sig- 
nal case of policy drift of this sort, as discussed earlier, 
is the failure to pass any proposal for expanded health 
coverage, despite declining private coverage, President 
Clinton’s strong advocacy, and public enthusiasm for 
action. 

This is an impressionistic tally, to be sure. But, we 
shall see, its message is confirmed by a closer review of 
recent developments in the two largest areas of U.S. so- 
cial policy: health insurance and pensions. These policy 
areas not only comprise the majority of social spend- 
ing in the United States (and, indeed, in all affluent 
democracies); in addition, by virtue of their size and the 
unambiguous popularity of the policies that constitute 
them, they are also widely seen as the most resilient 
components of the postwar welfare order. Yet as the 
next two sections detail, in both these bedrock areas, 
relative stability in public programs has masked major 
declines in the ability of social policies to provide inclu- 
sive risk protection. As both employment-based social 
benefits and government programs have eroded, so- 
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cial risks have shifted from collective intermediaries— 
government, employers, large insurance pools—onto 
individuals and families. Efforts to address new and 
newly intensified risks have failed, and new policies 
sharply at odds with established ones have been cre- 
ated and expanded. Although the paths of health and 
pension policy differ in crucial and revealing ways, their 
overarching trajectories appear the same: toward a sig- 
nificant privatization of risk. 


THE UNRAVELING OF AMERICAN HEALTH 
INSURANCE 


By the 1970s, the basic structure of American health 
insurance was firmly in place. For most Americans— 
more than 80% by the mid-1970s—private health in- 
surance provided the first line of protection against 
the risk of medical costs. Historically, employment- 
based health insurance was provided by large commer- 
cial and nonprofit insurers, which pooled risks across 
many workplaces (and, originally, even charged all sub- 
scribers essentially the same rate—a practice favor- 
able to higher-risk groups). Workplace health benefits 
were (and are) also heavily subsidized through the tax 
code, which treats virtually all workplace health ben- 
efits as exempt from taxation as compensation. e 
revenue loss created by this tax break exceeded $188 
billion in 2004 [Sheils and Haught 2004].) From 1965 on, 
the federal Medicare program provided public cover- 
age for elderly—and, later, nonworking disabled—and 
the joint federal-state Medicaid program covered poor 
people on public assistance. the working disabled, and 
the indigent aged. 

Since the 1970s, the private foundation of this system 
has undergone a radical contraction—in what amounts 
to a textbook case of drift and conversion within the 
bounds of stable formal policies. From a peak of more 
than 80% of Americans, private insurance coverage fell 
during the 1980s and early 1990s to less than 70%. 
Employment-based protection was the biggest casu- 
alty: Between 1979 and 1998, the share of workers 
who received health insurance from their own employ- 
ers fell from 66% to 54%—a trend that, in a growing 
workforce, translates into tens of millions of workers 
without protection (Medoff and Calabrese 2000). At 
the same time, employers have grown less willing to 
cover workers’ dependents, and they have required 
that workers pay a larger share of the cost of cover- 
age, which has discouraged some from taking coverage 
even when it is offered. The result has been a marked 
rise in the number of medically uninsured Americans. 
For more than a decade, the number of Americans 
uninsured for the entire year has been rising at the 
rate of about 1 million a year and now hovers around 
43 million, with some 75 million—one of three 
nonelderly Americans—uninsured at some point dur- 
ing a two-year period (Families USA 2003). Almost 
nine of 10 uninsured Americans live in families headed 
by at least one worker. 

The gravest effects have been felt by those on the pe- 
riphery of the labor market: the young, the low-skilled, 
the low-paid. Among the lowest-paid 20% of workers, 
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for example, the share who receive health benefits from 
their employers fell from almost 42% to just over 26% 
between 1979 and 1998 (Medoff and Calabrese 2000). 
These trends refiect multiple factors, including declin- 
ing unionization and changing employment patterns. 
But above all, they mirror the simple reality that med- 
ical costs have risen much faster than median wages, 
outstripping the ability of workers and their employers 
to finance protection (Kronick and Gilmer 1999). With 
employers free to drop coverage, and workers under 
financial pressure to decline ıt even when it is offered, 
the risk of medical costs is being shifted from insurers 
and employers onto workers and their familjes. 

This view is reinforced when we consider one of the 
most fundamental transformations in American health 
insurance since the 1970s: the rise of “self-insurance” 
among employers. As already discussed, corporate self- 
insurance—the paying of medical claims directly—was 
encouraged by the 1974 Employee Retirement Income 
Security Act, which protects self-insured health plans 
from most state insurance regulations and lawsuits in 
state courts. But an additional crucial underlying mo- 
tive for self-insurance has been the desire of larger 
employers to limit the cross-subsidization of the med- 
ical expenses of workers outside their own employ- 
ment pool. Rather than purchase insurance from ex- 
ternal companies that provide coverage to multiple 
firms (and, as noted, traditionally charged relatively 
similar rates to all subscribers), employers increasingly 
financed just their own workers’ claims, thereby pooling 
risks within—and only within—their own labor force. 
Self-insurance has thus seriously worsened the situ- 
ation of smaller employers, which have employment 
groups too small to self-insure safely, while encourag- 
ing private insurers to weed out subscribers with high 
expected costs. The chronically ill, the near-elderly, and 
those with expensive conditions have all faced increas- 
ingly serious barriers to obtaining insurance as a result. 

Meanwhile, employers (and in some cases unions, 
which jointly manage many self-insured plans) have 
joined with conservative politicians to beat back any 
attempt to revisit the provisions of ERISA that ex- 
empt self-insured health plans from regulation (see 
Gottschalk 2000). The ERISA Industry Committee, an 
organization of large employers created in 1976, has 
been perhaps the most vociferous champion of federal 
preemption of state regulation, supporting “legislation 
that preserves and strengthens ERISA preemption and 
reduces government interference with employers’ ef- 
forts to provide cutting-edge, comprehensive health 
care benefits to their employees” (ERIC 2003). As a 
consequence, government regulation of private health 
plans has changed relatively little since the mid-1970s, 
despite a massive swing away from inclusive risk pro- 
tection in the private sector. 

Although Americans’ prime source of health pro- 
tection is eroding, public programs have largely failed 
to fill the gap. Medicare—a centerpiece of U.S. social 
insurance—has essentially been caught in a holding 
pattern (Marmor 2000): Its popularity and the veto- 
ridden American political structure have prevented 
radical retrenchment, but it has grown increasingly in- 


adequate as costs have rapidly outstripped the pro- 
gram’s constrained spending. In a striking demonstra- 
tion of drift, Medicare beneficiaries devote a larger 
share of income to medical care today than they did at 
Medicare’s passage (Moon 1993, 10-11). At the same 
time, employment-based coverage for retirees and sup- 
plemental private benefits have been in a tailspin, as 
insurers and employers find that they cannot bear the 
risks Medicare does not cover. These risks are thus 
shifting by default to beneficiaries and their families. 

Medicare has not been static, of course. But few of 
the changes made can be described as expansionary. 
Even the prescription drug benefit enacted in 2003 will 
cover only a very small share of seniors’ expected drug 
expenses (while outlawing supplemental coverage that 
fills its huge gaps in protection). And other recent pol- 
icy changes, including some contained within the 2003 
prescription drug law itself, pose the possibility that 
Medicare’s protections could deteriorate even further. 
The crucial example here is Medicare contracting with 
private health plans, an effort at policy layering that 
originated in demonstration projects first pursued by 
the Reagan administration. Conservatives have aggres- 
sively pursued the transformation of contracting into 
a full-fledged system of competing, risk-bearing pri- 
vate plans, which they hope will undermine the uni- 
fied constituency that has blocked direct benefit cuts 
in the past. Although studiously careful not to chal- 
lenge Medicare directly, the strongest advocates of a 
competitive system clearly believe that the traditional 
program should, as Republican House Speaker Newt 
Gingrich infamously put it in 1995, “wither on the vine.” 
(Gingrich, in fact, was unusually candid about Medi- 
care reformers’ covert strategy, noting of Medicare that 
“we don’t get rid of it in Round One because we don’t 
think it’s politically smart” [Toner 1995].) 

In contrast, coverage of the poor has unquestion- 
ably grown: first, with federally mandated extensions of 
Medicaid in the 1980s and, second, with the creation of 
the state-federal Childrens’ Health Insurance Program 
(CHIP) in 1997. These were important expansions, all 
the more remarkable because they occurred in such a 
hostile climate. Before ending the story, however, three 
important points should be emphasized. First, the ex- 
pansion of Medicaid has only partially offset the decline 
in private coverage. Second, the trend toward expand- 
ing coverage appears to have run its course. And third, 
the 1996 welfare reform bill has created a massive exo- 
dus from the welfare rolls, with those who leave moving 
into the low-wage employment sector, where private 
coverage is rare. Millions eligible for CHIP and Medi- 
caid are not enrolled, and this is likely to become more 
true as time limits on welfare kick in. In sum, public 
coverage expansions appear more like Band-Aids on a 
festering wound than an inexorable expansion of public 
protection. 

In strategic terms, critics of Medicaid have been 
greatly aided by the joint federal-state structure of 
the program, which has facilitated cutbacks by foster- 
ing interstate competitive pressures in favor of bud- 
getary stringency, while making cutbacks more difficult 
to identify and assign responsibility for. Since 2000, 
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federal waivers have been aggressively used to en- 
courage state-based program restructuring by the Bush 
administration (Park and Ku 2001), which also hopes 
to shift from the current guaranteed matching for- 
mula to so-called block grants, in which the states 
are provided a fixed amount of funds. Like Medicare 
reform, Medicaid block grants last became a major 
issue in the mid-1990s—when, as now, advocates of 
block grants espoused “an ideological commitment to 
shrink the welfare state and return power to states from 
Washington” (Weaver 1996, 52). 

No discussion of the recent evolution of U.S. health 
insurance is complete without mention of the stunning 
defeat of the Clinton health plan—arguably the most 
dissected legislative failure in modern history (Hacker 
1997; Johnson and Broder 1996; Skocpol 1996). Rather 
than rehash the saga, I wish simply to emphasize that 
its defeat represents perhaps the best evidence of po- 
litically mediated policy drift. The Clinton health plan 
and its major competitors reflected a belief that the 
American policy of relying on voluntary employer pro- 
vision of health benefits was increasingly unworkable 
as a secure foundation for risk pooling. The opposi- 
tion to the plan, centered among hard-core political 
conservatives, employers, insurers, and private medi- 
cal interests, in turn reflected not simply the recog- 
nition that many of these groups would be immedi- 
ately hurt by the plan, but also the awareness that 
its passage would create a new and valued entitle- 
ment for anxious middle-class and working-class vot- 
ers whose long-term political allegiances were very 
much up for grabs. Thus conservative activist William 
Kristol warned that the Clinton plan would “relegit- 
imize middle-class dependence for ‘security’ on govern- 
ment spending and regulation” and “revive the reputa- 
tion of... the Democrats ...as the generous protector 
of middle-class interests” (quoted in Skocpol 1996). On 
the other side, Clinton explicitly cast his crusade as an 
effort to undo the policy drift of the past two decades— 
drift that had created, in the words of the White House’s 
Health Security report, “growing insecurity.” “From the 
1940s through the 1970s,” the report explained, “the 
United States made steady progress toward broader 
health care coverage. ... Beginning in the 1980’s, how- 
ever, the number of Americans lacking health insur- 
ance has increased steadily—while health care costs 
have increased at ever-rising rates” (Domestic Policy 
Council 1993). 

In the end, the Clinton plan was brought down by 
much the same dynamic that stymied conservatives’ 
efforts to dismantle public programs: the easily ignited 
fears of Americans that reform would compromise the 
social protections on which they relied—in this case, 
private insurance (Hacker 2002). But what is crucial to 
emphasize is that U.S. leaders debated whether social 
policy would adapt to the changing job market and de- 
clines in private protection. The privatization of risk 
in American health insurance occurred without major 
policy reforms, but it was very much a matter of political 
struggle. 

In sum, when one considers the broader framework 
of U.S. risk protection in health care, the direction of 
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change is clearly toward a marked narrowing of the 
bounds of collective protection, driven principally by 
the conversion and politically mediated drift of poli- 
cies away from their original scope and purpose. To be 
sure, major public programs have been preserved. The 
demise of conservative efforts to scale back Medicare 
and Medicaid in the mid-1990s is a powerful illustra- 
tion of the hurdles thrown up by American political 
institutions and the enduring popularity of established 
programs. But resilience in the overall framework of 
American health insurance has not prevented a ma- 
jor shift in the distribution and intensity of the risks 
faced by citizens. The Medicare program has stagnated 
in the face of rapidly rising costs. The Medicaid pro- 
gram has expanded, but not nearly enough to offset 
the implosion of private coverage. There has been a 
massive decline in private health protection, which has 
increasingly ceased to be available or affordable for 
lower-wage workers. Serious efforts to deal with this 
have been effectively blocked by a formidable constel- 
lation of ideologically committed opponents and vested 
interests. The outcome has been a significant privatiza- 
tion of risk. 


INDIVIDUALIZING RETIREMENT SECURITY 


The American approach to retirement security is also 
a public-private hybrid, blending public social insur- 
ance and employment-based benefits—and, increas- 
ingly, tax-favored savings accounts. But pension policy 
differs crucially from health policy in the respective 
roles of public and private benefits. Whereas Medicare 
and Medicaid emerged after the large-scale develop- 
ment of private health insurance, private retirement 
pensions largely built on top of the public foundation 
of Social Security. This supplementary role was embod- 
ied most concretely in the practice of “integration,” 
in which employers that qualified for tax breaks for 
their private retirement plans were allowed to reduce 
pension benefits sharply for lower- and middle-income 
workers to reflect expected Social Security benefits. It 
was also embodied in the 1974 ERISA statute, which 
regulated private plans to ensure that they would be 
secure counterparts to the public foundation estab- 
lished by Social Security and even created a quasi- 
public insurance company to protect defined-benefit 
plans against insolvency. Put simply, while employers 
offered health insurance as workers’ first line of de- 
fense, they offered retirement pensions to “top off” ex- 
pected Social Security benefits—a role sanctioned, reg- 
ulated, and insured by the federal government. Thus, in 
its underlying structure—guaranteed, insured benefits 
based on earnings and years spent working—the pri- 
vate pension system looked very much like the public, 
though it was much more favorable to the highly paid 
than was Social Security. 

This vision of the division of labor between public 
and private still has relevance, but it is much less accu- 
rate or widely shared than in the past. First, since the 
1970s, Social Security has been under serious financial 
pressure. Slower wage growth and increases in the ratio 
of retirees to workers precipitated the passage of two 
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major legislative overhauls, in 1977 and 1983. Although 
preserving the program, albeit at reduced levels, these 
reforms have effectively ended its expansion. 

Second, employers have rapidly shifted away from 
the traditional “defined-benefit” plans that were the 
subject of ERISA. Instead, they have adopted so-called 
defined-contribution plans (such as the familiar 401[k] 
plan) that are not tied to Social Security and, unlike 
defined-benefit plans, place most of the risk of invest- 
ment onto workers. Although this momentous trans- 
formation is mostly a case of conversion, in which em- 
ployers have restructured their plans within relatively 
stable federal guidelines, it is important to note that 
defined-contribution plans were enabled and greatly 
encouraged by new and expanded federal tax subsidies 
layered onto the existing retirement system during pe- 
tiods of conservative ascendance. As with health insur- 
ance, there has also been a basic decline in employers’ 
support for retirement benefits—and, in tandem, a ma- 
jor privatization of risk. 

As employers have moved away from defined bene- 

fits and decreased their commitment to pensions since 
the 1970s, employer pension contributions have signif- 
icantly decreased as a share of pay. Like the decline in 
private health insurance, the fall in pension contribu- 
tions is symptomatic of the broader reversals in the eco- 
nomic outlook of less-educated workers. Between the 
early 1980s and the mid-1990s, the value of pension ben- 
efits to current workers dropped in every income group, 
but by far most rapidly among the lowest paid work- 
ers, who already had the lowest coverage levels (Pierce 
1998). In addition, tax breaks for private pensions and 
other retirement savings options heavily favor better- 
paid employees: Two-thirds of the nearly $100 billion 
in federal tax breaks for subsidized retirement savings 
options accrue to the top 20% of the population (Orzag 
2000). 
Although the post-1970s economic transformation 
was the underlying spur for these dramatic forms of 
conversion and drift, its impact has been deeply medi- 
ated by politics. The 1980s signaled the beginning of an 
ongoing tug-of-war between two increasingly homog- 
enized and polarized parties, with Republicans seek- 
ing to create and liberalize individual retirement op- 
tions and Democrats fighting to place new restrictions 
on existing pension tax subsidies and limit the top- 
heavy skew of individual accounts. The overall thrust of 
policy has nonetheless been in the more conservative 
direction—toward the expansion of tax-favored plans 
and toward the loosening of restrictions both on eligi- 
bility for them and on the purposes for which they can 
be used. 

The path of IRAs illustrates the overall pattern. In- 
cluded in ERISA as a retirement savings device avail- 
able only to workers without private coverage, IRAs 
were expanded and made available to all workers in 
the early 1980s. In 1997 and 2001, they were liberalized 
again, permissible uses of the accounts were broadened 
to include education and housing expenses, and a new 
plan—called “Roth IRAs”—was created that would 
require account holders to pay taxes up front and then 
avoid all future taxes on their accounts (including estate 


taxes). Because, at the time, the vast majority of Ameri- 
cans already could establish traditional IRAs, the main 
effect of these changes has been to make tax-favored 
accounts more available and attractive to upper-income 
households. 

The story of so-called 401(k) plans is different but 
similar. In contrast with IRAs, which are individual 
accounts sponsored by the federal government, 401(k) 
plans are employer-sponsored retirement accounts that 
operate under section 401(k)—a provision added with 
little debate to the tax code in 1978. In 1981, a private 
benefits expert pressed the IRS to rule that the provi- 
sion extended to pensions in which workers put aside 
their own wages, much as in an JRA (Crenshaw 1999). 
The Reagan IRS agreed, and corporate sponsorship of 
401(k) plans exploded. In 2001, as part of that year’s 
tax-reduction plan, Republicans successfully pressed 
for dramatic liberalization of 401(k)s and IRAs and 
the creation of “Roth 401(k)s” similar to Roth IRAs. 

The explosive growth of 401(k) plans and IRAs over 
the past decade represents one of the most impor- 
tant developments in the political history of U.S. pen- 
sion policy. During the 1980s, contributions to IRAs, 
401(k)s, and Keogh plans for the self-employed rose 
dramatically (Venti and Wise 1997, 85), and by 1998, 
their assets were almost a third as large as the Ameri- 
can economy (U.S. Census Bureau 1999, Tables 851 and 
852). 

Behind this transformation lies a new conception of 
pensions, for these retirement accounts have few of 
the characteristics of either Social Security or older 
defined-benefit plans. These accounts are voluntary for 
individual workers, participants have a significant de- 
gree of control over investment choices, and benefits 
are often paid as a lump sum upon employment sep- 
aration or achievement of a specific age and, increas- 
ingly, can be accessed for purposes besides retirement. 
Because they are voluntary, many younger and poorer 
employees who are offered them choose not to partic- 
ipate or contribute little. And the risk of poor invest- 
ment decisions or bad financial luck falls entirely on 
participants—as became painfully clear in the wake of 
the recent stock-market downturn. 

The strength of the stock market in the last decade 
obviously helps explain the enthusiasm for individual- 
ized investment accounts. But the shift must also be 
seen as rooted in linked economic and political devel- 
opments of the past two decades. By the 1980s, defined- 
benefit pensions no longer offered the attractions to 
employers that they had in the more stable employment 
climate of the 1950s and 1960s, with its strict managerial 
hierarchies and large, unionized manufacturing firms. 
Nor, as Social Security’s tax-to-benefit ratio grew less 
favorable, did employers have a strong incentive to set 
up integrated plans whose expense would be partially 
offset by the federal program. 

No less important, however, are the underlying po- 
litical motives that lie behind the expansion of private 
accounts. For years, conservatives despaired of ever ef- 
fectively challenging Social Security. Even at the height 
of Reagan’s influence, the conservative push for reform 
was quickly crushed by the weight of past programmatic 
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choices. These past defeats, however, fostered a new 
awareness on the part of critics that Social Security 
could only be fundamentally reformed if there existed 
a “parallel system” of private individual accounts that 
could eventually be portrayed as a viable alternative 
to the public program (Butler and Germanis 1983, 551, 
553). Conservatives therefore retooled their strategy 
to encourage private retirement savings through ever 
more flexible and individualized means, acclimating 
Americans to private accounts and layering the insti- 
tutional infrastructure for a full-fledged private system 
on top of the core public program of Social Security. 

The motives for this approach have been care- 
fully analyzed by Stephen Teles (1998), who argues 
that “conservatives have slowly built up counter- 
institutions, counter-experts, and counter-ideas .. . [in] 
an attempt to solve the political problem of social se- 
curity privatization.” The core of this strategy, Teles 
concludes, was to “carve out a competing policy path, 
one that would slowly undermine support for Social 
Security and preserve the idea of privatization for the 
day when it was politically ripe” (14-15). This is layer- 
ing par excellence. 

Whether the day will ever be ripe remains a very open 
question. The reluctance of elected politicians to con- 
sider plans for even partial privatization of Social Se- 
curity is overwhelming—all the more so, in light of the 
stock market and federal budgetary turnaround. The 
difficulty of reforming mature pay-as-you-go-pensions, 
which stems from the massive expectations and accu- 
mulated fiscal commitments they embody, stands out 
as the ultimate example of programmatic path depen- 
dence and policy feedback. Nonetheless, these barri- 
ers should not blind us to the significant change that 
has already occurred. As corporations and individu- 
als have shifted to more individualized plans, the ex- 
plicit links between the public and private systems 
have steadily eroded, undermining some of the self- 
reinforcing mechanisms that previously secured Social 
Security’s privileged position. And most American em- 
ployers have lost their direct stake in the program’s 
health, as their own plans have broken off from the pub- 
lic pension core around which they previously revolved. 
These transformations are perhaps most visible in the 
growing role of tax-favored retirement accounts linked 
to the stock market and in the changing balance of pub- 
lic and private pension benefits—a balance that tilted 
toward the private side of the scale for the first time in 
the 1980s. Whatever else these momentous shifts fore- 
tell, they clearly signal a major privatization of risk. 


RETHINKING RETRENCHMENT 


In the end, then, the conventional story about retrench- 
ment appears to be only half-right. The path depen- 
dence of large-scale social welfare interventions is un- 
deniable. Yet the character of path dependence has 
varied greatly across different programs and policy 
domains. In some, such as Social Security, path depen- 
dence has implied relative stability both in formal poli- 
cies and in their outcomes. In others, such as employer- 
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provided benefits and some state-based programs, for- 
mal policies have been relatively stable but outcomes 
have not. A critical explanation for this difference is 
that in the latter areas, departures could occur with- 
out active policy change, because formal policies cre- 
ated opportunities for unilateral (or near-unilateral) 
action by the administrators or providers of benefits. At 
least as important as internal policy conversion of this 
sort, however, are politically rooted failures of public 
action—which retrenchment studies, focused as they 
are on large-scale policy reform, have largely missed. 
Even as the scope of American social protection eroded 
in crucial domains, concerted efforts to close the grow- 
ing gap caused by this ongoing policy drift were repeat- 
edly stymied. 

By no means is this the last word on recent trends in 
American social protection. The need for comprehen- 
sive data on the ground-level effects of risk-protection 
policies is pressing, and scholars have only started to 
move toward assembling the types of evidence that 
might allow more conclusive answers. Nor, I want to 
stress, is the foregoing intended as a refutation of re- 
search on welfare state retrenchment that shows that 
big programmatic reforms have been quite rare. My 
point is not that public social policies in the United 
States have been radically scaled back, but that, for 
a variety of reasons, their ability to achieve the goals 
embodied in them has noticeably weakened. This is an 
argument that, while not infrequently advanced, has 
not been intensively interrogated, and its refinement 
could go a long way toward reconciling the conflict- 
ing views that continue to characterize the burgeoning 
body of research on welfare state reform. 

The American experience suggests the considerable 
utility of this shift in focus, demonstrating a general pat- 
tern that I have described as “privatization of risk with- 
out privatization of the welfare state.” Although public 
social policies have indeed largely resisted the political 
and economic onslaught of recent decades, efforts to 
update them to changing social risks have failed (drift), 
their ground-level operation has shifted in directions 
at odds with their initial goals (conversion), and new 
policies that subvert or threaten them have been put 
in place (layering). The result has been a significant 
erosion of U.S. social protection, despite the absence 
of many dramatic instances of policy reform. Because 
the American experience is widely considered to be the 
strongest evidence of welfare state resilience in the face 
of conservative opposition, this is a notable finding in 
itself But it also carries lessons for our understanding 
of welfare state restructuring in other nations, and of 
the character, cause, and consequence of policy reform 
more generally. 

In extreme form, American developments provide 
a window into transformations taking place in many 
affluent democracies, as fiscally constrained welfare 
states confront new and newly intensified social risks. 
As Esping-Andersen (1999) argues, these risks have 
strained the capacity of existing social welfare frame- 
works. Unlike Esping-Andersen and others, however, 
I have argued that the growing gap between risks and 
benefits is not simply a result of exogenous shocks to 
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stable welfare states. Instead, I have highlighted two 
key respects in which the gap between risks and policies 
grows directly out of the politics of welfare state reform. 
First, while the literature on retrenchment has focused 
on active legislative reform, considerable evidence sug- 
gests that changes in policy goals and operation have 
occurred even in cases where formal policy rules have 
been relatively stable. Conversion of this sort is espe- 
cially likely, I have argued, when policies lack powerful 
support coalitions and when program structures em- 
body principal-agent relationships that leave substan- 
tial control over the delivery of benefits to actors other 
than the authorities charged with establishing policy 
rules. 

Why change of this form has been mostly in the di- 
rection of restricted protection is an important ques- 
tion. In the case of subnational policymaking, there are 
of course the well-known constraints on redistributive 
spending that states face due to interstate competition 
for capital and skilled labor (Peterson 1981). But the 
changing orientation of front-line policy actors, such as 
caseworkers, also appears crucial, and much more work 
needs to be done to understand the actions and motives 
of front-line policy agents. In the case of employment- 
based benefits, the reasons for the retreat from inclusive 
risk protection may appear far more obvious. Yet it 
was employers, after all, who constructed the extensive 
private systems of risk socialization that they are now 
so busy dismantling. Their abandonment of the old or- 
der appears to reflect not just the declining worth of 
private benefits for corporate strategies, but also the 
absence of effective political counterweights in either 
government or the private sector. The weakening of 
organized labor may not imperil the welfare state, but 
in the world of private benefits, the precipitous fall of 
unions does matter greatly. 

The second cause of risk privatization that is endoge- 
nous to the politics of reform is precisely the fierce 
assault on public programs that Pierson (1994, 1996) 
and others have seen as ultimately so ineffectual. My 
reason for highlighting conservatives’ ability to reframe 
debates, block new initiatives, and create parallel pol- 
icy paths is not that I wish to.equate these dimensions 
of accomplishment with the large-scale reforms that 
retrenchment studies have searched for (and mostly 
found lacking). Although I believe that U.S. conser- 
vatives have been more successful than received schol- 
arly wisdom acknowledges in achieving self-reinforcing 
incremental reforms, my essential argument is simply 
that, in a context where social risks are changing and 
policy drift is ubiquitous and consequential, critics of 
existing programs have not had to enact major reforms 
to move toward many of their favored ends. Merely 
by delegitimizing and blocking compensatory interven- 
tions designed to correct policy drift, opponents of the 
welfare state have gradually transformed the orien- 
tation of social policy. Fights over the welfare state 
concern more than whether programs will be cut or 
scrapped. They also concern the degree to which social 
policies will uphold longstanding goals and adapt to the 
world around them. We vastly understate the strength 
of the welfare state’s opponents if we do not see the 


extent to which they have succeeded in this latter 
debate. 

This “second face” (Bachrach and Baratz 1962) of 
conservative influence exposes an important soft spot 
in retrenchment scholarship. Retrenchment studies 
have argued that fragmented constitutional structures, 
have very different implications in the era of retrench- 
ment than in the era of expansion: The same institu- 
tional fragmentation that once hindered the passage of 
large-scale programs now presents an effective barrier 
to conservative attempts at retrenchment (Huber and 
Stephens 2001; Pierson 1994; Swank 2001). Yet this 
argument does not go far enough in acknowledging 
the conditional character of institutional effects. In the 
United States since the late 1970s, conservatives have 
had two central projects—cutting back existing policies 
and preventing new initiatives or the updating of exist- 
ing ones—and whereas institutional fragmentation has 
indeed hindered the former project, it has facilitated 
the blocking activities that are the central strategic el- 
ement of the latter. Furthermore, fragmentation not 
only creates multiple veto points. It also creates mul- 
tiple venues in which conservatives can pursue their 
aims while hindering efforts by defenders of existing 
programs to undo the policy drift and parallel policy 
paths that result. 

More generally, as we shift our gaze beyond episodes 
of large-scale retrenchment to take in processes of wel- 
fare state adaptation (or failures of adaptation, as the 
U.S. case seems to be), the political struggles that we 
find bring together the “old” and “new” politics of 
the welfare state in interesting ways. In the battle to 
scale back existing programs, we see the new politics 
writ large: the perilous obstacle course of veto players, 
loss aversion, and mobilized constituencies. Yet when 
we begin to consider the ways in which welfare states 
have responded to shifting constellations of risk and the 
weakened ability of established systems of social provi- 
sion to cope with them, we see more affinities between 
present struggles and those that lay behind the welfare 
state’s rise. There is good reason to believe, for example, 
that the power of leftist parties and organized labor— 
and of emergent forces like feminist coalitions—are 
quite important in determining whether and how wel- 
fare states adapt to new social realities. As just dis- 
cussed, moreover, there is also good reason to believe 
that the institutional factors that help explain the size 
and scope of welfare states have effects similar to those 
that they had in the past on contemporary efforts to up- 
grade existing policies. The crucial difference between 
past and present—and here the effects of past choices 
indeed loom large—is that current struggles take place 
in the shadows of massive systems of social provision, 
which pervasively shape the challenges and opportuni- 
ties that today’s leaders confront. 

To capture the interaction of old and new politics, 
this article has outlined a general framework for study- 
ing policy change based on the premise that opponents 
of existing policies weigh the relative costs of working 
within an existing policy framework, on the one hand, 
or of replacing or eliminating the framework through 
authoritative change, on the other. This calculation 
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suggests that, in political settings that make authori- 
tative change difficult, insurgents may not seek formal 
revision of policies, but may instead work to alter such 
policies through active internal reform or the block- 
ing of adaptation to external circumstances. Although 
Tused this framework to illuminate the strategies of op- 
ponents of the welfare state—and, in turn, to question 
the conclusion that there has been limited retrench- 
ment of U.S. social policy—the argument has substan- 
tially broader applicability. Indeed, it hints at a solution 
to the old rational choice conundrum “Why so much 
stability?” (Tullock 1981) that does not rest on ad hoc 
distinctions between institutions and outcomes (Riker 
1980) or on claims about the inherent uncertainty of 
reform (Shepsle 1986). Rather, it suggests that policy 
design choices are not equivalent to preferences re- 
garding states of the world simply because policies can 
be used to achieve multiple ends. Reformers always 
face the fundamental question of whether the sacri- 
fices they must make to work within an existing policy 
outweigh the costs of formal revision. 

Within this framework of expectations, I also devel- 
oped a set of propositions about the strategies that wel- 
fare state reformers will follow under different condi- 
tions that were well borne out by my analyses of pension 
and health policy. Faced with status quo-biased political 
institutions and popular social programs, conservative 
opponents of the welfare state have turned to strategies 
designed to abet policy drift, undermining longstand- 
ing programs while blocking efforts to adapt policies 
to shifting social risks. When the support coalitions be- 
hind policies have proved weaker or the latitude for 
internal change greater, they have turned to strategies 
of internal conversion, altering policies’ aims or oper- 
ation without revamping their formal structure. And 
when the political barriers have declined in response 
to favorable electoral or political winds, conservatives 
have successfully layered new policies that embody new 
goals on top of existing change-resistant programs. 

Moreover, the role that private benefits play in a 
particular policy area—whether they serve as the core 
source of benefits, as in health policy, or as a supple- 
mentary source, as in pension policy (Hacker 2002)— 
influences the reform strategies that opponents of the 
welfare state adopt in the precise fashion that the con- 
ceptual framework suggests it should. When private 
benefits play a core role, as in health care, opponents 
need only play defense, keeping new state interven- 
tions at bay and abetting externally caused policy drift. 
When private benefits are supplementary, however, 
much more active use of government power is required 
to encourage the expansion of private options and un- 
dercut public programs, as evidenced by conservatives’ 
layering of new tax breaks onto existing policies in the 
pension area. This framework thus offers a promising 
starting point for further analysis of the means by which 
established public policies are challenged and, at times, 
transformed. 

The pursuit of theoretical advances should not, how- 
ever, cause us to lose sight of the ultimate concern: the 
changing role of the welfare state in the lives of citizens. 
In the new climate of economic and family risks, the 
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welfare state has had to run to stay still—to do more 
merely to secure past gains In the United States, it 
has not done more, and when we examine the broader 
framework of American social protection, a strong case 
can be made that it has done less The scholarship on 
retrenchment has offered strong reassurance to those 
who believe that the welfare state is an essential ele- 
ment of a just society. My analysis raises the possibility, 
however, that formal welfare state policies may turn out 
to be more resilient than the ideals embodied in them. 
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a “tournament” as a substitute for the “race-to-the-bottom” model. Our key insight is that political 


T i inaner and tests a new formal model of the competition for capital, using the analogy of 


costs that accompany legislating have both direct and indirect effects on the likelihood and scale of 
reforms. While countries with higher political costs are less likely themselves to enact reforms, the presence 
of these costs also reduces competing countries’ incentives to reform regardless of their own political 
costs. Domestic politics therefore mitigates the pressures for downward convergence of tax policy despite 
increased capital mobility. We examine the capital tax policies in OECD countries during the period 
from 1980 to 1997 and find that states are sensitive to tax reforms in competitor countries, although their 
responses to reforms are mediated by their own domestic costs to reform. We define two potential sources 
of political costs of reform: transaction costs, due to the presence of multiple veto players in the legislative 
process, and constituency costs, due to ideological opposition to policy changes that benefit capital. Our 
evidence reveals that a reduction in these costs either domestically or abroad increases the likelihood that 


a country enacts tax reforms. 


hat are the policy consequences of the elim- 

ination of various barriers to international 

capital and labor mobility? Robert Frost, in 

Mending Wall, instructed that “Good fences 
make good neighbors.” Scholars have increasingly ex- 
pressed this sentiment’s converse, i.e., the pessimistic 
viewpoint that as the mobility of factors of produc- 
tion increases, international competition assumes a 
“beggar-thy-neighbor” character. When “fences” sep- 
arate countries and make it difficult for factors of pro- 
duction to move from country to country, governments 
can operate essentially as monopolistic suppliers of ser- 
vices, Governments can extract a price from the owners 
of productive resources in the form of taxes or can place 
regulatory restrictions on production without fear that 
factors of production will emigrate. When the “fences” 
fall and governments must compete with each other for 
the same resources, a state’s or country’s ability to tax 
or regulate diminishes; in the extreme, countries lose 
sovereignty over domestic policymaking (e.g., Andrews 
1994 and Lee and McKenzie 1989). 

The model that most scholars employ (implicitly 
or explicitly) to understand international or interstate 
competition is known commonly by its central pre- 
diction: the “race-to-the-bottom” (e.g., Oates 1972). 
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To attract a mobile factor of production, one govern- 
ment might reduce its use of a policy instrument that 
lessens the rate of return on that mobile factor, such as 
high tax rates or restrictive environmental standards. 
This policy change creates a negative externality for 
competitor states or countries that must enact their 
own reforms to maintain competitive parity. In game- 
theoretic terms, the states are presumed to face an in- 
escapable “Prisoner’s Dilemma” in which every state 
has a dominant strategy to make its market more at- 
tractive than its neighbors’ markets. In the game’s un- 
fortunate equilibrium outcome, the mobile factor may 
remain distributed the same as before barriers fell, but 
all countries and their citizens are in a worse condition, 
whereas owners of the mobile factor of production reap 
increased rewards. 

Scholars have searched for globalization-induced 
convergence, and indeed for downward spirals, to con- 
firm this prediction in policy areas such as environmen- 
tal and labor market regulation. Tax competition in 
particular would have broad repercussions that extend 
beyond tax policy itself. If lower rates lead to a reduc- 
tion in revenue that would accompany, policymakers 
would be forced to cut domestic spending, increase 
budget deficits, and/or shift the tax burden to less mo- 
bile factors of production. The empirical evidence for 
the race-to-the-bottom model, however, is weak. Vogel 
has found that the model is dissonant with actual trends 
in environmental regulation, and the same is true for 
tax policy—capital tax burdens have increased even as 
capital became more mobile, with the overall share of 
revenues provided by corporate income taxes rising 
from 7.7% in 1975 to 8.4% in 1996 (Genschel 2000). 
Many scholars have asked why race-to-the-bottom sce- 
narios have failed to materialize. Existing explanations 
for the lack of a downward spiral of capital tax bur- 
dens fall into two camps. The first argues that glob- 
alization does create pressures to converge but that 
political institutions and organizations mitigate these 
pressures. The second school contends that key actors 
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(firms especially) are not sensitive to changes in tax 
burdens abroad and hence have no reason to pressure 
governments to make any adjustments (e.g., Ganghof 
1999; Swank 1998, 2002; Swank and Steinmo 2002). 
We provide a new answer to this important question 
with both a new theory and new empirical tests that 
flow from that theory. Our formal model employs the 
analogy of a “tournament” to describe the strategic 
decisions that states face when competing for capital 
resources; the model has three features we wish to high- 
light, which have not appeared together in previous 
work. First, the model permits the responsiveness of 
capital flows to changes in tax policy to vary, allowing 
us to investigate the effects of capital markets’ glob- 
alization. Second, the model incorporates incomplete 
information, particularly about the decisiveness of tax 
reforms in determining where capital will locate and 
about competing countries’ costs of tax reform. Third, 
the model also incorporates a role for domestic politics 
and political institutions by acknowledging that coun- 
tries face political costs for undertaking tax reform. We 
find that the presence of political costs reduces a coun- 
try’s incentive to enact reform, but they simultaneously 
reduce competing countries’ incentives to reform. Do- 
mestic politics mitigates the pressures for tax rates to 
converge downward despite increased capital mobility. 
The formal model thus provides a theoretical explana- 
tion for the absence of the race to the bottom and, also, 
yields testable hypotheses about the conditions for tax 
reforms. We use this model to investigate the tax policy 
choices made in 20 OECD countries from 1980 to 1997. 


MODELING THE COMPETITION FOR 
CAPITAL 


The four tasks of this section are to introduce the 
standard race-to-the-bottom model, to modify that 
model by relaxing certain implicit restrictions, to solve 
for players’ equilibrium strategies, and to employ the 
model analytically for deriving hypotheses. We perform 
this last task by computing “comparative statics,” de- 
termining how equilibrium strategies change with vari- 
ations in key parameters of the model. 


The Race-to-the-Bottom Model! 


The standard race-to-the-bottom model assumes that 
the competition for capital is best represented as a 
Prisoner’s Dilemma.! The version of this game that is 
presentin most textbooks is discrete, such that countries 
would choose merely whether to cooperate (i.e., main- 
tain current tax levels or high regulatory standards) or 
to defect (i.e., reduce tax rates or relax regulations). 
Instead, let us present a continuous version of the 
model, which bears some resemblance to the Bertrand 
(price-setting) model of oligopoly; in the next section 
we expand on this model by incorporating incomplete 
information. Assume that an investor announces her 


1 See Radaell 1998 for an exception, employmg a “coordination 
game” for transfer pricing 
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intention to invest an amount of capital, worth k, in 
the national market offering the best rate of return. 
Suppose that N countries compete for this capital in- 
vestment by choosing a level of tax reform or tax reduc- 
tion effort, denoted e, for a generic country 1. Efforts 
to reduce tax rates improve a nation’s attractiveness 
to potential investors by raising the rate of return; let 
P:(e) = p,(¢, |e) denote the probability of country 1 
winning the contest, given its own effort and the ef- 
forts of its competitors which are denoted e_,. We as- 
sume that p, (e) is increasing in e, and decreasing in e_,. 
Countries derive positive utility from investments, but 
tax reductions also generate domestic political and eco- 
nomic costs, due to reduced tax revenues (holding the 
investment level constant) and to the transactions costs 
entailed in passing tax legislation. Let c,(e,) denote the 
cost to a country for its tax cutting effort; assume that 
for alli,c,(0) = 0, c > 0, c > 0, and there exists @ such 
that c,(e,) = 00 if e, > @. That is, no costs are borne if 
the country makes zero effort at reform, whereas costs 
increase continuously and at a weakly accelerating rate 
up to a certain threshold level ë (either an asymptote 
or a kink suffices). 

To solve the model, we adopt the Cournot—Nash 
equilibrium concept, which assumes that each country 
best responds to the conjecture that each opponent j 
chooses its own optimal effort level, e}. A country i 
thus solves the following: 


max p, (e |e.")ky — c (e), 


wheree*, =|] jäi e}. If there exists an interior solution, 
at which e, < ë, then we can identify a generic country’s 
optimal behavior by evaluating the first-order condi- 
tion (setting the first derivative with respect to e, equal 
to zero), revealing that e* must satisfy 


du, (e) 
de; 





= 0 pi(etlet) -k= c(et). 


For a race to the bottom to occur, no interior solution 
may exist; i.e., e” > @. This requires the left-hand side 
to exceed the right-hand side for alle, < ë, given e* re 
There are two sufficient conditions for this inequality to 
hold, both of which are satisfied in the standard model. 
First, note that this standard model is symmetric, such 
that ¢,* = e; for all i and j, and note the absence of 
uncertainty regarding investor behavior, such that the 
rank-order of countries’ effort levels is perfectly deter- 
ministic of victory and defeat in the contest. Combined, 
these two assumptions imply that p/(e*) = oo, such that 
when the countries exert the same equilibrium effort 
level, p; (er; e* )- k > ci(ef) for alle, < z, for alli. At all 
symmetric strategy profiles up to the threshold effort 
level, every country individually prefers to increase its 
own effort by an infinitesimal increment, which would 
increase its probability of winning from 1/N to 1 if no 
competitor country changed its own strategy (this is a 
self-refuting expectation, however). The second suffi- 
cient condition for the race to the bottom, and an im- 
plicit assumption of the standard Prisoner’s Dilemma- 
based model, is that the right-hand side equals zero 
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in the first place or, at least, is constant; either way, 
ci (e*) = 0 in the canonical model. If states can freely 
change their tax systems, tax rates will spiral downward 
because states quickly, completely and costlessly match 
other states’ actions. Thomas (2000) demonstrates in a 
simple framework that the nature of the tax competi- 
tion game changes as the costs attracting capital or jobs 
rise relative to the benefits, such that the competition 
for capital only approximates a Prisoner’s Dilemma 
when costs are relatively low; Dehejia and Genschel 
(1999, 409) conjecture that “the competition for foreign 
tax base is subject to diminishing returns, and, there- 
fore, is self-limiting.” 

In the next section, we present a new model that in- 
corporates more realistic assumptions about the strate- 
gic environment. Our model differs from the Prisoner’s 
Dilemma in three major ways. First, we do not assume 
that tax reduction is costless, and in fact, we permit 
cost levels to vary among the participants. Second, we 
do not assume that countries are perfectly informed 
of competitor countries’ domestic political costs of tax 
reduction. Third, we do not assume that tax reducing ef- 
fort is perfectly deterministic of the investors’ decisions; 
rather, our model permits the possibility that other fac- 
tors than tax cuts might influence investor decisions, 
and thus tax rate differentials are only probabilistically 
related to success or failure at winning investments. 


A Tournament Model 


Our model of the international competition for capi- 
tal is based on the analogy of a tournament, in which 
countries’ efforts to attract capital are rewarded with 
favorable investment decisions, but effort entails some 
perceptible (and variable) cost.? Our model reflects the 
insight that countries make their decisions in the con- 
text of potentially great uncertainty, which can take at 
least two forms. 

A first form of uncertainty regards competitors’ pref- 
erences over tax reform. Countries may differ by the 
level of political costs they incur; in our model we per- 
mit costs to take on two values—low cost (denoted 
6,) and high cost (denoted 6;;)—where c/(x;6z) < 
ci (x; 0) for all x. Country i’s information about cost 

is interim, meaning that i knows only its own cost 
of tax reform (high or low) with certainty, although 
i possesses beliefs about other players’ costs of tax 
reform. Results of future elections are unpredictable, 


2 We note, however, that tax rates are strictly bounded above zero in 
the Dehejia and Genschel (1999) model because of assumptions they 
make about the production function and about the responsiveness of 
capital flows to tax policy. Specifically, they assume that a € (—1, 0), 
where a parameterizes returns to scale. As a result, thear endogenous 
parameter t—which maps from the two countries’ tax rates into their 
capital stocks—is also strictly bounded to an interval that excludes 
the possibility of either country possessing the entire caprtal stock. 
Relaxing this constraint could also eliminate their conclusion that, 
“the tax competition game reaches a resting point at a positive level 
of taxation for both countries The race-to-the-bottom stops above 
zero taxation” (409). 

3 Similar models have been applied mainly to the labor market 
(Lazear 1995; Lazear and Rosen 1981). Wheeler and Mody (1992) 
treat investment location decisions as a tournament. 


and the domestic political characteristics of a coun- 
try may change over time, thus competing countries’ 
preferences over tax reform may be unpredictable. Yet 
we assume that sufficient information exists for coun- 
tries to make reasoned decisions based on the expected 
cost profiles of competitors for capital; formally let à 
represent the proportion of low-cost countries in the 
population and 1 — A represent the proportion of high 
cost countries. 

A second form of uncertainty regards the relative 
weighting that tax reform has in the mind of potential 
investors.* Assume that there exists an owner of cap- 
ital who wishes to make an investment and that this 
investor prefers higher rates of returns to lower ones, 
this behavior is loosely consistent with the common 
Mean-Variance portfolio selection model (Markowitz 
1952, 1959).5 As before, the investor announces her 
intention to invest k in the country offering the best 
rate of return, and the announced reward structure is 
assumed to be credible. Any model of tax competition 
must assume that investors are sensitive to different tax 
burdens in deciding where to locate investment. In a re- 
view of the empirical literature on taxation and firm be- 
havior, Hines (1999) is most emphatic that differences 
in tax rates do affect firm investment decisions, noting 
that “there is by now extensive quantitative evidence 
that international taxation influences the volume and 
location of foreign direct investment” (318-19). The 
Ruding Report surveyed European firms about how 
taxation affected their investment decisions and found 
that tax levels did enter the calculus of businesspersons. 
The proportions and types of firms that consider tax lev- 
els is the subject of continued debate. Mosley’s (2000) 
more recent interviews indicate that fewer than one in 
five considered taxation to be an important determi- 
nant of capital location. Devereux and Griffith (1998) 
have found that tax rates do not affect American firms’ 
decisions of whether to enter the European market in 
the first place but do affect where firms locate their 
investments within Europe. Low corporate tax rates 
could be part of an overall “menu” of policy induce- 
ments, but some investors might tolerate higher taxes 
if greater government revenues translate into beneficial 
expenditures, such as government investment in human 
capital (e.g., Garrett 1998 and Hall and Soskice 2001). 
Countries’ rate-of-return advantages may thus have 
sources other than favorable tax treatment, including 
proximity to important downstream markets, beneficial 
workforce characteristics, relaxed immigration laws for 
highly skilled workers, and the presence of exploitable 
national resources. 


4 Although the precise amount of uncertainty is permitted to vary as 
a parameter in the modeL 

5 The “E-V criterion” holds that for two assets : and s—yielding 
expected returns R, and Ry and variances o? ando?, respectively—an 
investor prefers i to j if R > R, ando? < o?, with both equalities not 
holding smultaneously If the investor is risk neutral, having Imear 
rather than quadratic preferences, or if o? = o?, then the expected 
return ıs the sole criterion guiding the investor’ decision 

é Fidelity to the tournament analogy requires such a structure of 
preannounced winnings for players, depending on their rank-order. 
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We formalize the insight that taxes are not exclu- 
sively deterministic of investors’ choices by permitting 
investors to possess some preexisting bias for or against 
the investment opportunities in country, which might 
result from characteristics of the country or of the in- 
vestor him- or herself, professionally or personally. We 
assume that the countries competing for the investment 
do not observe investors’ biases, although they do know 
their probabilistic distribution. Let r, denote the ex- 
pected rate of return from a particular investment in 
country i, and let it depend on both the country’s tax 
cutting efforts and a stochastic element s,, such that 
r, =e, +5,. The stochastic component is observed by 
the investor—one might treat s(= JI, ey $+) as his or her 
type—but not by either country at the point of deciding 
on tax reform. (Thus a country i’s information about the 
stochastic terms is ex ante, meaning that i knows neither 
Sı nor s, for any j #i.) An investor’s bias between a 
pair of countries i and j, denoted b,, =s, —s, 5 —b,,, 
reflects the stochastic component of countries’ rates of 
return. 

For a generic country i, again let the probability that 
the investor chooses country i to be denoted p,(e) = 
prob(r, > max, {r,}, Vj #i|¢,). Denoting by k the 
capital investment in country i, then a country can pre- 
dict that k, = k if and only if r, > r,, which is the case 
only if e +s, > e, +s; for all j. Country i can then 


infer the following relationships: 
ple) Ss | [prob(e: +5; >e,+5,) 
yt 
= | [ prob(e: =e >S,—5,) 
JA 


= | [probée, —e,>b,,), 
Joh 


where s, and s, are independently drawn from an iden- 
tical distribution. Given the iid. assumption, the ex- 
pectation of b, equals 0, and consequently p;(e) can 
be defined as the product of cumulative distribution 
functions Ge, — e,), with G(0) = 1/2. 

The extensive form game with N = 2 is illustrated 
in Figure 1. Nature moves first by choosing the two 
countries’ types; each country observes only its own 
type, as represented by each player’s information sets. 
Each country chooses its tax reform effort, given its 
beliefs about the other country’s type and given the 
knowledge that whether the investor rewards its efforts 
depends on the investor’s biases. Nature then reveals 
the two countries’ stochastic draws and the investor 
chooses where to invest k depending on rate-of-return 
advantages. 

To solve the game, we must find the equilibrium lev- 
els of tax reform effort for each type and the optimal re- 
sponse of the investor to effort choices. First, we adopt 
the Bayes—Nash assumption that each chooses its type- 
contingent best response to its beliefs about opponents’ 
types, which come from the population proportions of 
low- and high-cost types. Tax cutting efforts are chosen 
simultaneously, thus a strategy for a country maps from 
its type into an effort level. Assuming that types are 
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uncorrelated and that players are symmetric except for 
what is denoted by type, we can denote by ez the opti- 
mal choice by any low-cost type and by e*, the optimal 
choice by any high-cost type. Second, we assume that 
players are sequentially rational, meaning that after any 
history of the game, each player plays an optimal strat- 
egy for the remainder of the game, given his or her be- 
liefs. This permits us to use backward induction to solve 
for the subgame perfect Cournot-Bayes—Nash equilib- 
rium. We begin the analysis with the investor’s decision- 
theoretic problem at the very end, with all uncertainty 
resolved. Investors derive higher utility from placing k 
in the country offering the higher expected rate of re- 
turn (given the two countries’ tax cutting efforts and the 
investor’s own biases over the investment opportuni- 
ties). A generic country i again solves the maximization 
problem: 


max u, (e) 


= p,(e,|e%,)-k—c,(4,), 


= [(Gle — A-D . (Gle, — ef) OOD] 
-k— Ci (e,), 


where the latter equation substitutes for p,(e, | e7) the 
cumulative density function G(e, — e,). Simply, country 
i’s probability of winning equals the probability that 
its chosen effort level e, defeats all low-cost competi- 
tors and all high-cost competitors’ choices (which are 
assumed to be, respectively, the equilibrium low- and 
high-cost effort levels). Using the former equation, a 
generic country i’s first-order condition for a maximum 
yields 


du;(e) 
de* 


L 





=0> pie"; 6, 6_,) ‘k= c (e734), 


where 6, denotes the type (low or high cost) of player 
i, and 6_, denotes the profile of competitor coun- 
tries’ types. The interpretation of this equation is that 
a country will, in equilibrium, exert just enough ef- 
fort to equate the marginal benefit of tax reduction 
(increasing the probability of winning the competi- 
tion for capital, multiplied by the gain from win- 
ning) with the marginal political cost of tax reduction. 
This equation already permits three inferences to be 
drawn. 


Proposition I. An increase in a country’s costs will 
reduce a country’s effort level in equilibrium, all else 
constant, thus low-cost countries expend greater effort 
than high-cost countries.’ 


Proposition 2. An increase in the stakes of “winning” 
will increase a country’s effort level, all else constant® 


7 Tf, as assumed, c! (x; 0L) < ci (x; 6y) for all x > 0, then c/(y,6;) = 
cx; 9x) only if y > x Thus, if p; (8. e*,;94, 0) k= c (8,94) and 
x > 0, then pi(%,,e*,;61,04) k= d (&;0L) only fë > è. 
8 Suppose that k > k. If c¥(x,6,) > 0 for all x > 0, and p/(2;) k= 
c, (8:0), then pi (č.0) - k= c/(2,;6) only if 2, > & for all: 
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Proposition 3. An increase in tax reform effort’s de- 
cisiveness will increase a country’s effort level, all else 
constant? 


To deduce additional propositions from the model, 
we must impose further assumptions on the model’s 
components. For example, we might specify precise 
functional forms, such as assuming that the politi- 
cal cost of tax reform is a linearly increasing func- 
tion of effort levels. Let us instead focus our atten- 
tion on the probabilistic success function, p,(-). The 
model of choice described above places the investor 
in the position of maximizing a random utility model. 
Hence if the stochastic component of countries’ rates 
of return takes the type I extreme-value distribution 


9 The proof follows directly from the stipulation that tax cuts are 
“more decisive” in regime A, compared to regime B, if and only if 
pi (x, 84304) > pi(x, 2_,;6) for all x. 


(also known as the log Weibull distribution), such 
that it has density g(x) =e*-*") and cumulative 
density G(x — a) = e", then we have the multino- 
mial logit model of McFadden (1976, 1979). Conse- 
quently, 


Pile, e",) 
a et 
et + P teens, CE + Veenroemon) OF 
1 
~ TEO)(N — lei (1AN = Dei 








Substitution of this functional form into the above 
maximization problem, and again taking the first 
order condition with respect to e,, yields the fol- 


lowing: 
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vie; 6,, 04) 
(A)(N = 1le*F 4 (1 —a)(N Ler? 
[1+ ANN Det + (AAN = Dee}? 


_ & (e756) 
staan hea 


Since this equality holds for both low-cost and high-cost 
countries, then it will be the case that 





a ACEO) alenon) 
pilet, e* OL, 6_,) Dilek, e*,3 On, 6_,) 
Consequently, 


pilet, e*,301, 0) _— ch (e3301). 

Pilem 20x84) Cynn) 
by substituting the above definition of p/(e*;6;, 0) and 
rearranging terms, we find 
an (e én) ) 
en cy (ef; Ox) 

(eet (N= 1 [det aae] 
en +(N—1)[(Ajet+(1—A)eth] } 





‘Taking the natural logarithm of both sides of this equa- 
tion and rearranging yields the following comparison of 
low-cost and high-cost types’ equilibrium effort levels: 


Cy (etn 9H) 
Bide +n( ee) 
í Š c, (53 91) 


+2In(e% + (N—1)[(A)e% + (1 —A)e%]) 
—2In(e% + (N — 1) [ae + (1 —A)e*]). 


This equation can be simplified slightly by noting 
that the product of bracketed terms in the last two 
sets of parentheses resembles the sum of competitor 
countries’ expected effort levels: (N — 1) denotes the 
number of competitors, (À) and (1 — à) denote the pop- 
ulation proportions of low- and high-cost types, respec- 
tively, and ei and efh denote the exponent of the two 
types’ equilibrium effort levels. This equation thus re- 
veals that in equilibrium, the gap between the low- and 
the high-cost players’ effort levels will depend on the 
logged ratio of costs at equilibrium effort levels and on 
the total expected efforts of competitors, which itself is 
a function of the number of countries and the relative 
frequencies of low- and high-cost types. 

Performing a comparative statics analysis on this last 
equation permits us to express how changes in exoge- 
nous variables or parameters affect the optimal strate- 
gies of countries competing for capital; we derive the 
following two propositions. 


Proposition 4, An increase in the difference between 
the two types’ costs at equilibrium effort levels increases 
the gap between the two types’ effort levels. 


266 


May 2004 


Proposition 5. An increase in the proportion of low- 
cost countries increases the gap between the two types’ 
effort levels. 


To summarize the model’s conclusions, propositions 
one through five indicate that we should be more likely 
to observe one country undertaking tax cuts when the 
country’s political cost of tax reduction is reduced, and 
we should observe all countries undertaking tax cuts 
when the amount of capital at stake increases, when tax 
cuts become more influential with respect to investor 
behavior, and when there are more low-cost countries 
in the population. These results suggest that the race- 
to-the bottom outcome is unlikely in equilibrium, since 
its occurrence depends directly on the distribution of 
countries’ costs, plus—as shown earlier—on the stakes 
of winning and the decisiveness of tax reform in the 
minds of investors. It is to the task of testing these pre- 
dictions that we now turn our attention. 


EMPIRICAL ANALYSIS OF TAX 
COMPETITION 


In this section, we evaluate the predictions of the for- 
mal model based on a time-seriescross-section analysis 
of effective tax rates on capital. We begin by address- 
ing four methodological issues: how to operationalize 
domestic political costs, international capital mobility, 
and tax reform and how to aggregate variables to cre- 
ate “world” measures. We then rephrase our model’s 
predictions specifically in terms of the variables and 
present our empirical findings. 


Measuring Domestic Political Costs 


Governments must resolve the tradeoff between mak- 
ing the domestic market more attractive to investment 
and sacrificing other political goals, such as maintaining 
spending levels or achieving fiscal balance and an equi- 
table distribution of the tax burden. An important as- 
pect of our analysis is the assumption that political costs 
and benefits of tax reform vary across countries who 
face the same international economic environment. 
We are not alone in this argument; Steinmo (1993) 
argues that differences in political institutions had a 
profound effect on the evolution of the tax systems of 
Sweden, the United Kingdom, and the United States. 
Hallerberg and Basinger (1998, 1999) find that dur- 
ing the period from 1986 to 1990, states with a single 
veto player generally matched the United States’ 1986 
change in marginal tax rates, whereas states with more 
than one veto player did not. Increases in “competitive” 
veto players have been shown to decrease the num-’ 
ber and the breadth of tax reforms in OECD countries 
during the period from 1980 to 1997 (Wagschal 1999a, 
1999b), and increasing the number of veto players has 
been shown to depress the passage of important legis- 
lation of all types (Tsebelis 2002). Moreover, Steinmo 
and Tolbert (1998) find that the overall tax burden is 
systematically related to the size of the largest party in 


10 This proposition is consistent with Swank 2002. 
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parliament (a variable that is negatively correlated with 
the number of veto players). These legislative costs are 
not the sole factor worth considering, however, since 
politicians must also weigh the potential benefits of tax 
reform against economic and political costs that derive 
from their own partisan or ideological goals. Ganghof 
(2000) explains the lack of a race to the bottom in tax 
rates by noting that international pressures to lower 
tax rates might be countervailed by domestic pressure 
to keep taxes high. Political pressure from labor, due 
in part to worsening unemployment, made any shift 
to labor taxation or any reduction in social expendi- 
tures difficult. The option to increase deficits was not 
uniformly available to governments in the 1990’s, es- 
pecially for states in the European Union that faced 
the Maastricht criteria limitations on debt and deficit 
levels. Hence, while globalization might have increased 
the external pressure on governments to cut tax rates 
and burdens, governments did not respond because of 
countervailing internal pressures. Swank and Steinmo 
(2002) similarly assert that the welfare state’s resilience 
combined with large fiscal deficits from the 1970s on- 
ward to make initiating sweeping tax cuts increasingly 
difficult. 

For the purposes of empirical analysis, we wish to 
introduce a practical distinction between two different 
sorts of domestic political costs: transaction costs are 
associated with moving any piece of legislation through 
the legislative process; constituency costs are associated 
with potential opposition to tax cuts or particular tax 
reforms by pivotal actors within the legislative pro- 
cess, Transaction costs arise from (and increase with) 
the decentralization of the legislative process. These 
may be due to formal checks and balances, including 
the constitutional separation of powers in a political 
system. Even though bicameral systems are heteroge- 
neous (Tsebelis and Money 1997) and executive powers 
range broadly across presidential systems (Shugart and 
Carey 1992; Shugart and Haggard 2001), a polity with 
multiple legislative chambers or a separately elected 
executive with a legislative veto will present a greater 
number of hurdles that a bill must overcome before 
becoming law. Transactions costs may also originate in 
less formal institutions, including standing committees 
with gatekeeping pora and cabinet ministries with 
proposal control.“ Previous research on tax reform 
has operationalized the magnitude of transaction costs 
by counting the number of veto players—or perhaps 
more accurately, veto parties—in the legislative pro- 
cess (Hallerberg and Basinger 1998, 1999); a party is 
counted as a veto player if it is included in a gov- 
erning coalition or holds the majority in a legislative 
institution. 

This measure of veto players might be considered 
unsatisfactory because it does not consider the level of 
partisan or ideological conflict among the veto players. 
Tsebelis (1995, 2002) has shown that an increase in the 
distance between (partisan) veto players’ ideal points 


11 Henisz (2000) looks at the number of independent branches of 
government, which includes the judiciary and subfederal institutions 


may have two crucial effects: to reduce the likely mag- 
nitude of change from the status quo and to increase 
the likelihood that the new government will change the 
previous government’s status quo. Therefore we con- 
sider ideological distance between veto players in the 
analysis that follows,” using data from Laver and Hunt 
assimilated by Tsebelis.° Laver and Hunt (1992) asked 
experts to place parties on a left-to-right scale based on 
a set of specific questions, including one that focused 
on taxation (namely, whether a party argues that the 
government should “promote raising taxes to promote 
public services” to “promote cutting public services to 
cut taxes”). 

A second form of domestic political costs is con- 
stituency costs, which arise from the governing par- 
ties’ potential opposition to tax reform or its policy 
repercussions (Swank 2002). Any reduction in one tax 
might force a government to reduce spending, increase 
deficits, avoid repaying debt, or shift the tax burden 
(such as onto less mobile factors of production). A 
governing party’s constituents’ preferences affect the 
costs of each of these options. For operational pur- 
poses, we represent constituency costs in terms of the 
governing parties’ ideological positions. The logic for 
doing so can be traced back to Hibbs’s argument, based 
on the “Phillips Curve,” that right-leaning parties gen- 
erally count capital as stronger supporters than left- 
leaning parties and should, therefore, incur lower costs 
to cutting capital taxes than more labor-oriented par- 
ties (Garrett 1998; Hibbs 1977). We rely upon Tsebelis’ 
coding of partisanship according to Laver and Hunt’s 
scaling, but we standardize the variable to the range 
from 0 to 1, with lower numbers representing more 
left-leaning ideological attachments. 

In the empirical work to follow, we address both 
whether an increase in the domestic political costs of 
reform has a direct, negative effect on a country’s tax 
reform effort and whether increased political costs in 
one country have consequences abroad. Our tourna- 
ment model reveals that an increase in the likelihood 
of facing a low-cost country in the competition for 
capital increases efforts at tax reforms in all coun- 
tries. If an individual country’s political cost of tax 
reform is reduced, via either a reduction in the num- 
ber of veto players or a change in government parti- 
sanship, then all countries in the general population 
have an incentive to invest greater effort in tax re- 
form than they had done before; the reverse is also 
true. 


Measuring Capital Mobility 


By cutting taxes, national policymakers hope to attract 
investment; as capital becomes more mobile over time, 


12 Rartier versions of this paper reported results with both formula- 
tions of veto players. The results are largely the same, thus we report 
results for ideological distance only in the emprncal section. 

B http//www polisci.ucla.edu/tsebelis/. 

14 The correlation between this dimension alone and the two aggre- 
gates that appear m Tsebelis 2002 that average additional measures 


of partisanship is above .9. 
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a small rate of return advantage may have much greater 
impact on the set of investors’ decisions. At the heart 
of the race-to-the-bottom theory is an assumption that 
capital has become perfectly or near-perfectly mobile, 
such that capital can move so cheaply as to make the 
downward spiral in tax rates inevitable. In contrast, 
our model treats the level of capital mobility as an 
implicit parameter (affecting k, the amount of capital 
at stake) rather than as a constant. All else equal, in- 
creases in capital mobility should catalyze increased tax 
reform effort. 5 Hallerberg and Clark examine a range 
of measures common in the literature and conclude 
that, according to systemic measures of capital mobility 
(Andrews 1994; Frieden 1991; Hallerberg and Clark 
1997), capital had already become mobile circa 1973, 
after the collapse of the Bretton—Woods system. This 
implies that capital is consistently mobile in the pe- 
riod we consider in this paper. Following Clark and 
Hallerberg 2000, we employ an alternative measure of 
capital mobility that is based on the capital controls 
that states imposed to prevent capital from leaving the 
country; Simmons (1999) codes capital controls on an 
eight-point scale, based on information provided in the 
International Monetary Fund’s annual International 
Financial Statistics. 


Measuring Tax Reform 


How should one operationalize tax reform itself? A 
critical empirical matter is to identify the particular 
sorts of reforms relevant to the competition for capital. 
The question of what taxes owners of capital consider 
when making investment decisions inspired much re- 
cent work by or for the OECD, especially since dif- 
ferent measures of the tax burden lead to different 
conclusions about relative levels of the burden across 
countries and over time (Cary and Tchilinguirian 2000; 
OECD 2000; Volkerink and de Haan 2000). 

The simplest measure of tax policy (and its reform) 
that one could employ is the top marginal tax rate. The 
clearest disadvantage of using the marginal rate is that 
almost no firm (whether a corporation or an unincorpo- 
rated business) ever pays it. The simple percentage of 
income or profits listed in the tax code fails to indicate 
the exemptions, loopholes, etc., that capital owners can 
use to reduce their tax payments below the statutory 
rate. Using the top tax rate has several advantages, 
however. First, the marginal rate is the basis for tax 
deductions: in most cases, it is not possible to deduct 
foreign taxes paid from one’s home taxes beyond the 
level of the home marginal rate. Second, the statutory 
rate sets the parameters for two ways for firms to get 
around paying tax, transfer pricing and thin capitaliza- 
tion. Third, the top marginal rate plays an important 
signaling role about the tax burden of a country. Finally, 
as an OECD (2000) report points out, the marginal 
rates “are commonly an important factor in decision 





15 As an anonymous reviewer correctly points out, financial capı- 
tal will be more mobile than foreign direct investment. In practice, 
however, we know of no practical way of separatıng the two ın the 
empirical analysis 


268 


May 2004 


making on new investment” (7), therefore we consider 
central and general government marginal tax rates.16 

To address the shortcomings of marginal rates, we 
also examine a measure of “effective” or “implicit” tax 
rates, or what Volkerink and de Haan refer to simply 
as “tax ratios.”!” The most common method is referred 
to in the literature as the Mendoza method (Mendoza, 
Milesi-Ferretti, and Asea 1997; Mendoza, Razin, and 
Tesar 1994). These ratios have the advantage of mea- 
suring the actual taxes paid, which implicitly include the 
combined effects of statutory rates, tax deductions, and 
tax credits,"* but have the disadvantage of classifying 
different revenues according to categories like “capi- 
tal tax” and “labor tax” in somewhat ambiguous ways 
(OECD 2001). Following Swank and Steinmo (2002), 
we complement the use of marginal rates with this 
methodology to compute the second of our dependent 
variables,’ although we rely on an updated data set 
that expands the set of countries and makes some tech- 
nical corrections to the original Mendoza formula.” 
Our expectation is that, because of tax deductions, cred- 
its, and the like, the actual effective tax burden should 
consistently be lower than the marginal rates, with the 
general government rates higher than the central gov- 
ernment rates, 

Figure 2 plots the effective rate of taxation on capital 
and the central and general marginal corporate tax rate. 
In 1980, the effective rate was 20 percentage points 
below the actual general government rate. By 1996-97, 
however, all three rates’ medians had equalized. The 
average effective tax rate has remained remarkably sta- 
ble over the 18 years in our data set, whereas marginal 
rates have decreased over time. All three measures 
of the tax burden have narrowed slightly over time, 





16 Ganghof (1999) argues that investors care only about the overall 
tax burden they pay, and general government rates are most relevant. 
Hallerberg and Basinger (1999) counter that foreign investors are 
likely to know changes at the central government level rather than 
changes ın a particular city or state. We therefore use both mea- 
sures and thank Steffen Ganghof for making his data set available 
On marginal rates. 
17 Thear calculation differs from the previous method, as Volkernnk 
and de Haan (2000, 11) note, these ratios “indicate real changes in 
the tax system that might be over- or understated by, for example, a 
tax-to-GDP ratio” 
18 In technical terms when defining the Mendoza rates, “the capital 
income tax ratio 1s defined as the product of the personal mcome tax 
ratio . . and the sum of the operating surplus of private unmcorpo- 
rated enterprises. plus taxes on income, profits, and capital gains 
of corporations ... plus taxes on property ... and taxes on financial 
and capital transactions over operating surplus of the economy” 
(OECD 2001, 16). Carey and Txchilinguinan follow this method but 
make a correction for imputed salaries from self-employment (19). A 
second possibility is to connder tax revenues as a share of GDP, as do 
Inclan, Quinn, and Shapiro (2001). Unfortunately for our purposes, 
this statistic is problematic for single-country research across time, 
and it is not acceptable at all for comparisons across countries because 
of differing accounting definitions (see OECD 2000, 27). 
B. Swank (2002, chap. 7) uses the original Mendoza, Razin, and Tesar 
1994) data set n his empirical analysis, as do Swank and Steinmo 


) 
® Carey and Tchilinguran (2000) report data for the period 1980-97 
for 29 OECD countries. Mendoza, Razin, and Tesar 1994) mcludes 
data for seven counties and has a data set that ends in 1998, while 
Mendoza, Milesi-Ferrett, and Asea (1997) bas data for 17 OECD 
countries with a data set that ends in 1992. 
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although this narrowing should not be exaggerated— 
there remains a 25 percentage point spread between 
the two whiskers of effective tax rates. Conventional 
stories of a “race to the bottom” would not expect such 
a large spread, nor would they anticipate that the ef- 
fective tax rate on capital in aggregate has not changed 
appreciatively over the period. 


Welghting Scheme 


A final methodological issue concerns the weighting 
scheme we use in our regression analysis. Our model 
indicates that states consider what competitor states 
are doing with their tax policies. In empirical tests, how 
can one determine what a “competitor” country is in 
practice? We consider four alternatives. The first case 
is one where countries weight the probability of com- 
peting with another country based on the number of 
countries in the sample. That is, the likelihood that the 
United Kingdom competes with France for capital is 
simply 1/N, where N is the number of countries in the 
sample. We take the average values for the countries in 
the sample (excluding the country itself). 

This weighting seems implausible, since presumably 
the United Kingdom competes for capital more fre- 
quently with France than with New Zealand. We there- 
fore include in our empirical work three additional 
weights that governments themselves may use in eval- 
uating potential competitors. The first weight is Gross 
Domestic Product (GDP), which captures the notion 
that states may pay greater attention to tax policies in 
larger economies than smaller economies; this weight 
appears commonly in the “gravity” models of interna- 


FIGURE 2. Effective Tax Rate on Capital, and Marginal Central and General Government Corporete | 





tional trade theory.” The second weight is Foreign Di- 
rect Investment (FDI), capturing the notion that states 
are sensitive to capital flows, i.e., that states pay at- 
tention to the tax profile of other states that have had 
recent success in attracting capital. The final weight 
is gross Fixed Capital Formation (FCF), capturing the 
notion that states are sensitive to capital stocks, i.e., to 
potential, rather than actual, capital flows. In practice, 
these weighting schemes mean that changes in “larger” 
states should have a greater effect than changes in 
“smaller” states; that is, the United States’ veto player 
profile or partisan profile should have greater effect 
on the “global” veto player ratio than an equivalent 
change in Finland.” 


Model Specification and Results 


We include four control variables in our empirical 
analysis. The first is a country’s economic growth rate 
(lagged one year since taxes are often levied on in- 
come generated in the previous year). When economic 
growth is strong, capital profits should increase and the 
share of firms paying capital taxes should increase as 
well, but when economic growth is weak, profits and the 
share of firms paying taxes should decrease, and tax cuts 





21 Gravity models of trade also include a variable for the distance 
between countries. This variable 1s not relevant in our model be- 
cause we assume that capital can travel across distances effortlessly 
compared to trade in goods. 

2 Tt should be noted that we are weighting the impact of changes 
in tax rates, i ip, etc, when we compute the relevant inde- 
pendent variable. This is not the same thing as computing a weighted 
least-squares regression. 
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might be seen as an economic stimulus; we anticipate 
that the relationship between economic growth and the 
change in the capital tax rate will be positive. The sec- 
ond control variable that we include is a country’s rate 
of inflation (also lagged one year), which can have two 
countervailing effects: On the one hand, inflation can 
increase profits so that the proportion of capital owners 
paying taxes increases; on the other hand, because one 
pays taxes based on income in the previous year, higher 
inflation reduces the effective tax burden. The third 
control variable that we include is the level of the tax 
rate (whether the marginal rate or the effective rate), 
since states with higher taxes may feel more pressure 
to initiate tax reform. Finally, we include the change 
in the tax rate on capital in competitor countries (us- 
ing the four different forms of weighting competitor 
countries that we described above, to create a single 
index). 

Let us now turn to examining four hypotheses de- 
rived from our theoretical model: 


Hypothesis 1. Countries should engage in greater 
or more frequent tax reduction as the political 
costs of tax reform in the population of competitor 
countries decrease. Moves to the right politically or 
decreases in the ideological distance among veto 
Players abroad reduce political costs. 


Test of Hypothesis 1. World partisanship should 
have a negative coefficie nt, world ideological dis- 
tance between veto players should have a positive 
coefficient. 


Hypothesis 2. Countries should engage in greater 
or more frequent tax reduction as the decisiveness 
of tax reform in determining where investment will 
locate increases. Decreases in the use of capital 
reforms both abroad and at home increase deci- 
siveness. 


Test of Hypothesis 2. World capital controls 
Should have a positive coefficient, domestic capital 
controls should have a positive coefficient. 


Hypothesis 3. Countries should engage in greater 
or more frequent tax reduction as domestic po- 
litical costs of tax reform—in the form of either 
transactions costs or constituency costs—decrease. 
Moves to the right politically or decreases in the 
ideological distance among veto players decrease 
costs. 


Test of Hypothesis 3. Partisanship should have 
a negative coefficient, ideological distance should 
have a positive coefficient. 


Hypothesis 4. Countries should be increasingly 
responsive to changes in tax rates abroad as the 
domestic political costs of tax reform and/or the 
domestic decisiveness of tax reform falls. The con- 
ditional coefficient of changes in tax rates abroad 
should increase as ideological distance decreases, 
partisanship moves to the right, and there are fewer 
capital controls imposed at home. 
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Test of Hypothesis 4. Conditional coefficients 
calculations (explained below). 


Table 1 displays regression results. The 12 columns 
correspond to four different weighting schemes (the 
unweighted model and the GDP-, FDI-, and FCF- 
weighted models), for three different dependent vari- 
ables: the first quartet of regressions employs the 
change in effective tax rate on capital, the second 
quartet employs the change in central government’s 
marginal corporate tax rates, and the third quartet em- 
ploys the change in total marginal corporate tax rates.” 
Results vary as a consequence of changing the depen- 
dent variable, thus we discuss the regressions using 
the effective tax burden separately from those using 
marginal rates.** 

The effective tax rate regressions support two of the 
empirical hypotheses from our model when using ei- 
ther “stock” (GDP or FCF) variable as a weight for 
the political variables” Hypothesis 1 anticipates that 
states respond to changes in the level of political costs 
among the states that are likely to be their competi- 
tors for capital. The second and fourth columns show 
that the “world” partisanship level is significant at the 
.01 level in the two regressions that weight countries’ 
importance by their capital “stocks” (GDP or FCF). 
To get a sense of the practical effect of this variable, 
Table 2 displays the coefficients of significant political 
variables, the values those variables had at the 10% 
and 90% parts of their distributions, and the predicted 
value of the change in capital taxation if the remaining 
independent variables were set at their means. The ta- 
ble indicates that a rightward ideological movement in 
a given country’s perception of competitor countries’ 


B We also ran regressions with the absolute number of veto players 
instead of ideological distance. The results were broadly smular. 

M The lag of the dependent vanable was never significant, so we 
do not meclude it in the results. We also ran regressions assuming 
both a common AR(1) process and a panel-specific AR(1) process 
The substantive results did not change under either assumption. We 
include country dummies to control for fixed effects, with the United 
States as the excluded dummy variable. We also reran the regres- 
sions with year dummuea, although we do not report those results 
here. The advantage of including year dummies ıs that there may 
be year-specific effects that umpact taxation levels that we address 
The disadvantage 1s that they are highly correlated with the capital 
control variables; the mean vanance inflation factor (vif) score, which 
indicates that there 1s no multicolinearity with a score less than one, 
is in the thousands when year dummies are included. As a result, 
some political variables have inflated coefficients As Beck (2001) 
illustrates, including either country or year dummies in a regression 
when they are highly correlated with an independent variable of 
interest is problematic because of the effects on the coefficient of the 
independent vanable. We therefore include country dummies but 
not year dummies. We also compute panel-corrected standard errors 
{Beck 2001). All regressions were run with Stata 8. 

The “nonstock” regressions also have variables with the same 
signs, although the political variables are often not significant The 
difference between the “stock” and the nonstock results can be an- 
tiapated. It is unlikely that states treat each other equally in a com- 
petition game, which 1s the reason for including the weights and 1s the 
likely reason why standard errors are higher in the regression without 
weights. FDI net inflows, in contrast to the two stock variables, are 
more volatile. The correlation between the FDI in year ¢ and that 
in year t — 2, for example, is just .82, while it 1s .97 for gross capital 
formation Ttus suggests more “noise” in the use of FDI as one of 
the weights. 
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TABLE 2. Comparison of the Effects of Significant Political Variables According to Gross Domestic 


Product and Fixed Capltal Formation Welghts 





Change In Taxation 


Change In Taxation 





lf Remalning If Remalning 
Independent Independent 
Value Effect Vanables Set to Value Effect Varlables Set to 
Varlable Coefficient (10%) (10%) Means _ (90%) (980%) Means 
Partisanship, competitor countnes 
GDP welght —11.20 0.25 —2.78 2.80 0.59 —6.61 —1.04 
FCF welght —11.47 0.25 —2.85 2.64 0.59 ~6.78 —1.29 
Ideological distance, competitor countries 
GDP welght 9.11 0.08 0.69 —1.11 0.18 1.67 — 13 
FCF welght 11.28 0.08 0.86 —0.40 0.18 2.07 82 
Capital controls, competitor countries 
GDP weight 6.35 0 0 —1.54 25 1.58 04 
FCF weight 5.44 0 0 —0.81 25 1.36 54 

















Note: Coefhcrents calculated from Table 1 Note that the variable ıs included In the table only ff It Is stahstically significant at the 
p < .05 level. “Value” corresponds to the value of the Independent variable at a given point in that vanable's distrbuton The value of 
.25 for GDP weight in the 10% column, for example, 1s the value that partisanship took with the GDP weight at the 10% level of the 
varlable's distribution curve “Effect” multplies the value tmes the coefficient 





partisanship leads to a decrease in the taxation of capi- 
tal in one’s home country. In the FCF-weighted model, 
a move from the 10% level (Japan’s view of the world 
it faced in 1996) to the 90% level (Austria’s view of 
the world in 1985) would lead to a cut in the taxation 
of capital of 3.93 percentage points. Holding the re- 
maining independent variables at their mean valuess, 
the dependent variable’s predicted value moves from a 
2.64% increase in capital taxation at the first decile to 
a 1.29% cut in capital taxation at the ninth decile. Also 
consistent with Hypothesis 1, “world” ideological dis- 
tance among veto players has the hypothesized effect in 
all of the regressions, and the coefficient is statistically 
significant at the .05 level in the GDP- and FCF- 
weighted regressions as well as in the unweighted 
regression. Table 2 indicates that a decrease in a 
country’s perception of competitor countries’ trans- 
action costs leads to decrease in one’s own taxa- 
tion of capital. In the FCF-weighted model, a move 
from the 90% level of veto players’ ideological dis- 
tance in competitor countries (how Denmark per- 
ceived the world in 1984) to the 10% level (how New 
Zealand perceived the world in 1987) leads to a de- 
crease in the taxation of capital by 1.21 percentage 
points. Holding all independent variables other than 
world ideological distance at their means, the pre- 
dicted value of the dependent variable changes from 
an 0.81% decrease in capital taxation at the first decile 
to 054% increase in capital taxation at the nineth 
decile. 

The evidence from the effective tax rate regres- 
sions also supports Hypothesis 2: States are sensitive to 
changes in capital controls in competitor countries; this 
relationship has the correct sign in the first quartet of 
regressions and is statistically significant for both stock- 
weighted regressions. In the GDP-weighted regression, 
a move from the tenth to the ninetieth percentile in 
“world” capital controls leads to a 1.58% increase in 
the tax burden on capital. The FCF-weighted regres- 
sion yields results that are similar in magnitude and 
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statistical significance. We cannot, however, reject the 
null hypothesis that the domestic use of capital controls 
has no impact on tax reform decisions. This initial lack 
of evidence in the effective tax rate regressions regard- 
ing domestic politics obtains for Hypothesis 3, which 
predicts that domestic political costs should affect the 
degree of tax reform. The partisanship variable has 
the correct sign every time but never achieves statis- 
tical significance in the four effective rate regressions, 
and the ideological distance variable has the wrong 
sign. 

The regressions employing marginal tax rates pro- 
vide less evidence in support of Hypothesis 1 but do 
support Hypothesis 2: Changes in the use of capital 
restrictions in competitor countries do have a correctly 
signed and statistically significant effect in five of eight 
regressions. In other words, decreases in the world po- 
litical costs of tax reform seem not to effect reduc- 
tions in marginal tax rates, but a decrease in the world 
level of capital controls does lead to tax rate reduction. 
The regressions employing marginal tax rates provide 
stronger evidence for Hypothesis 3, which concerns the 
role of domestic political costs in the form of the ide- 
ological distance separating veto players. Countries in 
which the ideological distance among coalition part- 
ners is larger are less able to reduce their taxes. As 
mentioned above, partisanship again has the correct 
sign but is not statistically significant. Although changes 
in “world” partisanship or ideological distance do not 
appear to affect marginal tax rates at home, several con- 
trol variables, especially domestic capital tax rates, have 
statistically significant effects in the expected direction. 
Thus, states with higher marginal rates make deeper 
cuts than states with lower marginal rates (although 
note that the intercept or constant term is positive, 
so this results does not constitute prima facie support 
for the race-to-the-bottom prediction). Furthermore, 
actual changes in marginal rates by competitor coun- 
tries do lead to cuts in tax rates at home; note that 
our analysis is biased against finding consistent results 
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TABLE 3. Conditional Coefficients for the Effects of a Change In Tax Rates on Capital In 

















Competitor Countries Given Domestic Ideological Distance 
DCapital Tax Burden, DCentral Rates, Own DGeneral Rates, Own 
Own Country, with Welght Country, with Welght Country, with Weight 
Ideological Distance None  GDP_ FDI FCF None GDP FDI FCF None GDP FDI FCF 
Regression coefficients 

Coefficient, change .31 24*  23* .21* 87" 1.06" 357 1147 .70* =.19 57 4 
in competitor (20) (12) (14) (10) (21) (28) (14 (32) (23) (14) (14) (10) 
countries- 

Coefficient, —.83 —.52 -~—43 —.39 .04 —.17 .19 —.08 —.80 —.15 —.41 —.13 
change In (83) (60) (57) (50) (.79) (90) (57) (99 (72) (34) (45) (24) 
competitor 
countries * 
distance 

Conditional coefficient of change In P countries given a certain ideological distance 

0 distance 1 24" 21* 87" 1.06" 357° 1.147 70" 3.19 35" 14 
(United Kingdom, (.20) (.12) (ia) (10) (21) (29) (aay (32) (23) (14) (14) (10) 
1980-97) 

0.1 distance 23" 19%  .19* 177") .87" 1.04" 387") 1.13“ 62" 17 317) 12 
Denver (13) (08) (10) (07) (16) (24) (10) (27 (19) (13) (12) (09) 

0.2 distance .15 14" .15* 13% .877 1.03“ 38" 1.12" 5A™* 16 27" 11 
(Netherlands, (.11) (08) (.09) (.07) (.15) (.22) (.09) (.26) (19) (12) (41)  (.09) 
1982-88) 

0.3 distance .06 .09 .11 .09 8" 1.017 407% 1.12” .46* 14 23" 10 
(Italy, 1981) (14) (41) (41) (10) (18) (24) (11) (28) (21) (13) (12) (09) 

0.5 distance —.10 —.02 .02 01 .897 97" 44" 1.10" 30 11 15 .07 
Toa (.27) (.21) (.20) (.18) (.29) (.35) (.20) (.40)  (.29) (.16) (.18) (.12) 




















TABLE 4. Conditional Coefficients for the Effects of a Change In Tax Rates on Capital In 
Com r Countries Given Domestic Partisanship Level 











DCapital Tax Burden, Own DCentral Rates, Own DGeneral Rates, Own 
Country, with Weight Country, with Welght Country, with Welght 

Partisanship None GDP FDI FCF None GDP FDI FCF None GDP FDI FCF 

Regression coefficients 

Coefficient, 02 —.24 14 -.22 80 1.35" 29 1.44" 71 -.12 85 —.13 
change In (.52)  (.40) (38)  (.33) (.60) (.65) (.37) (.70) (.77) (44)  (.45) (.31) 
competitor 
counties;_; 

Coefficient, 25 .67 .02 63 14 —.59 17 —.57 51 —1.04 
change In (.86)  (.63) (.65) (53) (91) (1.01) (59) (1.11) (.75) — (.75) 
competitor 
countries * 
partisanhip 

Conditional coefficient of change In competitor countries oo a certain ideological distance 
0 partisanship 02 —.24 14 —.22 .80 135* .29 1.40 71 —.12 .85 —.13 

(no country) (52) (40) (38) (33) (60) (65) (37) (70) (77 (44) (45) (31) 
0 2 partisanship .06 -.10 15 —.10 .83* 1.23" 32 1.32" 66 -—.02 .64* —.04 
(Norway, 1989) (.35) A oo (.23) (.42) (.47) (.26) (50) (.53) (30) (31) (21) 
0.4 partisanship 11 .03 85" 1.12" .35* 1217 .60* .08 43" .05 
eae (19) ( 16) ( ye (.13) (.26) (.30) (.15) (.33) (30) (17) (17) (12) 
0.6 partisanship 16 16" 16% 15% 88" 1.00% 397% 1.10% 54% 19 2> 14 
o (11) (.07) (.09) (.06) (.14) (.22) (.08) (.26) (.19) (13)  (.11) (.09) 


0.8 partisanship 21 30° 16 30" 91% 88" 42" 98 48 29 15 2 
(.13) (.17) (13) (40) (33) (22) (20) (17 
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regarding competitor countries’ costs, since the actual 
changes—which are correlated with domestic political 
costs—are present and statistically significant in each 
regression and therefore “crowd” the partisanship and 
ideological distance variables. 

Let us turn our attention to Hypothesis 4, which pre- 
dicts that the responsiveness of a country to changes in 
tax rates abroad is conditional upon domestic political 
costs. We perform the previous set of regressions again 
but now include the lagged change in tax rates and this 
variable’s interaction with the domestic political cost 
variables. Tables 3 through 5 provide conditional coef- 
ficients for the range of values that ideological distance, 
partisanship, and capital controls take at home.” One 
should note that the interpretation of the coefficients 
of the interacted variables changes when an interaction 
term is introduced into the equation. The coefficient 
on ideological distance, for example, indicates the ef- 
fects of distance only when the lag of the change in 
taxation in competitor countries is equal to zero. Sim- 
ilarly, the coefficient for the effects of the lag of the 
change in competitor countries indicates the effects of 
this variable when the ideological distance separating 
veto players equals zero. 

Table 3 lists the conditional coefficients as well as 
standard errors for the effects of competitor countries’ 
tax changes, based on the domestic veto player profile 
for the 12 regressions. In the effective rate regressions, 
the coefficients change consistently with theoretical ex- 
pectations regardless of the weighting scheme. Coun- 
tries’ sensitivity to competitors’ changes in effective 
tax rates increases as domestic ideological distance de- 
creases, and they are most sensitive when ideological 
distance equals zero, Table 3 indicates that a country 
with great ideological distance between veto players 
(such as Italy’s governing coalition in 1981) would not 
react to changes in effective rates abroad, while a one- 
party government (such as the United Kingdom during 
the entire period) would respond to changes in tax rates 
abroad by adjusting its own taxes by between 21% and 
31% of the competitors’ change. The marginal tax rate 
regressions using general government tax rates are sub- 
stantively the same although less frequently significant, 
such that decreases in ideological distance at home do 
increase sensitivity to changes abroad. In contrast, ide- 
ological distance has little tangible effect on the condi- 
tional coefficients for central government rates. 

Table 4 evaluates the effects of changes in tax rates 
abroad given domestic partisanship. The results are 
consistent with our theoretical expectations that a 
move rightward makes a country more sensitive to 
tax cuts abroad, although only for effective rates. A 
Labour government in Norway, for example, should act 
as if it is oblivious to competing countries’ tax reforms, 


DGenera] Rates, Own Country, 
with Welght 





DCentral Rates, Own Country, 
with Weight 











ge In Tax Rates on Capital In Competitor Countries Given Domestic Use of 





with Welght 


DCapital Tax Burden, Own Country, 





ital Controls 


Regression coefficients 


% Readers should note that the lag for change m competitor coun- 
tries is interacted (mstead of the nonlag version) because the inter- 
action model is assuming that domestic players are responding to 
rate changes m place The full set of results is available upon request. 
Essentially, the “world” results for ideological distance for the three 
dependent variables as well as the world results for capital controls 
ın the statutory rate regressions remam as before. The world results 


for partisanship, however, weaken. 





(United States, 1980—97) 
0.25 capital controls 
(Portugal, 1980—85) 


0.75 capltal controls 


(Greece, 1981) 


competitor countries + 


distance 


Conditional coefficient of change In competitor countries given a certain ideological distance 


(Franca, 1980-89) 


0.5 capital controls 


TABLE 5. Conditional Coefficients for the Effects of a Chan 
Capital Controls 
Domestic 
Coeffldent, change In 
competitor countries; 
Coefficient, change in 
0 capital controls 
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while a Fianna Fail government in Ireland should re- 
spond directly and proportionately to tax changes in 
other countries. The story is much the same for the 
effects of domestic capital controls, whose effects are 
evaluated in Table 5. When a country employs no do- 
mestic capital controls, like the United States for the 
entire period, countries react to changes in effective 
rates abroad. Any increase in domestic capital controls 
reduces states’ sensitivity to other countries’ changes, 
although again these results only accord to theory for 
our effective tax rate regressions. 

Taken together, the empirical results suggest the fol- 
lowing. First, governments are sensitive both to the 
threat of changes especially in effective tax rates abroad 
and to actual tax cuts by competitors. This sensitivity 
increases in four conditions: when capital restrictions 
are removed abroad, when no domestic capital controls 
are in place, when a “right” government is in power, 
and when the ideological distance between veto play- 
ers is small. Simply, a prevalence of “right” govern- 
ments and low-ideological-distance coalitions partners 
in competitor countries should lead the home coun- 
try to lower the effective tax burden, especially when 
there are few restrictions on capital’s movement. Mean- 
while a prevalence of “left” governments and/or high- 
ideological distance coalitions abroad should allow a 
country to maintain or even increase the actual tax 
burden that capital pays. These patterns are less clearly 
supported for changes in statutory or marginal tax 
rates. Central government capital tax rates appear to 
be sensitive to changes abroad despite domestic costs, 
whereas domestic political costs have consistent direct 
effects, their indirect effects on a country’s sensitiv- 
ity to changes abroad in general government rates are 
ambiguous. It should be noted that this difference be- 
tween effective and statutory rates parallels the findings 
of Swank and Steinmo (2002). Our empirical frame- 
work allows us to draw the additional conclusion that 
countries are conditionally sensitive to changes in com- 
petitors’ rates, with ideological distance, partisanship, 
and capital controls all playing important roles. 


CONCLUSION 


Our first and foremost goal in this paper is to pro- 
vide a reasonable substitute for the race-to-the-bottom 
model, taking seriously the strategic nature of the 
games nations play. Our model departs from precedent 
most strikingly by incorporating the political costs of 
reform into the analysis. The most significant impli- 
cation of the model is that a country’s own political 
situation, combined with the institutional makeup of 
potential competitors, affects a country’s decisions re- 
garding competing for capital. In the tax competition 
case, there is good reason to expect tax rates to remain 
more or less static and certainly above zero in a host 
of situations. Moreover, it is important to recognize 
in our models that taxes are not the sole factor that 
owners of capital consider when choosing where to in- 
vest and that the actions of competing nations are not 
wholly transparent. Tax competition, therefore, should 


not lead to a downward spiral of tax rates, unless capital 
is completely responsive to even minor changes in tax 
policy (Dehejia and Genschel 1999). 

After presenting our model and detailing how its 
conclusions differ from the race to the bottom, our 
attention turned to the topic of the model’s empirical 
applicability. We find evidence for the importance of 
“world” political costs of tax reform, measured by both 
transactions costs and constituency costs (Carey and 
Tchilinguirian 2000). We also find an interactive effect 
between changes in tax rates in competitor countries 
and domestic political costs. States with a single veto 
player or where the veto players have similar prefer- 
ences on tax issues are more responsive to changes in 
competitor countries than those with more ideolog- 
ically dissonant veto players. These findings support 
the core insight from the model—states are conscious 
of the political costs of reform in competitor coun- 
tries, although their own adjustments are conditional 
on the ideological distance between domestic veto 
players. 

We apply this model empirically only to tax reform 
in this paper, but in principle the model is applicable 
to any sort of reform where states compete with each 
other, such as environmental standards and labor mar- 
ket regulation. The formal model’s components are not 
restricted to tax policy, and comparing across issues 
would provide perspective on the relative importance 
of tax competition. We have attempted to take a step 
toward integrating comparative and international po- 
litical economy. We urge scholars who have relied on 
the race-to-the-bottom logic in the past to consider 
the dynamics of the model introduced in this paper 
and to assess whether their claims about the conse- 
quences of integrated capital markets should be qual- 
ified by domestic political and economic conditions. 
At the same time, we urge scholars who study domes- 
tic politics to consider and to investigate further how 
globalization has (or has not) affected the incentives of 


policymakers. 
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divergent results of two roughly simultaneo 


us negotiations about international regimes for TV 


/ broadcasting and remote sensing from satellites reveal that governments do not form preferences 

by referring only to the interests emphasized in rational choice accounts or to the identities empha- 
sized in sociological institutionalist ones. Tracing the course of the negotiations reveals that governments’ 
perceptions of the interests and identities at stake, and their formulation of their preferences are shaped 
by their understanding of the contours of the problem or issue at hand. These understandings, or situa- 
tion definitions, structure political interactions by indicating the causal and moral beliefs relevant to the 
question at hand, the range of efficacious and acceptable policy means available for addressing it, and 
the sorts of authority, expertise, skills, and other resources that give particular actors strong claims to 


inclusion in the process of deciding how to address it. 


regimes contain the rules and procedures that 

states (or at least the most powerful states) want 
them to contain does not really answer the question of 
how a particular regime is given particular substantive 
features and not others. Why, for instance, were the 
rules in the 1947 GATT agreement applied to manu- 
factured goods but not to food and agricultural prod- 
ucts? Understanding why some substantive features are 
adopted rather than others requires understanding how 
states decide which among the set of feasible substan- 
tive regime features they prefer. 

Recent efforts to understand how states define their 
preferences have primarily focused on how individual 
states make choices and have produced two strongly 
contrasting conceptions—trational choice and sociolog- 
ical institutionalism. Rational choice accounts posit 
egoistic utility maximizing actors operating within so- 
cial institutions having minimal impact on their moti- 
vations, guided by answers to the question “What do 
I want or need?” when perceiving their situation, mak- 
ing choices, and pursuing courses of action. Sociological 
institutionalist accounts posit rule-following actors op- 
erating within denser social institutions shaping moti- 
vation by establishing standards of behavior for holders 
of various social roles, guided by answers to the ques- 
tion “What am I?” as they perceive, choose, and act. In 
rational choice accounts actors seek to advance their 
own interests; in sociological institutionalist accounts 
they follow logics of appropriateness derived from so- 
cial norms. 

Both conceptions focus so heavily on individual ac- 
tors that they obscure how perceptions, choices, and 
actions are affected by the dynamics of interaction with 
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other actors. The full impact of these dynamics gets lost 
because both accounts assume away the question of 
how actors develop enough of a common conception of 
what they are doing or discussing to interact effectively. 
In rational choice accounts, the common conception 
arises from actors’ ability to identify the pattern of 
convergent and divergent interests among them and 
thereby to discern the contours of the “game” in which 
they will be involved with the others. In sociological 
institutionalist accounts, the common conception arises 
from actors’ socialization into common norms enabling 
them to identify the relevant logic of appropriateness. 
Pressing on and asking just how rational utility maxi- 
mizers discern the pattern of convergent and divergent 
interests or rule followers converge on an identifica- 
tion of the most relevant social norms reveals that the 
answer is less simple than adherents of either theory 
assume. 

Some political scientists have addressed the ques- 
tion of how individual actors converge on a similar 
conception of an interaction obliquely, by arguing that 
the form of political interaction in domestic (e.g., Dahl 
1961 and Lowi 1964) or international (e.g., Keohane 
and Nye 1977 and Lipson 1984) politics is guided by 
the substantive character of the issue or problem at 
hand. A similar line of argument has been developed 
more strongly in symbolic interactionist sociology (e.g., 
Blumer 1969 and Rose 1962), which posits that actors 
orient themselves toward and understand the contours 
of an interaction by identifying the character of the sit- 
uation they face. In the symbolic interactionist concep- 
tion, actors answer the question “What am I doing?” 
before turning to the “What do I want?” emphasized 
by rational choice theorists or the “What am I?” em- 
phasized by sociological institutionalists. 

Paying more systematic attention to how the answer 
to “What am I doing?” shapes actor convergence on a 
common conception of their interaction permits taking 
up the additional symbolic interactionist insight that 
actors have multiple interests and identities that receive 
different degrees of emphasis at different times. Which 
interests or identities come to the fore at a particular 
moment depends, in the symbolic interactionist view, 
on the character of the interaction at hand. This insight 
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is particularly important when dealing with compos- 
ite actors like states in the international system. States 
consist of a wide range of other actors—including po- 
litical leaders, government agencies, economic or other 
interest groups, and social movements—with interests 
and identities of their own that they seek to advance 
by securing enough influence to guide state policy to- 
ward emphasizing state interests or identities compat- 
ible with their own. 

Political actors answer the “What am I doing?” ques- 
tion by invoking or establishing a situation definition 
that simultaneously establishes three basic contours of 
the coming interaction. First, the situation definition 
identifies the substantive type of issue or problem being 
addressed, thereby indicating which sets of causal and 
normative beliefs are relevant to comprehending the 
matter at hand. The extant causal and moral beliefs 
channel both sincere and strategic political communi- 
cations by providing the substantive starting points for 
developing sincere positions on the issue or the best 
platforms for arguments most likely to sway others. 
Here, as in other aspects of the political reasoning pro- 
cess, whether an actor’s own intentions are sincere or 
strategic is less important than others’ reception of the 
message being conveyed. Political actors are aware that 
many messages are strategic and deal with that reality 
by using their own substantive expertise to evaluate 
the merits of others’ messages or, when they lack much 
expertise, relying on cues provided by other actor’s 
characters or the institutional context of discussion to 
identify the messages that deserve their most serious 
consideration (Cobb and Kuklinski 1997; Lupia and 
McCubbins 2000, 55-60). 

Second, the situation definition conveys a sense of 
the policy means most appropriate to dealing with 
matters within that substantive area, bounding the 
range of plausible policy choices. Third, by defining 
the sorts of authority, capacity, experience, or exper- 
tise relevant to addressing the question at hand, the 
situation definition indicates which persons, groups, 
or government agencies have strong or weak claims 
to inclusion in the decision-making process. Situa- 
tion definitions thus include more than the beliefs 
about and perceptions of a particular policy question 
emphasized in the literatures on problem definitions 
(e.g., Rochefort and Cobb 1994 and Schoen and Rein 
1995) or frames (Sylvan and Voss 1998; Tversky and 
Kahneman 1981); they classify problems into types and 
suggest who, using what means, is relevant to address- 
ing them. 

The impact of situation definitions on political in- 
teraction can be traced most easily when they change. 
Stable, widely shared situation definitions form part 
of actors’ unexamined mental horizons, unarticulated 
and little noticed by the actors themselves. Although 
major political transitions, such as the end of the Cold 
War, generally involve changes of situation definition, 
tracing their impact is difficult because so much else 
is changing at the same time. Isolating the impact of 
situation definitions is easier when tracing how actors 
coped with an unfamiliar issue or problem in relatively 
stable times. 
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This study focuses on two controversies about cross- 
border information flows that were addressed by the 
UN General Assembly’s Committee on the Peaceful 
Uses of Outer Space (Outer Space Committee) be- 
tween 1962 and 1986 with very different results. The 
first, on using satellites for direct TV broadcasting to 
viewers in other countries, ended in 1982 when a Third 
World-Soviet bloc coalition persuaded a large majority 
of the UN membership to vote for General Assem- 
bly Resolution 37/92 endorsing a wide array of restric- 
tions on such broadcasts. The second, on using satellites 
for civilian applications of remote sensing technology, 
ended in 1986 when the same UN membership came 
to consensus on General Resolution 41/65 endorsing 
permissive rules for undertaking remote sensing and 
distributing the data thus acquired. 

Comparing these two negotiations provides a good 
opportunity for assessing the impact of situation def- 
initions because so many other factors affecting the 
outcome of multilateral interactions were very similar 
across them. First, they shared a common political con- 
text, occurring during the Cold War and in the shadow 
of wider contentions over Third World demands for 
a New World Information and Communications Or- 
der. Second, the identical set of participants in both 
negotiations meant that the distribution of overall and 
issue-specific capability among the states involved was 
the same. The main feature of this distribution was 
the divide between a small group of “system opera- 
tor” states—the United States and the USSR foremost 
among them by far—possessing the satellites and a 
much larger group of “system user” states able only 
to receive signals or data from others’ satellites. 

Third, the initial constellation of governments’ pref- 
erences was very similar. The primary political cleav- 
age reflected the technological divide between states 
where the government operated or allowed nationals to 
operate satellite-based systems (“broadcasting states” 
or “sensing states”) and states where the government 
and nationals alike lacked such capacity but would 
be affected by others’ activities (“receiving states” or 
“sensed states”). This primary cleavage did not com- 
pletely parallel the South—North division between de- 
veloping and industrial states because a few develop- 
ing states were operating their own domestic broadcast 
satellites by 1982 and most industrial states possessed 
neither broadcast nor remote sensing satellites. How- 
ever, it was close enough to encourage Third World 
participants to cast debate in South—North terms. The 
general consensus that both negotiations were about 
regulating a form of transborder information flow 
meant that the primary substantive question was how 
far the international regime to be created should permit 
restricting that flow, and governments could be arrayed 
on a similar spectrum. At one end were governments 
advocating highly permissive rules that would allow 
broadcasters or remote sensing operators to use satel- 
lites at will and choose what signals to transmit or data 
to collect and distribute. At the other end were gov- 
ernments advocating highly restrictive rules that would 
subject both satellite use and signal or data flows to 
prior receiving or sensed state consent. In the middle 
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were governments advocating a degree of receiving or 
sensed state control over signal or data flows. U.S. gov- 
ernment positions were on the permissive end of the 

in both negotiations, strongly affirming the 
principle of “free flow of information” and accepting 
only minimal government controls over its origination 
or spread. Next along the spectrum were the Canadian 
and Western European positions, balancing freedom 
of expression for broadcasters and wide sharing of 
remote sensing data with accepting enough receiving 
state government control over foreign-origin TV pro- 
gramming to protect local culture and morals or to 
maintain control over the number and type of entities 
permitted to engage in TV broadcasting. Most Third 
World governments’ positions occupied parts of the 
more restrictive half of the spectrum; they advocated 
stronger forms of receiving or sensed state controls 
on the activity but disagreed about the precise extent 
and form those controls should take. Soviet bloc and 
other Leninist governments occupied the most severely 
restrictionist end of the spectrum, particularly on TV 
broadcasting. 

Fourth, both negotiations proceeded in a very similar 
institutional context. The decision to deal with both is- 
sues in the UN Outer Space Committee entailed follow- 
ing well-established procedures of multilateral negoti- 
ation. These did not entirely exclude affecting nego- 
tiations through unilateral action; states with national 
space programs did exercise a degree of “go it alone 
power” (Gruber 2000) to push the negotiations in cer- 
tain directions by their own actions. Yet participants 
shared strong expectations that all UN member gov- 
ernments would have a say in the negotiations and that 
agreements will be developed in a two-stage process of 
enunciating a nonbinding “declaration of principles” 
followed by writing a legally binding multilateral treaty. 
Using the UN Outer Space Committee for the detailed 
negotiations meant confiding the preliminary work to a 
committee of less than the whole UN membership—28 
states in 1963, 47 in 1978, and 53 in 1980—operating 
under a strong consensus rule. However, the rest of the 
UN membership could get involved when the Outer 
Space Committee presented its work to the UN Gen- 
eral Assembly, a plenary body deciding by consensus 
if possible but reverting to its majoritarian decision- 
making rules whenever consensus fails to develop. 

The consistency of the international level context for 
both negotiations means that the explanation of the 
different results of the negotiations must be sought at 
the national level, in the stability or change in national 
policy on direct satellite broadcasting and satellite re- 
mote sensing. The trend of national policies over time 
was the product of stability or change in situation defi- 
nitions. In the direct TV broadcasting negotiations, the 
type of information flow involved was defined consis- 
tently throughout and viewed through beliefs that too 
many foreign-origin signals would seriously erode local 
culture and weaken national autonomy. Though a con- 
venient mask for the material interests of locally based 
broadcasters reluctant to see their monopolies and rev- 
enues eroded, the beliefs were shared by many domes- 
tic actors who did not share in the revenues. The same 


set of domestic actors—political leaders, diplomatic 
generalists, local broadcasters, and intellectual crit- 
ics of foreign media (particularly U.S.-based privately 
owned media) as agents of “cultural imperialism”— 
maintained control of policy-making and continued to 
view limiting the extent of incoming signals as the most 
desirable policy outcome. 

There were no preexisting revenue flows to protect 
in the remote sensing negotiations, but the initial em- 
phasis on information about the location and extent of 
natural resources fanned fears in many countries that 
foreign corporations with better access to that infor- 
mation would enrich themselves at local expense. This 
triggered a similar defensive reflex that loosened only 
as definitions of remote sensing data expanded to in- 
clude information about pollution, land use patterns, 
and other local conditions useful to government agen- 
cies charged with administering territory and regulating 
private activity. This expansion in data uses meant that 
concern about others’ gains became only one part of 
a governments’ overall benefit-cost calculations, and 
also gave additional domestic actors—local analysts 
and users of remote sensing data—a place in policy 
discussions previously dominated by political leaders 
and diplomatic generalists. 

Though elements of a changed situation definition 
in satellite remote sensing began to appear in the late 
1970s, the impact of its existence on the UN negotia- 
tions was not significant until later. Thus the negotia- 
tions can be divided into two phases: one running from 
the 1960s through about 1980, when they ran in parallel 
and most observers expected similar outcomes, and the 
other starting in 1981-82, when their paths diverged. 


PARALLEL PRESSURES FOR RESTRICTIVE 
INTERNATIONAL REGIMES 


Direct TV Broadcasting 


Until 1962 TV broadcasting was a very localized ac- 
tivity because TV signals from terrestrial towers can 
be received only within an approximately 45-mile ra- 
dius. Though the launch of Telstar in July 1962 only 
demonstrated the feasibility of using satellites to re- 
lay telephone and video signals to 10-meter-diameter 
antennas far too expensive for household acquisition, 
governments and other actors needed little imagina- 
tion to envision orbiting transmitters sending TV sig- 
nals directly to household receivers in the near future. 
Satellite broadcasting technology offered large states 
the possibility of developing national TV systems very 
rapidly (Kerdoun 1988, 710) but also raised for all states 
the specter of foreign broadcasters using satellites to 
send their signals to local populations directly rather 
than through locally based rebroadcasters whose ac- 
tivities could be controlled by the government. Thus 
satellites appeared to create a TV equivalent to short- 
wave radio broadcasting. Since the failure of League of 
Nations efforts to develop agreed limits on propaganda 
broadcasts in the 1930s, shortwave broadcasting had 
proceeded under a tacit “broadcast at will/prevent local 
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listening if can” regime under which governments and 
private firms emitted programming while governments 
objecting to the programs tried to block local listening 
by jamming the signals or banning possession of radios 
able to receive them (Dalfen 1972, 193; Gorove 1985, 
9; Krasner 1991, 346). 

In the early 1960s TV broadcasting from terrestrial 
towers was a well-established activity in most countries, 
though the number of active local broadcasters and 
the hours and types of programming available varied 
considerably. Each government of the world had a na- 
tional system of regulation and/or direct operation of 
TV stations in place and a set pattern of relations with 
the operators of those stations (see Nordenstreng and 
Varis 1974). In many countries, a government-owned 
broadcasting authority monopolized TV broadcasting. 
Although such authorities usually broadcast on more 
than one frequency to offer some degree of programs 
choice, they faced no domestic competitors. Nor, except 
in border areas, did they face any foreign competitors 
because TV signals did not travel very far. In other 
countries, including the United States and some Latin 
American states, competing private broadcasters oper- 
ated within government licensing and regulation. In a 
third group of countries, including Japan, government- 
owned and private broadcasters competed with each 
other. Regional broadcasting unions existed among 
Arab, Asian, Eastern European, Western European, 
and Latin American broadcasters, but only in the latter 
two was interbroadcaster exchange of programs well 
established (Pool 1979, 132). 

Broadcasters could also buy foreign-origin program- 
ming from producers or syndicators. Surveys of broad- 
casting practices in selected countries during 1970- 
71 revealed considerable variation in the extent of 
such purchasing (Nordenstreng and Varis 1974). In the 
United States only 1-2% of broadcast material was im- 
ported. Government-owned stations in Japan aired al- 
most no imported programming; private ones imported 
about 10% of their programs. About 30% of the pro- 
grams shown in most Western European states was im- 
ported. Latin American reliance on imported programs 
varied considerably, from almost none in Cuba to 84% 
in Guatemala, with most countries’ broadcasters using 
imported programs 30 to 50% of the time. Though vary- 
ing degrees of intraregional importing existed, the main 
sources of imports were the United States (by a wide 
margin), followed by France, the United Kingdom, and 
West Germany. 

The politics of TV broadcasting remained local and 
the strength of parallel interest in maintaining na- 
tional autonomy meant that there was little basis for 
creating cross-border policy coalitions favoring trans- 
border activity. Governmental and commercial broad- 
casters alike regarded potential foreign direct satel- 
lite broadcasters as rivals rather than partners, as they 
would not need local partners to reach viewers. U.S.- 
based broadcasters—with their already strong mar- 
ket share in programming, ample funds, and access 
to space capability—inspired the greatest worry, but 
even government-owned broadcasters in neighboring 
countries could cause concern where strong political 
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rivalries or ideological differences prevailed. Whereas 
the extent of foreign TV watching in some border 
areas suggested that viewers might welcome foreign- 
origin broadcasts, their preferences had no impact 
on national broadcasting policy. Authoritarian govern- 
ments of all ideological persuasions sought to control 
viewer choice. In most Third World countries intel- 
lectuals strongly supported government policies ex- 
cluding Western, particularly U.S.-origin, programming 
(Gauhur 1989). Similar sentiments, aimed at US.-origin 
material, prevailed among Canadian and Western Eu- 
ropean governments and intellectuals. Even govern- 
ments like those of India, Brazil, and Mexico engaged in 
developing domestic satellite TV systems distinguished 
very sharply between controllable domestic broadcast- 
ers and uncontrollable foreign ones. 

Strong relations between governments and local 
broadcasters were manifest in Western Europe and the 
United States as well. Western European governments, 
eager to protect their national broadcasting authori- 
ties, inhibited development of a privately owned re- 
gional satellite consortium even after the government 
of Luxembourg broke ranks and licensed one in 1983 
(Hirsch 1986). The U.S. government denied licenses to 
all satellite TV broadcasting other than domestic ed- 
ucational ventures until 1982. This appears paradox- 
ical because it kept U.S. broadcasters from develop- 
ing the extensive activity anticipated so fearfully in 
other countries but conformed to the preferences of 
key domestic actors (Queeney 1978, 9; Williams 1984). 
The government gave higher priority to protecting its 
own and the major communications companies’ invest- 
ments in the Intelsat consortium throughout the 1960s 
and 1970s. Owners of established over-the-air and ca- 
ble TV networks also opposed licensing because they 
were not interested in facing any more competition. 
New firms had little incentive to establish satellite TV 
operations until affordable home satellite dishes and 
receivers were developed in the mid 1980s because the 
established terrestrial networks already covered virtu- 
ally all of the United States. Only in the mid 1980s— 
and mainly through newcomers like CNN and United 
Satellite Communications—did U.S. firms emerge as 
satellite broadcasters. 

The domestic coalitions in most countries found it 
easy to converge on restricting international direct 
satellite broadcasting for two reasons. First, all media 
experts agreed that TV has much stronger effects on 
audiences—many even believed that TV broadcasts 
could be used to implant subliminal messages by flash- 
ing signals too fleeting for conscious perception but ca- 
pable of swaying the unconscious mind (Quester 1990, 
5-76, summarizes these arguments). Thus for govern- 
ments and intellectual critics of Western, particularly 
commercial, TV programming, there was enough dif- 
ference between TV and radio signals to justify re- 
jecting the shortwave “broadcast at will, jam if can” 
solution. Second, governments, intellectuals, and local 
broadcasters alike assumed that the U.S. lead in satel- 
lite technology meant that direct satellite broadcasting 
would only reinforce the already strong place of U.S. 
media companies in world programming and thereby 
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strengthen U.S. “cultural imperialism.” Thus Leninist 
governments seeking to create a new socialist culture, 
Third World governments seeking to maintain or re- 
assert indigenous cultures, and Western Europeans or 
Canadians wishing to insulate their societies against the 
negative impact of U.S. popular culture and/or commer- 
cial advertising (Kerdoun 1988, 701-2; Queeney 1978, 
202-11; Sauvant 1979, 15-16) could all agree that cross- 
border satellite TV broadcasting was a threat to na- 
tional culture and autonomy that needed to be limited 
in some way. 

Governments advocating severe restrictions on 
cross-border satellite broadcasting quickly took the 
initiative in international forums. The Brazilian gov- 
ernment proposed establishing UN supervision over 
satellite TV broadcasting in late 1962, whereas the 
French and a number of Third World governments 
favored a moratorium on satellite broadcasting until 
international rules were developed. The Soviets and 
Americans persuaded delegates at the ITU’s Extraor- 
dinary Administrative Radio Conference in 1963 to 
allow operation of domestic “experimental systems” 
but accepted that systems capable of reaching audi- 
ences in other states should be established only after 
agreement with the governments of those states. The 
Egyptians proposed a set of restrictive regulations in 
1966 and Third World inerest succeeded in getting the 
issue greater priority on the Outer Space Committee 
agenda (Smith 1976, 189-91; Queeney 1978, 30-31). 

The tenor of much governmental and public opinion 
in the Third World was revealed in January—February 
1970 when a projected cooperative educational TV ar- 
rangement linking U.S. universities with counterparts 
in the Andean countries fell apart in the uproar caused 
by the Colombian government’s condemnation of the 
project. The Colombian government accused the U.S. 
participants of threatening to begin broadcasts even 
without the Latin American participants’ agreement to 
a programming schedule and alleged that the project 
was merely a front for U.S. corporations seeking to 
destroy Latin American cultural integrity (Pool 1979, 
132-33). 

When the UN Outer Space Committee began ex- 
tended discussion of rules for satellite broadcasting in 
1970, advocates of receiving state controls over pro- 
gram material were the vast majority. However, they 
failed to develop a common negotiating position be- 
cause different groups of governments advocated dif- 
ferent mixes of controls over satellite use and signal 
content. A few governments focused mainly on the first, 
and would have been satisfied with rules requiring re- 
ceiving state consent to the existence of a cross-border 
broadcast service. Most governments also desired a de- 
gree of signal control and proposed that receiving state 
consent be required for both the existence and the basic 
programming schedule of a cross-border broadcasting 
service. The most adamant advocates of signal control 
went further by proposing to require receiving state 
consent before broadcasting each program. 

As all negotiators realized, prebroadcast review of 
individual programs meant granting receiving state 
governments the right to censor broadcasters located 


in other states. This was the very point on which the 
shortwave negotiations had foundered, and the con- 
tinuing discussions of information issues in other UN 
forums revealed that this disagreement had not gone 
away. Canadian and Swedish negotiators quickly per- 
ceived the strong potential for stalemate and sought to 
avert it by proposing that cross-border broadcasting be 
undertaken only by regional consortia of broadcasters 
who would jointly select the programming and operate 
the system (Canada and Sweden 1970). Though based 
on experience with the Eurovision program exchange 
(described in Queeney 1978, 62), this idea failed to 
satisfy other negotiators. The Americans and Japanese 
opposed requiring any single organizational structure 
for international broadcasting, whereas the Soviet bloc 
and other advocates of prebroadcast program review 
insisted that regional consortia whose broadcasts could 
be seen on the territory of nonmember states should 
be subject to the same program review requirements 
as national broadcasters (Fisher 1989, 102-5). 

The U.S. and other governments advocating less com- 
prehensive receiving state controls over signal con- 
tent sought to reduce the level of ideological conflict 
by supporting adoption of technical regulations lim- 
iting the extent of cross-border broadcasting at the 
ITU’s 1971 World Administrative Radio Conference 
on Space Telecommunications (WARC-ST). The con- 
ference agreed to limit satellite broadcasts by (1) allo- 
cating frequencies to satellite broadcasting that could 
not be received by existing home TVs, (2) reserv- 
ing geostationary orbital slots and associated channel 
assignments to national governments so that an “in- 
ternational” service could be started only if govern- 
ments agreed to share them, and (3) requiring that 
governments or firms operating domestic broadcasting 
services limit “overspill” of signals onto the territo- 
ries of neighboring states as much as possible (ITU 
1971). Together these rules made it impossible for 
any one government to operate or authorize nationals 
to operate a global direct satellite broadcasting sys- 
tem. Western governments hoped that these regula- 
tions would result, as had similar rules in other areas 
of telecommunications, in devolution of detailed rule- 
making to regional groupings, but governments seek- 
ing strong signal control still sought adoption of global 
rules. 

After the ITU technical regulations were reaffirmed 
in 1977, some Western European delegates again 
expressed hopes that governments would accept a de- 
centralized approach to program content rules (e.g., 
Belgian delegate 1979, Dutch delegate 1977, and 
United Kingdom 1977). However, Third World advo- 
cates of receiving state control over signal content still 
insisted on the need for a set of common global rules 
(e.g., Indian delegate 1977). Soviet bloc governments 
even claimed that the 1977 reaffirmation bound gov- 
ernments to accept extensive receiving state control 
over program content (Bulgarian delegate 1977; Soviet 
delegate 1979). As the 1979 negotiating session (Legal 
Subcommittee 1979) made clear, the constellation of 
preferences had not shifted significantly at the end of 
the decade. 
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Efforts to escape the deadlock took the common 
diplomatic form of trying to work out verbal formu- 
lations that would bridge most of the divides and 
win such widespread support that isolated holdouts 
could be pressured into accepting the resulting com- 
promises. The Canadian and Swedish governments 
had begun this work in 1973 (Canada and Sweden 
1973; Swedish delegate 1973) and renewed their efforts 
in 1979 with a draft assembling all the formulations 
they regarded as having the widest support (Canada 
and Sweden 1979; Canadian delegates’ rationales in 
Chapman and Warren 1979 and Dickson 1977). How- 
ever this new effort soon foundered over divergent 
interpretations of the key phrase “agreement to an in- 
ternational direct broadcast service.” The U.S. govern- 
ment defined it as agreement on the existence of such 
a service, other Western states understood it to include 
agreeing on the broad outline of types of programs, 
whereas many Third World and all Soviet bloc gov- 
ernments interpreted it to mean “prior consent” to the 
service, the types of programs, and the content of each 
individual program (positions summarized in Gorove 
1985, 6). 


Remote Sensing 


When the U.S. government announced its intention 
to develop a civilian satellite remote sensing system 
in September 1969, other governments could draw on 
either of two existing activities for appreciating its im- 
plications: military reconnaissance and weather obser- 
vation. Both forms of activity were carried out unilater- 
ally by the superpowers, though weather satellite data 
were distributed globally through the WMO’s World 
Weather Watch data reporting system (described in 
WMO 1966). Satellite reconnaissance had strongly neg- 
ative connotations because it was simply a form of 
snooping by which others could learn things that gov- 
ernments preferred to keep secret. These negative 
connotations were strong enough that the Soviets had 
invoked them in its early efforts to have satellite recon- 
naissance declared illegal under international law (e.g., 
Kislov and Krylov 1956 and Zhukov 1960), an effort 
they abandoned after developing their own satellite 
reconnaissance capability in 1962 (e.g., Piradov 1976, 
136; also see Gaddis 1988.). Neither superpower pub- 
licized its satellite reconnaissance, but other govern- 
ments were aware of it. They did not like it but real- 
ized that they could not stop it and believed that the 
superpowers would not give it up voluntarily (Dalfen 
1972, 193 n. 36). Though worried about data sharing 
with other governments, they could take some comfort 
from the lack of data sharing with private entities. In 
contrast, using satellites for weather observation had 
strongly positive connotations. Interest in using satel- 
lites as weather observation platforms arose in the mid 
1950s as meteorologists realized that they would not be 
able to develop computer-based “numerical weather 
prediction” without acquiring more extensive observa- 
tion data (WMO 1955). Observation data covering up- 
per atmosphere winds and conditions over the 70% of 
Earth covered by oceans were particularly scanty, and 
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satellite technology appeared to provide a way of filling 
much of the gap very rapidly. Results produced by the 
United States’ TIROS-1, the first dedicated weather 
observation satellite, after its April 1960 launch were 
greeted enthusiastically by the WMO (1960, 83) and 
the UN General Assembly (General Assembly 1961). 

Governments could hold such apparently inconsis- 
tent attitudes toward satellite observation because of 
differences in the information produced. The sensors 
used for military reconnaissance in the 1960s yielded 
images capable of identifying bases, missile launch sites, 
ships, and other large military installations, whereas 
the improved sensors in use by 1980 could identify 
troop concentrations and smaller military objects (see 
Burrows 1986). The low-resolution sensors used for 
weather observation were designed to identify cloud 
masses; later weather sensors were designed to detect 
wind movements and temperatures at various heights 
(see Widger 1966). 

The domestic politics of remote sensing varied con- 
siderably by country. The closest analogous activ- 
ity, taking photographs from aircraft, was an estab- 
lished practice in some countries, where national policy 
ranged from government monopoly to allowing com- 
petition among private firms. In most countries, how- 
ever, neither the government nor private citizens had 
much experience with aerial photography and no clear 
policy coalitions had formed. This created a relative 
policy void in which political leaders and diplomatic 
generalists could shape national policy on their own. 

The U.S. government took this policy void into ac- 
count when designing its effort to spark positive inter- 
est in, and reduce fears about, civilian remote sensing. 
It sought to modify government attitudes and encour- 
age use of remote sensing data abroad through hard- 
ware choices and familiarization. On the hardware side, 
NASA consciously chose a relatively low, 80-meter 
(260-feet) resolution for Landsat-1’s sensors to create 
an obvious distinction between civilian remote sens- 
ing programs and military reconnaissance (Allen and 
Latham 1980, 147), a gap left in place even with the 
shift to 30-meter (95 feet) resolution on some Landsat- 
2 sensors in 1975. 

US. familiarization efforts proceeded along, three 
tracks: providing instruction in use of remote sensing 
data for foreign as well as U.S. nationals, encouraging 
use of data by providing it inexpensively, and creating 
an active role in remote sensing operations for other 
governments, The user training programs had begun 
even before September 1969 and continued afterward; 
in 1975-76 the U.S. Interior Department’s EROS Cen- 
ter alone enrolled at least 130 foreigners in its programs 
(see Mack 1990, 141-45). The U.S. government encour- 
aged demand for data in two ways. First, it offered 
to provide images and data tapes free of cost to sci- 
entists anywhere in the world desiring to use them in 
research (U.S. delegate 1969) and invited foreign scien- 
tists to the scientific symposia where Landsat designers 
described the forms of data that would be produced 
and the sorts of projects for which it would be useful 
(Mack 1990, 144, 147). By mid-1974, Landsat data had 
been provided to two international organizations and 
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scientists in 37 countries (U.S. delegate 1974). Second, 
it sold data to other users at cost of reproduction rather 
than trying to recapture any of the costs of developing 
and operating the satellites. At $15 per photographic 
image and $200 per data tape, the Tanzanian govern- 
ment was able to buy data providing an initial survey 
of approximately 68,000 square kilometers of land in 
the Rukwa region for $1,000 rather than paying about 
$23,000 for an aerial survey of about half that area (Paul 
and Mescarenhas 1981, 214). In 1973 foreign entities, 
mainly government agencies and private mining and oil 
companies, ordered 13,000 images or data tapes from 
Landsat centers (Brown et al. 1977, 134). 

The need for active foreign collaboration arose be- 
cause Landsat’s tape system could only store about 
12 minutes’ worth of data (Allen and Latham 1980, 
145). Whether triggered by a Canadian suggestion (as 
claimed in Mack 1990, 191) or its own appreciation of 
the situation, NASA invited foreign government agen- 
cies to become directly involved in the Landsat pro- 
gram by building and operating ground data receiving 
stations. The United States already had ground stations 
in various parts of the world for tracking and com- 
manding reconnaissance satellites but wanted a sep- 
arate Landsat network for technical and security rea- 
sons (Burrows 1986, 174-98). Building and operating 
a Landsat ground station required significant financial 
and operational commitments. The U.S. government 
did encourage station construction by allowing a station 
operator to keep a copy of all data its station received 
for a $200,000 annual access fee, providing low-cost 
training for station operators and data analysts, and 
directing third party orders for images or tapes of data 
received to the ground station operator initially receiv- 
ing it. However, ground station owners were expected 
to bear all the construction costs—which could run to 
several million dollars—and operating expenses—then 
some $750,000 to $1,600,000 a year (Yachinski 1980, 
160). Receiving station operation was thus most at- 
tractive to smaller Western bloc and neutral industrial 
states lacking their own space programs and to more 
advanced developing countries with enough territory 
to need large amounts of Landsat data. The Canadi- 
ans, Brazilians, and Italians were the first to join in; 
by December 1980 the Argentine, Australian, Indian, 
Japanese, South African, and Swedish governments 
had also constructed Landsat ground stations (list in 
Outer Space Committee 1985). 

Yet even with these U.S. efforts in place and the 
distinction between weather data and military recon- 
naissance available, most governments’ initial reaction 
to civilian remote sensing was negative. This produced 
a constellation of preferences in the UN remote sens- 
ing discussions very similar to the constellation already 
established in the direct TV broadcasting discussions. 
The U.S. government took a position very similar to 
its stance on satellite TV broadcasting in proposing 
to allow states freedom to undertake satellite remote 
sensing operations and wide latitude to distribute the 
resulting data to other governments and to private enti- 
ties. Again, the Soviet bloc and many Third World gov- 
ernments sought a regime allowing broad sensed state 


controls, whereas Western European governments 
wanted less comprehensive controls but were sympa- 
thetic to Third World concerns. 

As in the satellite TV broadcasting negotiations, the 
proponents of strong sensed state controls failed to con- 
verge on a common negotiating position. Several Third 
World governments wanted control over both satellite 
use and data distribution (e.g., Brazil 1974, articles 3, 
7, 8). They agreed with the Argentine assertion that “it 
is self-evident that the remote sensing of the territory 
of a foreign state without its prior consent is an illicit 
international act which the international community 
should avoid and, if it nevertheless occurs, should pun- 
ish” (Argentine delegate 1973). This time, however, the 
Soviets did not insist on satellite control. They were 
already engaged in some remote sensing from manned 
spacecraft, as well as running reconnaissance satellites, 
and wanted to maintain discretion to engage in remote 
sensing. Their position on the issue was quite close to 
that of the French, whose proposed regime included 
controls on data distribution but not on satellite use 
(France 1973; USSR 1973). 

Some delegates sought to avert stalemate by propos- 
ing that remote sensing become a joint activity. Early 
in the discussions, the Argentines proposed that satel- 
lite operators turn their data over to a UN-run data 
bank and, once the technology had been fully proven, 
also turn over operation of the satellites (Argentina 
1970). The Swedes returned to the idea of joint interna- 
tional operation in 1974 (Swedish delegate 1974). Tak- 
ing cues from superpower attitudes, most governments 
prepared for negotiations by assuming that national op- 
eration of remote sensing satellites would remain the 
tule. 

With the question of who would operate remote sens- 
ing satellites settled in favor of individual governments, 
negotiators turned to the task of defining the types of 
data gathering that would be covered by the new term 
“civilian remote sensing.” Most quickly acknowledged 
that it would not cover gathering of military intelligence 
or of weather data, which were covered by other rules. 
By 1974, most governments agreed that the term “civil- 
ian remote sensing” referred primarily to acquisition of 
data relating to economic activity or useful in the for- 
mulation of economic policy (e.g., France and USSR 
1974). This focus on economically significant activity 
deemphasized military intelligence, though it did not 
keep proponents of strong sensed state controls from 
invoking notions of “economic espionage” (e.g., Iraqi 
delegate 1971). It also deemphasized the mutually ben- 
eficial scientific and meteorological uses of the technol- 
ogy, giving policy generalists concerned with promot- 
ing the New International Economic Order and the 
New World Information Order the predominant voice 
in Third World national policy debates. 

Thus it was not surprising that advocates of broad 
sensed state control invoked the principle of “perma- 
nent sovereignty over natural resources and wealth” to 
justify their position (Argentina 1970, article 7; Brazil 
1974, article 2). Soviet bloc governments even argued 
that the that principle covers information about re- 
sources as well as the resources themselves (Mongolia 
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1976; USSR 1973, par. 2), despite the fact that the main 
General Assembly Resolutions endorsing the principle 
did not mention information about resources (General 
Assembly 1962, 1973, and 1974). Western governments 
strongly opposed this extension for practical and nor- 
mative reasons. On the practical plane, they were 
familiar enough with computer manipulation of digi- 
tized data to know that it would be difficult to separate 
remote sensing data into clear categories of resource- 
related and other. On the normative plane, they feared 
that the lack of clear distinction could encourage treat- 
ing all remote sensing data as “information about re- 
sources” and thereby severely weaken the principle of 
free flow of information. 

As negotiations continued in the 1970s, more gov- 
ernments acquired first-hand experience with remote 
sensing data and local users began to emerge as dis- 
tinct participants in domestic policy debates. In the late 
1970s governments without remote sensing satellites 
could acquire data in several ways. The small but grow- 
ing number of data receiving station operators received 
raw data transmissions covering areas within range of 
their station, which they then formatted and processed 
before proceeding to analysis. Others could wait a few 
weeks and purchase computer tapes of processed data 
from the U.S. government or a ground station operator 


for $200 a tape, or they could wait a few months and ~“ 


acquire computer-enhanced images from a third-party 
data analysis firm for $1,000-$2,000 each (Lazarus 1983, 
105). Developing country governments were often pro- 
vided with data tapes or analyzed unages through U.S. 
or multilateral aid programs (Bishop 1986, 327; Joyner 
and Miller 1985, 79). Most national meteorological ser- 
vices received weather data free through the World 
Weather Watch network. In 1972, 165 ground stations 
in 76 countries were receiving 100-meter resolution Au- 
tomatic Picture Transmission (APT) data (WMO 1972, 
map after p. 16 checked against a map of world political 
boundaries), and the number had grown by the end of 
the decade. As scientific researchers and data analysis 
firms found ways to enhance weather data, it came to 
be used for resource management and environmental 
monitoring as well as weather prediction (Allen and 
Latham 1980, 148-49). 

This growing local familiarity began to have percep- 
tible effects on the Outer Space Committee discussions. 
These first appeared in discussions of sensed state con- 
trol over satellite use. Current and future satellite op- 
erating governments all opposed controls on satellite 
use. Though announcing intended activities publicly, 
recruiting foreign scientific and other users of remote 
sensing data, and enlisting other governments as part- 
ners in tracking and data downloading, the U.S. govern- 
ment never asked other governments for permission to 
gather remote sensing data (Burrows 1986, 141-42). 
The Soviets and French indicated their similar atti- 
tudes in their 1973 and 1974 proposals. However, many 
Third World governments yielded to this unanimity 
of spacepower preference only after some additional 
discussion. Both Western and other technical experts 
advised not only that it would be difficult to imple- 
ment a rule requiring prior sensed state consent to data 
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gathering, but that denying permission to sense would 
reduce the scientific value of remote sensing. They also 
argued that “raw data” received from satellites were 
not as important as the “analyzed information” pro- 
duced after visual or computer processing of images. 
Most Third World governments accepted these argu- 
ments in the late 1970s (see Andhyarujina 1977, 522, 
and Special Political Committee 1978). Though a few 
held out (Chilean delegate 1978; Syrian delegate 1982), 
the Indian government expressed a widely shared atti- 
tude when it acknowledged in 1981 that “sensing of one 
country by another would appear to have been tacitly 
accepted de facto” (India 1981, par. 89). 

Signal control—rules about distribution of data— 
thus became the primary focus of negotiations. Gov- 
ernments’ material stakes in controlling data dissemi- 
nation had been expressed by the Canadian minister of 
mines, energy, and resources in a 1971 speech empha- 
sizing the need to protect Canadian companies from 
foreign competitors using satellite remote sensing data 
to identify areas worth exploring for resource deposits 
(quoted in Dalfen 1972, 192-93 n. 34). Third World gov- 
ernments also worried that remote sensing would give 
already-powerful foreign firms the additional advan- 
tage of possessing more information about the resource 
potential of their country than they possessed them- 
selves. In the early 1970s these concerns were strong 
enough to polarize debate on South-North lines par- 
allel to those dominating the satellite TV broadcasting 
debates. 

However, the growing prominence of environmental 
concerns opened up a cross-cutting dimension of de- 
bate. Environmental issues had gained a definite place 
on UN agendas with the 1972 UN Conference on the 
Human Environment, and Third World governments 
were beginning to accept that they were real prob- 
lems, not simply sham excuses for discouraging their 
economic development. The 1974 Franco—Soviet draft 
principles introduced this new element into the negoti- 
ations by exempting “information on natural disasters 
and phenomena which can be detrimental to the envi- 
ronment in general” from the controls on data distri- 
bution (France and USSR 1974, art. 5c). 

Growing appreciation of remote sensing as an envi- 
ronmental policy tool inspired a lengthy effort to for- 
mulate some rule that would permit wide dissemination 
of environmental data while limiting dissemination 
of economic data. The Soviets initiated this discus- 
sion in 1977 by proposing quantitative distinctions 
among “global,” “regional,” and “local” data, with 
only the last subject to sensed state control of distri- 
bution (USSR 1977; revised version in USSR 1979). 
They applied the term “global data” to images hav- 
ing viewing areas extending several hundred kilome- 
ters acquired by sensors having low resolutions of 
several hundred meters, “regional data” to images hav- 
ing viewing areas extending 200 to 800 kilometers ac- 
quired by sensors having resolutions no finer than 50 
meters, and “local data” to images having viewing ar- 
eas less than 180 kilometers long or wide or acquired 
by sensors with resolutions finer than 50 meters. The 
Indians (Indian delegate 1979) and French (France 
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1977) took up the idea, though the French modified it 
considerably by proposing that controls cover only data 
acquired with sensors having resolution finer than 10 
meters—a definition that would exempt all data from 
their planned SPOT system from the prior consent re- 
quirement. However, enough delegates doubted that 
quantitative resolution distinctions would reliably dis- 
tinguish among types of data to prevent resolving the 
data distribution issue in this fashion. 


DIVERGENT PATHS TO DIFFERENT 
INTERNATIONAL REGIMES 


The results of the wide-ranging discussion of space ac- 
tivity at the Second UN Conference on the Exploration 
and Peaceful Uses of Outer Space (UNISPACE’82) 
in August 1982 suggested to knowledgeable observers 
that Third World desire to control transborder infor- 
mation flows would result in establishment of interna- 
tional regimes imposing considerable government con- 
trols over both direct satellite broadcasting and satellite 
remote sensing (e.g., OTA 1983, 7-8, 52). Yet signs of di- 
vergence in the paths of the two negotiations were also 
apparent to those who looked more closely. Whereas 
UNISPACE’82 discussions of direct TV broadcasting 
from satellites remained framed in terms of informa- 
tion flows to be controlled, discussions of satellite re- 
mote sensing were already including demands that the 
United States guarantee continuity of Landsat data 
(UN 1982; also noted in OTA 1983, 8). 


Direct TV Broadcasting 


It was easy to miss the early signs of divergence be- 
tween the two negotiations because international con- 
troversies about transborder information flows were 
then running at fever pitch. Third Word demands for 
creation of a New World Information and Communi- 
cations Order and a reintensification of the Cold War 
contributed considerably to a hardening of negotiat- 
ing positions all around. U.S. government unhappiness 
with majority positions in the UNESCO debates on 
the NWICO hardened into outright rejection by the 
Reagan administration in 1981. Heightened confidence 
among Leninist governments, which interpreted U.S. 
defeat in Southeast Asia, the advent of Leninist regimes 
in Afghanistan, Angola, Ethiopia, and South Yemen, 
and the upsurge of leftist revolutionary movements in 
Central America, as confirmation that the “correlation 
of forces” had tipped decisively in their favor. The 
waves of democratization that would transform politics 
in many Third World countries had not yet occurred. 
Technological developments added a material rea- 
son for the governments most strongly supporting 
limits on satellite TV broadcasting to seek an early 
resolution of the issue. The more powerful satellite 
transmitters that began coming into use late in 1980 
would allow viewers to receive signals with dish anten- 
nas two to three meters in diameter rather than the 10- 
meter diameter “dishes” that had been standard since 
1962. Though not inexpensive enough to become a 
mass-market item, they could be acquired by wealthier 


households or neighborhood groups (including local 
cable rebroadcasters). At the UN many governments 
became anxious to finalize the declaration of principles 
so they could move on to their main goal, developing 
a binding treaty regime, before local possession of the 
new-generation antennas became widespread. 

Perceiving this anxiety, the governments hoping to 
find a consensus solution redoubled their efforts. In 
April 1981 the Canadians joined with six Latin Amer- 
ican, three Asian, and two African delegations to of- 
fer a new proposal (Twelve Powers, 1981) packaging 
together provisions they believed had the widest sup- 
port and urged the Outer Space Committee to accept 
it as the main negotiating draft. However, stalemate 
persisted, as both the United States, on one side, and 
the Soviet bloc-Third World hardliners, on the other, 
rejected the proffered compromise. That fall Brazilian, 
Egyptian, Chilean, and Kenyan delegates warned that 
if stalemate persisted, they would bypass the Outer 
Space Committee and take a draft to the General As- 
sembly for adoption by majority vote (Special Political 
Committee 1981). 

When consensus still failed to emerge in the spring 
and summer of 1982, a Third World—Soviet bloc coali- 
tion confident of broad support tabled an alternate 
proposal (Majority Coalition 1982) more strongly ori- 
ented toward limiting direct satellite TV broadcasting 
than the Twelve Powers draft. Its tenor was evident in 
Principle I, which stated that “activities in the field of 
international direct television broadcasting by satellite 
shall be carried on in a manner compatible with the 
sovereign rights of States, including the principle of 
non-intervention, as well as with the right of everyone 
to seek, receive and impart information and ideas as 
enshrined in the relevant United Nations instruments.” 
Receiving state rights also received strong protection 
later in the draft. Principle 13 specified that “a State 
which intends to establish or authorize the establish- 
ment of an international direct television broadcast- 
ing satellite service shall without delay notify the pro- 
posed receiving State or States of such intention and 
shall promptly enter into consultation with any of those 
States which so requests.” Principle 14 required that 
“an international direct television broadcasting satel- 
lite service shall only be established after the condi- 
tions set forth in paragraph 13 above have been met 
and on the basis of agreements and/or arrangements in 
conformity with the relevant instruments of the Inter- 
national Telecommunications Union and in accordance 
with these principles.” 

Though the Third World-Soviet bloc proposal did 
not specifically endorse receiving state control over 
individual program content, it was widely interpreted 
that way for two reasons. First, Soviet bloc and a sig- 
nificant number of Third World delegates continued 
to define the term “agreements and/or arrangements” 
as including acceptance of program content controls. 
Second, the 1982 text dropped a “saving clause” in the 
preamble of the 1981 Twelve Powers text specifying 
that governments recognize “the unique characteris- 
tics of such satellite broadcasting not encountered in 
other forms of broadcasting which necessitate besides 
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relevant technical regulations also principles solely ap- 
plicable in this field.” The saving clause was an effort 
to persuade the U.S. government and other strong ad- 
vocates of free flow of information that any statement 
of rules for the satellite TV broadcasting would not be 
used as a precedent in the New World Information Or- 
der debates then raging in UNESCO and the General 
Assembly. Its omission suggested that the Soviet bloc 
and Third World supporters of censorship would use 
the principles as precedent in those debates and even in 
international negotiations about regulating other types 
of broadcasting, such as the upcoming ITU conference 
on shortwave services. 

The 1982 draft, which became General Assembly 
Resolution 37/92, elicited more than the fully antic- 
ipated U.S. opposition and was reported out of the 
Special Political Committee (1982) on a vote of 88 in 
favor, 15 against, and 11 abstaining. The vote in the 
plenary was more impressive in terms of numbers— 
107 in favor, 13 opposed, and 13 abstaining (General 
Assembly 1982)}—however, the opposed and abstainers 
included all the Western industrial states without whose 
cooperation a viable regime could not exist. 


Remote Sensing 


Innovations in satellite remote sensing during the early 
and mid 1980s did not trigger a similar sense of urgency 
about the need to finish negotiations even though con- 
cern about military uses of outer space had risen again 
as superpower tensions intensified. The military recon- 
naissance potential of satellite remote sensing received 
greater attention in the 1980s as the French 10-meter 
SPOT system came closer to operation, Third World 
governments paid considerable attention to allegations 
that superpowers were sharing data selectively with al- 
lies or client states, and discussions of a proposed “me- 
diasat” consortium among private news organizations 
became public knowledge. This concern did not trans- 
late directly into demands to complete negotiations 
quickly for several reasons. Some governments appre- 
ciated the arms control monitoring potential of satel- 
lite imagery (Jacku and Trecroce 1980). At least a few 
realized that commercially available satellite imagery 
could provide even relatively poor countries with some 
reconnaissance—a possibility borne out in Iranian use 
of SPOT imagery to locate some Iraqi targets in 1988 
(Perry 1992). 

More significant for the course of negotiations was 
the influence of the growing number of domestic ac- 
tors in Third World countries interested in securing 
and using remote sensing data. Some of the users were 
research scientists, but most were members of govern- 
ment agencies who realized that remote sensing data 
would make their work more effective. In 1977, the 
most active Brazilian users of data coming from the 
Landsat ground station were the state planning agency, 
the agency administering the Amazon region, and the 
state oil company Petrobras. Whereas some locally 
based private firms used remote sensing imagery, the 
vast majority of Brazilian and other Latin American 
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users were government agencies (Hammond 1977). A 
similar array of users then developed in Africa and 
South Asia (Paul and Mescarenhas 1981). 

The influence spread as the user community in- 
creased in size. By 1981 more experienced Third World 
users were themselves offering training courses for na- 
tionals of neighboring countries (e.g., Kenya 1981). As 
more non-U.S. participants became involved, NASA’s 
Landsat Station Operators Group became a forum for 
active dialogue between ground station owners and 
Landsat system designers and administrators (Bishop 
1986, 329). The spread of specialist journals provided 
the clearest evidence of the size and transnational ex- 
tent of the user community. By the early 1980s The 
International Journal of Remote Sensing (London), 
Remote Sensing of the Environment (New York), Issle- 
dovaniye Zamit iz Kosmosa (Moscow), The Canadian 
Journal of Remote Sensing (Ottawa), and Remote Sens- 
ing (Falls Church, Virginia) were all helping link local 
users around the world into what a French commen- 
tator called an “observing community” (Dutheil de la 
Rochere 1983, 364). 

The strength of the local nodes of this transnational 
user community did vary considerably from country 
to country. Lazarus (1983, 108) noted that the Kenyan 
government relied heavily on remote sensing data in 
land use planning but not in hydrocarbon prospect- 
ing, and that one West African government had built 
new port facilities on a silt zone that even a cursory 
glance at a Landsat image of the area would warn 
against using. Other commentators noted similar varia- 
tions and criticized most training programs for focusing 
on relatively simple applications rather than provid- 
ing a broad overview (e.g., Abiodun et al. 1988 and 
Malingrau 1985). Newer sensor and computer technol- 
ogy increased the cost of data analysis hardware suf- 
ficiently to take it beyond the budgets of many Third 
World governments, reinforcing the potential for con- 
centration of data analysis skills in the emerging pri- 
vate data analysis firms (Lazarus 1983, 110-12). Yet 
enough Third World government agencies were using 
remote sensing data for suggestions from such staunch 
friends as the Swedes and Yugoslavs (Special Political 
Committee 1980) that limiting data distribution would 
leave sensed states even more dependent on others for 
information by more firmly concentrating data inter- 
pretation and analysis activities in satellite-operating 
states to receive careful consideration. 

The political impact of these local users was rein- 
forced as technological developments opened up more 
land use planning and environmental monitoring uses 
of remote sensing data. From the beginning Third 
World governments’ attitudes toward remote sensing 
had wobbled because they welcomed its meteorological 
applications even as they about its economic and mil- 
itary ones. Growing realization among administrative 
agencies that new forms of information were needed 
to formulate and implement effective land manage- 
ment and environmental policies made access to re- 
mote sensing data attractive to government agencies 
previously unconcerned with it (Golden 1989; Litfin 
1999). 
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Local users also helped reduce Third World polit- 
ical leaders’ and policy generalists’ fears of remote 
sensing by helping them develop a more sophisticated 
understanding of the extent and limits of the advan- 
tages foreign mining and oil companies could derive 
from remote sensing data. Initially, many Third World 
governments saw all the advantages as flowing to the 
foreign firms. The government of Pakistan even made 
a formal protest when journalists revealed that Land- 
sat operators believed that they had identified ad- 
ditional copper deposits in that country (U.S. Probe 
Uncovers Copper 1975). Local experience with han- 
dling remote sensing data contributed to greater con- 
fidence that in most situations the information foreign 
firms got from remote sensing data needed on-ground 
confirmation before it could be used to advantage 
(Greenberg 1983, 387). Foreiga companies’ requests 
for access to a country’s territory to perform this con- 
firmation would provide notice of interest and give the 
relevant government agencies time to acquire remote 
sensing data themselves. Local users were critical to 
domestic policy debates in Third World countries in 
another way. No matter how sincere Western advo- 
cates of remote sensing were in their claims about the 
benefits of wide data distribution, they were not re- 
garded as trustworthy sources of information by Third 


World governments, and their arguments were easily > 


ignored as examples of “marketing” or “strategic com- 
munication” aimed at putting something over. Third 
world users, whose interests were more congruent with 
those of Third World policymakers and thus seen as 
less likely to be pushing a harmful policy, could provide 
persuasive endorsements of the arguments in favor of 
a permissive remote sensing regime (Druckman and 
Lupia 2000, 17). 

Local users’ increased policy significance first be- 
came apparentin Latin America, where they persuaded 
their governments to include ensuring continued access 
to data among the explicit national negotiating goals in 
the late 1970s (“Latin America” 1978). The impact soon 
went further, and was registered in broader shifts of 
national negotiating positions by several Third World 
governments. The change in Brazilian positions was 
among the most dramatic. In 1974 the Brazilians had 
joined the Argentines in advocating sensed state con- 
trol over both satellite use and data distribution, and 
in 1975 the US government needed to threaten cutting 
off all Landsat transmissions to their ground station 
to induce Brazilian compliance with stipulations that 
Landsat station owners sell images and data tapes to 
all third parties on a nondiscriminatory basis (Latin 
America Alters View on Landsat 1978, 17). By 1980 
the Brazilian National Institute of Space Research had 
a large data processing and analysis staff providing in- 
formation to a range of other government agencies, and 
in 1982 Brazilian delegates tabled proposals shifting 
the remote sensing negotiations toward acceptance of 
a broadly permissive regime. The extent of turnaround 
can also be seen in some Third World reactions to the 
prospect, much discussed in the mid 1980s (e.g., Joyner 
and Miller 1985) of data cost increases and other new 
policies raised by the U.S. decision to turn Landsat oper- 


ations over to a commercial entity. Active Third World 
users joined their Western counterparts in pressing for 
compatibility of data from new and existing sensors 
(see Doetsch 1985, 117), and policy generalists did not 
use the occasion to renew calls for a highly restrictive 
regime. 

Greater Third World sophistication about remote 
sensing was also manifest in the second round of the 
Outer Space Committees’ efforts to resolve the data 
distribution issue by distinguishing between data that 
may be distributed at will and data that may be dis- 
tributed only with prior sensed state consent. Most of 
the negotiators were aware that data analysts using 
newer-generation computers could extract a broad ar- 
ray of environmental and economic information from 
“low resolution” data (see Ambrosetti 1984, 17-18). 
They thus discarded the idea of using any resolution as 
the dividing line in favor of developing the French no- 
tion (French delegate 1978) of differentiating between 
ordinary and “sensitive” (or “critical”) data (compare 
draft principle XV in Working Group 1980, 1983). 
Even this effort collapsed, however, because negotia- 
tors could not develop a stable and easily implemented 
distinction between the two types of data. 

Meanwhile Third World governments ready to ac- 
cept permissive data distribution rules began exploring 
alternate ways of protecting themselves against abusive 
uses of data. Some proposed requiring that sensed state 
governments get prior or at least simultaneous access 
to data about their territories (Group of 77 1982; Work- 
ing Group 1981, draft principle XII). Others proposed 
holding a sensing state legally responsible for any harm 
to sensed states caused by independent analysts or third 
parties using data it had released (Brazil 1982, principle 
XI; Eight Powers 1983, principle XI; Mexico 1981, prin- 
ciple XI). Western governments acknowledged others’ 
concerns but were not willing to concede much on these 
points. They were not prepared to accept any require- 
ment that sensed states acquire data before other users. 
Although ready to accept legal responsibility for their 
own agencies’ actions, they refused to accept direct re- 
sponsibility for the activities of independent data an- 
alysts or other data users (Peyrefitte 1991, 206; U.S. 
delegate 1983). 

The strength of momentum toward agreement un- 
derlying this continued arguing over details was re- 
vealed very clearly in others’ reactions to the Brazilians’ 
February 1982 proposal (Brazil 1982). Like sponsors of 
the 1981 “Twelve Powers draft” on satellite TV broad- 
casting, the Brazilians were trying to move negotiations 
forward by assembling a draft out of ideas they believed 
had wide support among other delegates. Rather than 
being overtaken by a more restrictionist proposal, the 
Brazilian offering attracted important Third World sup- 
port the following year (Chinese delegate 1983; Eight 
Powers 1983) and encouraged others to take up more 
intense efforts to find a generally acceptable balance 
between sensing and sensed state concerns. 

The Austrians, whose interest in a successful outcome 
was strong because an Austrian traditionally chaired 
the Outer Space Committee, and the French then took 
the lead in efforts to develop generally acceptable texts 
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(Jasentuliyana 1988, 281). By the end of the Outer 
Space Committee’s 1985 session, an Austrian draft 
(Austria 1985) was accepted as the single negotiating 
text (Egyptian delegate 1985, speaking for the Group 
of 77; remarks of most Third World delegates in Special 
Political Committee 1985). 

The growing convergence of views allowed the Aus- 
trians to develop a draft accommodating both sensing 
state and Third World concerns at several levels. It en- 
dorsed freedom to sense but addressed Third World 
concerns by stating that remote sensing should be car- 
ried out for the benefit of all states, respect the principle 
of permanent sovereignty over natural resources and 
wealth, and respect the rights and interests of sensed 
states. Western concerns were accommodated by ex- 
cluding “information about natural resources” from the 
rendition of the permanent sovereignty principle. Sens- 
ing states were obligated to inform the UN Secretary- 
General of impending remote sensing programs and 
provide information about them to any government re- 
questing it. They were also encouraged to provide other 
states with opportunities to participate in remote sens- 
ing operations and technical assistance. Sensed state 
rights regarding data did not include control over dis- 
tribution and were defined in Principle XI as “access to 
[primary and processed data] on a nondiscriminatory 
basis and reasonable cost terms” and “access to the 
available analyzed information ... in the possession of 
any State participating in remote sensing activities on 
the same basis and terms.” 

Balancing also pervaded the liability provision in 
Principle XIV. The first half specified that “states oper- 
ating remote sensing satellites shall bear international 
responsibility for their activities and assure that such ac- 
tivities are conducted in accordance with international 
law, irrespective of whether such activities are car- 
ried out by governmental or nongovernmental entities 
or through international organizations to which such 
States are parties.” Western concerns about broad in- 
terpretations were addressed in the second half, which 
specified that “this principle is without prejudice to the 
applicability of the norms of international law on state 
responsibility for remote sensing activities.” The defini- 
tions in Principle I reinforced the limitation by defining 
“remote sensing activities” to include “activities in pro- 
cessing, interpreting, and disseminating the processed 
data” and using the distinct term “analyzed informa- 
tion” for the sorts of further computer enhancement or 
merging of remote sensing data with other information 
typically performed by independent data analysts. 

Some delegates continued to grumble about certain 
parts of the text. The Soviet bloc tried to interpret 
the text as extending sensing state liability for 
data abuse to the activities of independent analysts 
(Soviet and Vietnamese delegates 1985; Vereshchetin 
and Postyshev 1985, 247-50) whereas a few others com- 
plained about inadequate safeguarding of sensed or 
developing states’ interests (Special Political Commit- 
tee 1986). However, the Outer Space Committee pre- 
sented an agreed text based on the Austrian draft when 
the General Assembly convened in September. The 
Assembly’s Special Political Committee reported out 
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the draft by consensus (Special Political Committee 
1986), and the General Assembly adopted it by con- 
sensus as Resolution 41/65 (General Assembly 1986). 


CONCLUSION 


The similarities of international context mean that ac- 
counting for the different results of the direct TV 
broadcasting and remote sensing negotiations requires 
focusing on differences in the domestic political de- 
bates that produced national negotiating positions. By 
itself, however, looking at the domestic debates does 
not demonstrate the insufficiency of either rational 
choice or sociological institutionalist accounts of how 
states form preferences. Each can be restated to explain 
changes in state negotiating positions as the product of 
changed preferences among domestic actors influential 
enough to create shifts in government policy. In rational 
choice versions, domestic actors revise their policy pref- 
erences in response to shifts in internal or external con- 
ditions that alter their incentives sufficiently to change 
the results of their utility maximizing calculations (e.g., 
Freiden 1991 and Hiscox 1999). In the constructivist 
amendment to sociological institutionalist accounts fo- 
cusing on argument, the process of argumentation in- 
spires domestic actors to alter their beliefs and revise 
their policy preferences to be consistent with the new 
beliefs (e.g., Crawford 2002 and Risse 2000). 

Yet each restatement fails to explain important ele- 
ments of the divergence between the direct TV broad- 
casting and remote sensing negotiations. The rational 
choice version explains why existing domestic actors 
shift preferences but not how new actors acquire in- 
fluence in national policy debates. Rational choice ac- 
counts nicely explain the stability of domestic coalitions 
in the direct TV broadcasting negotiations: The actors 
in the policy process lacked any incentive to change pol- 
icy preferences. Rational choice accounts also suggest 
why local groups of remote sensing data users could 
become an effective policy force—their numbers were 
small enough that they faced low barriers to collective 
action and could easily coalesce into a coherent partici- 
pant in the policy process. However, they do not explain 
particularly well why actors already established in the 
policy process and holding very different policy prefer- 
ences let the local data users into discussions, listened 
to them, and adopted at least some of their views. 

The constructivist accounts emphasizing the impact 
of processes of argument are more attuned to the ebbs 
and flows of participation in policy arguments because 
they treat argumentation as a distinct mode of social 
interaction with its own form and momentum. Though 
acknowledging that powerful actors and coalitions will 
use their power to steer the process by allocating oppor- 
tunities to participate or selecting among substantive 
starting points, these accounts treat entering into an 
argument as agreeing to participate with other actors 
in a joint search for better policy where participants 
will treat one another as equals in the conversation and 
accept assertions based on superior substantive reason- 
ing rather than the power or social position of the as- 
serter (e.g., Risse 2000, 7-11). However, they have not 
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identified any particular mechanism sorting out which 
actors are likely to be regarded as meriting inclusion 
or the particular substantive propositions forming the 
framework for their conversation. 

Paying attention to the impact of situation defini- 
tions fills these explanatory gaps. Situation definitions 
link the substance and process of policy debates by 
suggesting not only the type of problem at hand and 
the means appropriate to addressing it, but also the 
actors with the best claims to inclusion in the effort 
to address it. The stability of situation definition in 
the direct TV broadcasting negotiations not only kept 
the debate running in the same substantive channels, 
but also reaffirmed the centrality of the same set of 
domestic actors. The change in situation definition oc- 
curring in the remote sensing negotiations as environ- 
mental concerns became more important and changes 
in remote sensing technology opened up a wider ar- 
ray of pollution-detecting, land use management, and 
resource policy enforcement uses of data not only al- 
lowed additional considerations to be brought into the 
discussion, but gave additional actors—the local data 
analysts and users — an influential place in national pol- 
icy debates. 

Comparing the direct TV broadcasting and remote 
sensing negotiations shows very clearly that, even for 
the same state at the same time, the questions “What do 
I want?” and “What am I?” do not have single answers 
across all issues and problems. Rather, the responses to 
those questions vary with responses to “What am I do- 
ing”? Classifying both direct TV broadcasting and re- 
mote sensing as problems in “control over transborder 
information flows” to be addressed through creation of 
an agreed global regime set up the negotiations as both 
an East-West and a South—North contention, laid out 
the basic contours of the problem, suggested the menu 
of reasonable policy choices, and structured the op- 
portunities for domestic actors who were not national 
political leaders, leaders’ immediate advisors, or the 
country’s delegates in the UN forums to get involved 
in the national policy debates. Stability in the definition 
of the sorts of signals at stake in the direct TV broad- 
casting debates helped reinforce the stability of the 
array of domestic actors participating in the national 
policy debates. Third World governments’ definitions 
of the national interest in controlling the organization 
of the broadcasting sector and protecting local culture 
remained stable, as did their conceptions of their re- 
sponsibility as defenders of that autonomy and culture. 
The increasing array of information available from re- 
mote sensing altered both conceptions of the issue and 
the array of participants in Third World policy debates. 
The new forms of information were very useful to gov- 
ernment administrators, who could recast satellite re- 
mote sensing as something conducive to the national 
interest and helpful in meeting government responsi- 
bilities as maintainers of order, managers of the na- 
tional economy, and protectors of ecological endow- 
ments. This change gave members of the transnational 
“observing community” consciously fostered by NASA 
Landsat managers entrée into their countries’ policy 
debates. As indigenous experts perceived as sharing 


their political leaders’ concerns with national auton- 
omy, the local data users were able to inspire a reevalu- 
ation of position in which the risks of others getting 
data did not disappear but were outweighed by the 
newly perceived advantages of having it available. Thus 
the same Third World governments that insisted on a 
restrictive regime for direct TV broadcasting in 1982 
were brought around to a different perspective on re- 
mote sensing, adopted new negotiating positions, and 
were willing to accept a broadly permissive interna- 
tional regime for remote sensing in 1986. 
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Ithough a large literature on delegation exists, few models have pushed beyond a core set of 
canonical assumptions. This approach may be justified on grounds of tractability, but the failure 
to grasp the significance of different assumptions and push beyond specific models has limited 


our understanding of the incentives for delegation. Consequently, 


the justifications for delegation that 


have received recent scrutiny and testing differ from some of the more plausible justifications offered 
by informal studies of delegation. We show that surprisingly few results in the literature hinge on risk 
aversion, and surprisingly many turn on the ignored, though equally canonical, technological assumption 
that uncertainty is fixed (relative to policies). Relaxing the key assumptions about dimensionality and 
functional forms provides a clearer intuition about delegation—one that is closer to classical treatments. 
The theory allows us to relate different institutional features (commitment, specialization costs, monitor- 
ing, multiple principals) to delegation’s observable properties. 


antiquity—the Book of Exodus refers to it— 
only in the last century has much scholarship 
examined its causes and effects, and only recently has 
research focused on developing and testing formal the- 
ories that explore the logic of delegation.’ Despite this 
recent surge in formal work on delegation, scholars 
have failed to generalize several important properties 
of the early pathbreaking models. Instead of revisiting 
assumptions that were initially imposed for the sake 
of convenience, recent papers tend to maintain and 
reinforce the cannonical assumptions, applying these 
limited models to ever-richer institutional settings. Al- 
though the breadth of understanding about delegation 
across various settings is impressive, this approach has 
come at a price. It has produced an understanding that 
only partly captures delegation’s informational ratio- 
nale in even the simplest contexts. The literature is rich 
in predictions about delegation in complicated institu- 
tional settings when stylized foundational assumptions 
are satisfied but poor in explanations for and intuitions 
about why a poorly informed boss chooses to delegate 
to subordinates with different preferences. Moreover, 
we lack an understanding of what basic features drive 
the boss’s choice of which agent is given control of 
policy. 
For example, scholars have provided detailed expla- 
nations about how separation of power systems affect 


. Ithough the practice of delegation dates back to 
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delegation by legislatures to agencies, but the basic 
question “Does the informational rationale for dele- 
gation hinge on the boss’s risk preferences?” has been 
ignored. Despite the analysis of complicated signaling 
and screening games between bosses with ex post con- 
trols and their subordinates, we do not yet understand 
when or why the ally principle (the boss picks the most 
ideologically similar agent as delegatee) fails in simple 
settings. 

We try to put models of delegation on more solid 
footing by examining a larger (more general) set of 
environments. We eschew the canonical assumptions 
of quadratic utility (or even risk aversion), unidimen- 
sional outcome spaces, and additive and homogeneous 
random shocks to policy. Our analysis focuses first on 
two basic questions: When does delegation occur? and 
To whom does the boss delegate? We then consider 
richer, institutionally driven extensions, e.g., the role 
of commitment and monitoring devices, the impact of 
multiple bosses, and the institution in which their pref- 
erences are aggregated. Through a sequence of nested 
models with varying levels of abstraction, the analysis 
isolates the relations between various assumptions and 
different findings. 

To give readers a sense of the theory’s fertility, we 
summarize four implications now.* (1) As the uncer- 
tainty between policies and outcomes rises a principal 
becomes more prone to delegate, regardless of her risk 
preferences. This holds for multiple principals (e.g., in 
a separation of powers system) if uncertainty increases 
“sufficiently.” (2) In political hierarchies with a single 
principal and multiple (informed) agents, if the latter 
can precommit to delivering particular outcomes, then 
they will compete away their informational advantages: 
In equilibrium the boss will get her ideal point (re- 
gardless of the policy space’s dimensionality). Hence 
in these circumstances the agents’ preferences do not 
matter: As in certain kinds of Wittmanesque electoral 
competition, conflict is resolved completely in favor of 
the principal. (3) A classical feature of delegation, the 


2 These summaries omit umportant details. The umplications are 
stated more precisely later. 
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ally principle, will not hold if obtaining information is 
costly and agents cannot precommit to doing so. In such 
situations the boss and her best ally face a collective 
action problem, which the principal solves by delegat- 
ing authority to an agent whose policy preferences are 
an “intermediate” distance from hers. (4) If the boss 
can select a discretion level and monitor compliance, 
the optimal level of discretion may not depend mono- 
tonically on the agent’s ideological distance from the 
boss. 


Before turning to these implications, however, we 
must first examine the “big picture”—the informational 
rationale for delegation. Though the canonical model’s 
diffusion seems to indicate a scholarly consensus about 
this rationale, this is not really so: The traditional per- 
spective on delegation provides a different view. 

The classical analysis of delegation offers a simple 
and persuasive perspective. This view was elegantly 
expressed by Alexander Hamilton ([1788]. 1961, 199- 
200) in The Federalist No. 23’s examination of defense 
policy and management: 


The authorities essential to the common defense are these: 
to raise armies; to build and equip fleets; to prescribe rules 
for the government of both; to direct their operations; to 
provide for their support. These powers ought to exist 
without limitation, because it is impossible to foresee or 
to define the extent and variety of national exigencies, and 
the correspondent extent and variety of the means which 
may be necessary to satisfy them. The circumstances that 
endanger the safety of nations are infinite, and for this 
reason no constitutional shackles can wisely be imposed 
on the power to which the care of it is committed. (italics 
in the original). 


Hamilton thus advanced a logic for transferring con- 
trol from less informed to more informed officials, an- 
ticipating that the more knowledgeable ones would 
make better choices. His view has persisted to the 
present day and is particularly prominent in the study of 
bureaucracy. Indeed, that delegating to experts or spe- 
cialists can yield superior outcomes has been conven- 
tional wisdom in public administration for most of the 
last century.‘ Translated into the language of modern 
economics, the traditional perspective says that optimal 
plans are usually state-contingent. Hence, if a principal 
is ignorant of the state of the world, then it may be 
optimal to delegate to an agent who does know the true 
state, provided that the costs of delegating (financial, 
political) are not too high. 

In contrast, in modern formal models of delega- 
tion, making outcomes stochastically better is not the 





> To avoid confusing pronouns, throughout this article the principal 
or boss ıs assumed to be female and the agents to be male. 

4 Consider these remarks from two eminent texts: “A second limita- 
tion of legislative control results from the mcreasing complexity and 
volume of governmental actrvity Legislators... are highly dependent 
on the very administrative agencies they seek to control for the m- 
formation necessary for rational choice, for recommended solutions 
to public problems, and even for the identification and description of 
the problems themselves” (Simon, Smithburg, and Thompson 1950, 
524; emphasis added). “In general, authority should be placed at 
the lowest level at which all essential elements of information are 
available” (Wilson 1989, 372). 


294 


key to delegation. Instead, the focus is on how del- 
egation can make outcomes less risky (e.g., Gilligan 
and Krehbiel 1989, 462; 1990, 536-37; Epstein 1997, 
277, 1999, 6; Epstein and O’Halloran 1999, 235).5 

This focus on risk reduction has had significant im- 
plications for other features of current models of del- 
egation, thereby affecting our current understanding 
of the phenomenon. In particular, because lowering 
risk is unequivocally desirable if and only if a prin- 
cipal is risk-averse, most current formal models as- 
sume that political principals are risk averse. (For 
an exception see Huber and Shipan 2002.) Indeed, 
they typically assume a specific degree of risk aver- 
sion, by positing a quadratic utility function. For ex- 
ample, in discussing the assumptions of the canonical 
model of delegation, Epstein and O’Halloran (1999) 
write: 


This quadratic loss function has two important implica- 
tions. First, actors will prefer outcomes that are closer 
to their ideal point than those further away; this is the 
distributive component of their utility. But actors will also 
be risk-averse, meaning that they will dislike uncertainty 
over policy outcomes; this is the informational component. 
Even though actors want outcomes close to their preferred 
policy, then, they may nonetheless be willing at the margin 
to accept a system that biases policy away from their pref- 
erences if they can simultaneously reduce the uncertainty 
associated with these outcomes. (54; emphasis added) 


The contrasts between the traditional and the mod- 
ern perspectives are clear and striking. The former as- 
serts that the informational basis for delegation is very 
simple: When optimal policies are state-contingent and 
the costs of delegating not too high, any rational, un- 
informed principal prefers to transfer authority to an 
informed agent. The latter asserts that the informa- 
tional basis for delegation is based on risk reduction. 
Recall Epstein and O’Halloran’s (1999) explanation: 
“Actors will also be risk-averse, meaning that they will 
dislike uncertainty over policy outcomes” (54). Under 
this perspective, delegation depends intimately on the 
risk preferences of principals. 

Substantively, this analysis comes down on the tra- 
ditional side. We show that what drives the decision 
to delegate in spatial models is not aversion to risky 
outcomes but simply an aversion to bad—i.e., distant— 
ones. (More precisely, the quasi-concavity inherent in 
Euclidean preferences by itself creates an informa- 
tional rationale for delegation; concavity of the loss 
function is inessential.)® 

As noted, to ensure tractability the canonical del- 
egation model typically uses three other specialized 


5 Distribution W is less risky than Z rf the latter has “fatter tails” To 
keep the comparison clean, most scholars assume a mean-preserving 
spread. Loosely speaking, one produces a mean-preserving spread 
of W by “squashing” it ifting probability mass from the 
distribution’s center to its tails For two-parameter distributions (¢.g., 
the normal or uniform), Zis a mean-preserving spread of W if it has 
the same mean and more variance. 

6 Other rationales for delegation mclude comparative advantage (re- 
garding, eg. tıme) An extension of our basic model captures this 
logic. 
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premises: (1) Both policies and outcomes are unidimen- 
sional; (2) the policy technology is stark—outcomes = 
policies + random shocks; and (3) shocks are from 
a specific distribution, usually the uniform. Whether 
these assumptions are substantively important or only 
analytically useful has received scant attention in the 
literature.’ (It is argued, however, that assuming uni- 
formly distributed shocks is just for computational con- 
venience [e.g., Gilligan and Krehbiel 1990, 536].) 

Our focus on more general models of delegation al- 
lows us to show that none of the canonical model’s spe- 
cialized premises drive the decision to delegate in polit- 
ical institutions, and only the technological assumption 
strongly affects decisions about which agents are given 
authority. Though we find that current scholarship over- 
states the importance of risk aversion, the assumption 
that outcomes = policies + random shock (especially 
the premise that the shock’s distribution is unaffected 
by the policy) turns out to be very important—a fact 
that has gone unappreciated by the literature. Relaxing 
this convenient but empirically suspect assumption can 
dramatically affect predictions about which agents are 
given authority. 

To show that we only need very general assump- 
tions about preferences, policy and outcome spaces, 
technologies, and uncertainty to understand the core of 
delegation, we construct a nested sequence of models. 
Model A is the most general. It assumes only that agents 
have Euclidean preferences over outcomes (which lie 
in N-dimensional Euclidean space) and that agents 
make policy choices (which lie in K-dimensional Eu- 
clidean space, where K need not equal N). The struc- 
ture of uncertainty and the relation between policies 
and outcomes are unspecified. Yet even at this level 
of generality we can show the validity of Hamilton’s 
insight that delegation is driven by the search for state- 
contingent plans. Hence, by using a more general for- 
mulation, we can isolate the importance of specific 
features: For example, we can identify exactly which 
results truly hinge on risk aversion (in the classical 
sense}—and which do not. Thus, we depart from and 
generalize the canonical model not merely to check the 
robustness of its results but, more fundamentally, to 
locate and correct errors in the received wisdom about 
delegation. 

Model A’ imposes an additional technological as- 
sumption: The mapping from policy to outcomes has 
the common additive form, outcome = policy + ran- 
dom shock, where the shock is independent of the pol- 
icy choice. This simple structure allows for stronger 
results. Indeed, the richness of A’’s results indicates 
that the literature has greatly underestimated the tech- 
nological assumption’s significance: We think that this 
part of the canonical model is more powerful than the 


7 Battaglini (2002), an exception to this neglect, shows that some 
of the canonical model’s premises do matter substantively. He 
proves that when the outcome and policy spaces are at least two- 
dimensional, equilibria exist m which informed subordinates reveal 
their private information to the boss. (See also Baron and Meirowitz 
2002.) This finding differs sharply from the conclusions in Gilligan 
and Krehbiel 1989 and from those ın Krishna and Morgan 2001. 


premises of risk aversion or unidimensionality, whose 
importance has been overestimated. 

We then consider model A”, which additionally as- 
sumes risk-averse decision makers. Moving from A’ to 
A” allows us to see that only some features of delega- 
tion hinge on attempts to reduce risk rather than on 
the more fundamental desire to avoid bad outcomes 
and secure good ones. 

Even the most specific model, A”, is considerably 
more general than the canonical one: It dispenses with 
specific functional forms for the utility functions (only 
concavity is presumed) and distributions of the shocks 
(only symmetry is required), and it is set in an arbitrary- 
dimensional space. Though the move to more general 
models often complicates analysis and obscures intu- 
ition, we find that for this subject the move aids un- 
derstanding. The more general analysis, by avoiding 
algebraic derivations, helps us see the relation between 
results and major assumptions. 

To demonstrate that a “spatial” theory of delega- 
tion (ie., one with Euclidean preferences) not only 
is powerful for the “big picture”—delegation’s central 
features—but also can yield detailed institutional in- 
sights, we consider some extensions to the basic models. 
These focus on three key aspects of delegation exam- 
ined by many variants of the canonical model: (1) how 
a boss induces subordinates to acquire information, 
(2) her ability to make policies contingent on the 
world’s true state, and (3) how closely outcomes reflect 
her preferences.’ 

At each level of generality we discuss not only that 
level’s results but also the omissions: what a given 
model does not imply. This comparison of results and 
assumptions clarifies which premises drive which re- 
sults. To facilitate these comparisons we derive results 
at the level with the weakest set of sufficient assump- 
tions. (For example, if a result can be established in 
model A’ but not in A, then it is presented in A’.) An 
overview of the three models and their results is given 
in Tables 1 and 2. 

Before turning to the analysis, it is helpful to for- 
malize some key decision theoretic issues associated 
with both the traditional and the contemporary per- 
spectives. Although these issues turn out to be vital in 
delegation decisions, the literature has mostly ignored 
them. 


8 This explains our sequencing of nested models: In gomg from 
model A to A’, one generates more new results by assuming the 
additive policy technology (while keeping the general preferences of 
mode] A) than by positing risk aversion (while keeping the general 
policy technology of A). 

We always allow for multiple agents but not always multiple prin- 
cipals Two reasons explain the latter. First, if issue spaces are unidi- 
mensional, then collective choice procedures (e g., majority rule) can 
ensure that one boss is prvotal Second, though recent literature has 
focused on how Congress delegates authority to the executive branch, 
delegation often occurs in standard tree hierarchies (e.g., from the 
president to the armed services), where multiple subordinates re- 
port to one supenor. Moreover, the issue of delegation confronts 
leaders in ail political systems, not just democracies. A dictator—the 
quintessential solo boss—faces hard delegation choices Our theo- 
ries of delegation should be able to accommodate different kinds of 
political institutions 
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TABLE 1. Comparing the Models 
Model A Model A’ Model A” 
Assumption 
Policy, shocks, outcomes General X= pt+o X= pt+o 
Preferences Euclidean over Euclidean over Euclidean and strictly 
outcomes outcomes concave over outcomes 
Result 
Ally abies In basic Ally principle holds. Same as A Same as A’ 
version 


Efficiency of delegation — — Not delegating Is ex ante 


Inefficient. 
increased preference conflict Boss Is worse off. Boss Is worse off. Same as A’ 
Effect on expected Effect on expected 
outcome Is ambiguous. outcome can be pinned 
down In some cases. 
Increased policy uncertalnty — Delegation set expands. Same as A’ 
Increased risk aversion = = Delegation set expands. 





Note: Entries of — denote results or effects that cannot be established at the level of generality of the model In question. 











TABLE 2. Extensions 











Extension Model A Model A’ Model A” 

Busy boss As cost of control increases, Same as A Same as A’ 

delegation set expands. 

Informationally As Imperfection increases, Same as A Same as A’ 

imperfect agents delegation set contracts. 
Notion of “ally” Is ambiguous Notion of “ally” Is clear and Same as A’ 
but dominated agent will boss faces trade-off 
not be chosen. between quallty and 
spatial proximity. 

Costly Information Ally principle may fall even If Ally principle holds tf costs Same as A’ 

gathering costs are homogeneous. are homogeneous. 

Ex ante control and ex For some costs the system ls Same as A Same asA’ 

post auditing used when no agents are In 
the delegation set. 

Compliance may be 
state-contingent. 

Discretion decreases In 
detection probability and 
allles may be harder to 
control. 

Reasserting contro! (1) With heterogeneous Same as A Same as A’ 

competence, alty principle 
may fail. (2) f delegation With costly Same as A’ 
occurs, it ls to an Information-gathering, 
undominated agent. delegation set is a 
doughnut. 
Agents can commit With two or more agents the Same as A Same as A’ 
boss gets her Ideal 
outcome (generically) 
Multiple bosses With reversion to control If policy uncertalnty Is Delegation set Is 
by a boss, delegation Is sufficiently large or nonempty and 
more likely If the voting bosses have sufficiently expands as bosses 
rule is less restrictive or homogeneous become more 
bosses have more similar preferences, then locally risk averse 
preferences. delegation occurs. 

Political uncertainty — — As uncertainty about 
preferences increases, 
delegation sst 
decreases. 
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DECISION-THEORETIC FOUNDATIONS 
OF DELEGATION 


Mapping Hamilton’s thesis—delegating can obtain su- 
perior outcomes—into modern theories of choice is 
more than translation; it also helps us to see the claim’s 
generality. Let us translate the intuitive but fuzzy no- 
tion of “superior outcomes” into its modern equiva- 
lent, first-order stochastic dominance. The gist of this 
concept, a standard one in decision theory, can be con- 
veyed by a simple example. Suppose that a person with 
an insatiable desire for a good (e.g., money) chooses be- 
tween two lotteries, Xand Y, where X = Y + 10 dollars. 
Then X is shifted to Y’s right, toward better (higher) 
values of the good; hence X first-order stochastically 
dominates Y. Hence the colloquial phrase, “It’s a no- 
brainer,” captures the situation: Any rational person 
with these monotonic preferences will take X over Y. 
This conclusion requires no further assumptions. In pat- 
ticular, the decision maker’s attitudes toward risk are 
irrelevant: Even if the stochastically better distribu- 
tion were more variable than the dominated one, (e.g., 
X=Y + Z, where Z is five or 50 dollars with equal 
probability), it would not matter because all the “risk” 
is benign. 

We can now reinterpret the traditional view as ar- 
guing that delegation produces outcome distributions 
that first-order stochastically dominate those produced 
without delegation: Delegating to qualified specialists 
will shift those distributions toward better values. Of 
course, given “political” (spatial) preferences, where 
people have bliss points and want outcomes to be 
close to ideal, the idea of first-order stochastic domi- 
nance must be used carefully. Because such preferences 
are noneconomic (satiable), “better” means “closer to 
the decision maker’s bliss point.” Hence, given spatial 
preferences it is easy to confuse first-order stochastic 
dominance with risk reduction. 

A simple example reveals the decision theoretic fea- 
tures of our analysis. Figure 1 depicts a symmetric 
single-peaked utility function with a bliss point of zero. 


Consider a lottery with two equally likely outcomes: 
x and —x. Though outcomes are random, in an impor- 
tant sense nothing of consequence is stochastic: The 
agent’s utility is the same for either realization of the 
“lottery.” This example demonstrates that theories of 
choice under uncertainty with strictly monotone pref- 
erences must be adapted when applied to Euclidean 
(or, more generally, quasi-concave but nonmonotone) 
preferences. Although variance in outcomes produces 
variance in utility when preferences are insatiable (and 
outcomes are one-dimensional), in the spatial setting 
this relation need not hold. 

Figure 1 illustrates a second important point: With 
Euclidean preferences the utility function’s curvature 
is only weakly related to choice under uncertainty. ‘The 
figure’s utility function is not concave. Indeed, for all 
neighborhoods to the left or right of the bliss point 
the function is strictly convex. Hence it does not ex- 
hibit risk aversion in the classical sense. To see this, 
consider a lottery between outcome —x and outcome 
—y. This lottery’s expected utility exceeds the utility 
of its expected value. Despite this, the utility function 
exhibits features of risk aversion given lotteries that are 
sufficiently symmetric and centered close to the ideal 
point. For example, the lottery with two equally likely 
outcomes, y and —y (where y > x) isa mean-preserving 
spread of the lottery of x and —x with equal probability, 
and the agent strictly prefers the latter. This apparent 
preference for risk reduction is equivalent to something 
more fundamental: In this setting, more risk means that 
outcomes are farther from the ideal point, hence less 
desirable. It is therefore instructive to think about lot- 
teries over outcomes in terms of the corresponding in- 
duced lotteries over utility. Our analysis shows that for 
some distributions, second-order stochastic dominance 
in the former is the same as first-order stochastic domi- 
nance in the latter.!° Moreover, when decision makers 
can choose policy to center the outcome distribution 





10 Focusing on first-order stochastic dominance regarding lotter- 
ies over utilities highlights risk aversion’s second-order character. 
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at their ideal point, they typically confront outcome 
distributions in which this equivalence holds. 

The presentation of the following text is informal: It 
focuses on conveying the intuition behind the results. 
Due to space constraints, only a few major results are 
proven in this paper (see the Appendix). For proofs of 
all results, see Bendor and Meirowitz 2003a and 2003b. 
Anything identified in boldface as a “conclusion” in the 
text is proven either in the Appendix or in the working 
paper. 


MODEL A 


We consider a boss and a finite set of subordinates, 
{1,...,m}, m > 1. The set of outcomes is called X, a 
subset of N-dimensional Euclidean space; a particular 
outcome is a vector x. Each agent, say i, has a unique 
ideal point y, in X. The utility of outcome x to agent 
i is given by a common, continuous, and decreasing 
function of the distance between x and the ideal point 
yı- Thus, preferences are Euclidean: Player i strictly 
prefers outcome x; over outcome xz if and only if x; is 
closer in Euclidean distance to his ideal point than is 
x. (Hence in two dimensions indifference curves are 
circles.) To avoid the annoyance of an indifferent boss, 
we assume that all ideal points are strictly rank-ordered 
by their Euclidean distance from the boss’s ideal 
point. 

Policies are located in a K-dimensional space (K 
need not equal N). The set of policies is P; a par- 
ticular policy is a vector p. Policies are perturbed by 
random shocks (e) to yield outcomes. In the basic 
model we assume only that the outcome distribution, 
given an uninformed decision maker’s choice of a spe- 
cific policy, is not degenerate: More than one outcome 
can arise. Thus, in contrast to most delegation mod- 
els, the shock need not be uniformly distributed; in- 
deed, it need not be continuous, symmetric, or even 
independent of the policy. We require only that to- 
gether the policy and shock create some uncertainty in 
outcomes. 

Because outcomes depend on both policy and dis- 
turbance and because weak premises about the dis- 
turbance’s distribution suffice, outcomes can be seen 
simply as random variables that depend on policies: 
x’s distribution function given policy p is F(x] p). 
In contrast to the canonical technology (outcomes = 
policies + shock), our basic model stipulates neither 
a technology nor a parametric family of shocks. In- 
stead, uninformed agents have beliefs over outcomes, 





(Recall that Von Neumann—Morgenstern utility functions are linear 
and so exhibit risk neutrality over utility lotteries ) Alternatively, one 
could define a notion of nsk aversion relative to certain lotteries. (For 
example, all decision makers with Euclidean utility functions exhibit 
risk aversion when we examine only distributions that are centered 
at the agent’s bliss point ) However, we have chosen not to use such 
a notion here. Instead, it seems more illuminating to highlight the 
fact that what seems to be classical nsk aversion is equivalent to 
a more fundamental property of Euclidean utility functions Bad 
outcomes are necessarily far from one’s ideal, and when outcome 
lotteries are centered at one’s bliss point, greater dispersion (“mak”) 
moves outcomes farther from one’s global maximum. 
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via the policy-conditioned distribution of F (x| p). We 
sometimes say that informed agents have learned the 
mapping from policy to outcomes. Explicitly, an in- 
formed agent has concentrated beliefs over outcomes 
(given policies). Since the canonical model presumes 
shocks, we sometimes describe informed agents as hav- 
ing learned the shock’s value. Explicitly, F(x | p) can be 
modeled as a function x = g(p, e) where e is some high- 
dimensional random variable. This allows for much 
flexibility in how policies and shocks generate out- 
comes. For example, the variance of outcomes may de- 
pend on which policy is chosen: In the canonical model 
this variance is constant. 

We do, however, impose some structure on the pol- 
icy technology. (The following properties—and much 
more—hold in the canonical model.) First, we assume 
that given the uncertainty about the mapping from poli- 
cies to outcomes, each agent has an optimal policy (it 
need not be unique). Second, knowing the perturba- 
tion permits complete “shock absorption”: If a deci- 
sion maker chooses a policy after he observes the dis- 
turbance, then he can always find a policy that, when 
combined with the shock, yields any desired (feasible) 
outcome. We call this perfect shock absorption. Simply 
put, the shock maps a policy (in P) into an outcome (in 
X). Perfect shock absorption means that agents who 
observe the shock can get any specific outcome, x, by 
selecting the appropriate policy, p. Thus, an agent who 
selects policy while knowing the shock will choose one 
that results in her ideal outcome. (Multiple policies may 
produce this optimal outcome.) 

All of the above is common knowledge (as is the 
game itself, below). The extensive form of the basic 
game is simple. The boss can either delegate or not. If 
she doesn’t, then she chooses a policy, which combines 
with the random shock to yield an outcome. Payoffs 
are then distributed. If the boss delegates, she selects 
an agent from the feasible set. (Recall, however, that 
this set may have only one member.) Once the agent 
is selected and given authority, he observes the shock’s 
realization and then picks a policy. Payoffs are then 
distributed." 

Throughout, we consider subgame-perfect Nash 
equilibria, which resolve the boss’s indifference (to con- 
trol or delegate) in favor of delegation. (If an agent 
is indifferent between policies at any decision node, 
we assume that he selects, from his set of optimal 
policies, one that is best for the boss, in expectation.) 
Subgame perfection is required to rule out equilibria 
that rely on noncredible play in some of the game 
forms we consider. Because none of our game forms 
involve signaling, subgame perfection is sufficiently 


11 There are two major types of delegation models m political science, 
defined by game forms: Either the principal moves first, typically by 
choosing to delegate or not (as in this paper), or the agent does, 
typically by sencing his superior a message about the state of na- 
ture. The latter (often called signaling models) are harder to analyze; 
hence to figure out what drives delegation we have chosen the for- 
mer. We believe, however, that our claim that first-order stochastic 
dominance (over utilities) is more ımportant than risk reduction for 
understanding delegation holds in signaling models; the sequence of 
moves should not affect this point. 
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restrictive: Appeals to stronger refinements are un- 
necessary. 


Results: Properties of the Equilibrium 


1. Generically, a unique equilibrium exists for each set 
of parameters. 
2. The equilibrium has the following properties. 


a. If the boss does not delegate, then she chooses a 
policy that maximizes her expected utility, given 
her knowledge of F(x | p). Because her prefer- 
ences are strictly single-peaked (no flat spots) 
and because F(x|p) is nondegenerate for all 
p, the expected utility of such a policy must be 
strictly less than the utility garnered by getting 
her ideal outcome with certainty. Because utility 
varies continuously in outcomes, this implies that 
she has certainty equivalents in outcome space: a 
set of outcomes such that she would be indifferent 
between getting any such outcome (for sure) and 
taking a policy that maximized her expected util- 
ity. Given preferences that are monotonic with 
Euclidean distance, all these certainty equiva- 
lents are equidistant from her ideal point. Since 
F(x | p) is not degenerate, this distance must be 
strictly positive. 

This set of certainty equivalents defines a dele- 
gation cutoff rule or ideological distance thresh- 
old for the boss: She delegates if and only if some 
agent is at least as ideologically close to her as this 
threshold. Call this threshold d*; anything within 
this boundary is the boss’s delegation set. 

Because delegating is optimal for the boss if her 
delegation set is not empty and because this set’s 
radius is strictly positive, even model A”s very 
general assumptions imply the possibility of ra- 
tional delegation. None of the canonical model's 
four main premises (e.g., risk aversion) is neces- 
sary. The key to delegation lies elsewhere: It is 
based on the boss’s ignorance of the state of the 
world, captured by F(x | p)’s nondegeneracy, and 
the agent’s ability to condition his policy choice 
on the true state. This combination is precisely the 
optimality of state-contingent plans emphasized 
by the traditional perspective. 

b. If the boss delegates, then she picks the agent 
whose ideal outcome is the closest to hers, in 
Euclidean distance. (We sometimes call this per- 
son the “best agent.”) Since the agent knows the 
shock’s value before choosing a policy and is a 
perfect shock absorber, he picks his induced ideal 
point in policy space, thus obtaining his ideal out- 
come. This is clearly optimal for the agent. 

Thus this ancient political principle—the boss 
picks the (ideologically) closest agent—is valid 


12 Tf the boss is indifferent between delegating (to the closest agent) 
and retaming control, the equilibrium is not unique. Due to her in- 
difference these equilibria are payoff-equivalent to the boss (but not 
to the agents). 


given very general assumptions about prefer- 
ences, shocks, and policy technology. In partic- 
ular, her risk preferences are immaterial. The 
ally principle is driven instead by the principal’s 
desire to minimize the costs of delegating by 
appointing someone with preferences as simi- 
lar to hers as possible. This satisfies first-order 
(not second-order) stochastic dominance regard- 
ing the random distance between the boss’s ideal 
point and the final outcome; hence, risk prefer- 
ences are irrelevant. 

c. If the boss delegates, then the equilibrium is ef- 
ficient, both ex ante and ex post. (It must be 
efficient: The delegatee gets his ideal outcome, 
which he strictly prefers to all others.) If, however, 
the boss controls the policy, then the outcome 
may be inefficient. For example, as model A” 
will show, if the players are risk-averse, then ex 
ante everyone would prefer that the boss get her 
ideal outcome for sure, which could be achieved 
if the boss delegated to an agent who promised 
to implement that outcome (and delivered on the 
promise), over the lottery produced by the boss 
making an uninformed choice. Because agents 
are perfect shock absorbers, if one promises to 
implement the boss’s ideal outcome, he is tech- 
nically able to do so. But the agent’s promise to 
carry out this plan is not credible, so the boss dis- 
regards it. 


Turn now to the comparative statics of preference 
conflict. Suppose that conflict increases: The ideal 
points of all agents move an equal distance away from 
the boss’s. What happens? Three possibilities arise. 
First, if no agent had been in the delegation set pre- 
viously, then nothing changes: The boss keeps the reins 
in her hands and the expected outcome is unchanged. 
Suppose, however, that before the exogenous shift del- 
egation had prevailed. Then we could get the second 
possibility—the situation changes qualitatively—if the 
best agent becomes unacceptable to the boss, so she no 
longer delegates. Third, the preference change could be 
sufficiently small so that delegation remains attractive. 
In the second and third cases the expected outcomes 
shift, but they can move in opposed directions. In case 
three it must move away from the boss’s ideal; in case 
two it can move toward it. Hence an increase in ideolog- 
ical conflict between the boss and her agents does not 
always push policy in the same direction. That greater 
preference alignment (in case two) can move expected 
outcomes away from the boss’s ideal might contradict 
conventional wisdom. 

An exogenous increase in conflict does have one con- 
sistent (and unsurprising) effect: The boss is (weakly) 
worse off the more her subordinates are outliers. More 
precisely, as the best agent’s ideal point moves away 
from the boss’s, the latter’s expected utility falls or stays 
constant. 


Conclusion: When preferences between agents and the 
boss conflict more, (1) the movement of the expected 
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outcome is ambiguous, but (2) the boss’s expected utility 
unambiguously falls. 


Extensions of the Basic Model 


The following extensions are to be taken as one-at- 
a-time modifications. (We will see how this serial ap- 
proach matters when we examine, e.g., the boss’s cost 
of effort—point 1—and, later, the subordinates’ cost of 
information-gathering, in point 3.) 


1. A Busy Boss. Delegation often arises when a boss 
confronts many issues. That can be easily represented 
here by a cost of effort of controlling, reflecting the 
opportunity cost of attending to the matter at hand. If 
the boss’s expected utility of controlling is additive in 
policy outcomes and effort, then the busier she is the 
more she wants to delegate. 


Conclusion: When the boss becomes busier the delega- 
tion set expands. 


2. Informationally Imperfect Agents. In the basic 
model agents always observe the state of nature. What 
if agents may fail to gather information? Consider a 
simple type of fallibility: After the boss delegates, with 
probability q, agent i observes the state of nature; with 
probability 1 — q, he does not. An agent who does not 
observe e must make an uninformed decision. This 
puts the boss in the worst of all possible worlds: There 
is policy bias but no compensating adaptation of the 
policy to the state of the world. Hence, the expected 
value of delegating to i is increasing in i’s competence 
(qı). Consequently, if agents are homogeneously im- 
perfect (q, = q for all i), as q falls the delegation set 
contracts: The less delegation is worth to the boss via 
increased expertise, the more ideological affinity she 
demands. 

Note that for q < 1 the delegation set need not be 
convex or connected. In fact, the meaning of “ally” is 
unclear in this setting. If all agents are perfectly com- 
petent, then the boss can ignore the location of their 
induced ideal policies: She knows that if she delegates, 
the delegatee will set policy to attain his ideal outcome, 
so only outcome proximity matters. But if shock ab- 
sorption is imperfect, then the boss must also worry 
about the location of the agents’ induced ideal policies, 
and given the generality of model A, her closest ally in 
outcome space may not be the closest in policy space. 
Consider an example where the outcome and policy 
spaces are the real line. The boss’s ideal point is 0, agent 
1’s ideal is —1 + e (with e > 0 and small) and agent 2’s 
is 1, so agent 1’s ideal is closer to the boss’s than is 
agent 2’s. Suppose that F(x | p) is as follows. If p > 0, 
then x = p+0.1orx = p — 0.1 with equal probability. 
If p < 0, then x = p+0.2 and x = p — 0.2 are equally 
likely. If agent 1 or 2 is chosen but does not observe the 
shock, he will select policies —1 + ¢ and 1, respectively. 





13 This implies the following cutoff rule for expertise. For any agent 
ı with ideal point y* (distinct from the boss’s ideal) who would be in 
the delegation set for g = 1, there is a threshold Gy > O such that ı 
is acceptable to the boss if and only 1f q = Oy. 
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Hence agent 1’s policy entails more outcome risk than 
agent 2’s. If the boss is risk-averse and € is sufficiently 
small, then the boss prefers delegating to agent 2. Yet 
this ordering is the opposite of her ordering of the 
agents’ ideal points in outcome space. This reveals that, 
because the meaning of ally is ambiguous in this setting, 
the ally principle is also unclear. 

Now consider a general increase in competence: q, 
rises for each agent. Obviously this cannot hurt the boss. 
More interesting is the effect on the expected outcome 
in equilibrium: By inducing delegation, higher compe- 
tence (like reduced preference-conflict) can move the 
expected outcome away from the boss’s ideal point. 
This point has substantive bite and differs from con- 
ventional wisdom. Agencies with more competent sub- 
ordinates may on average attain outcomes that are far- 
ther from the boss’s ideal. This points to the subtle yet 
important distinction, noted earlier, between a policy 
lottery’s expected outcome and the expected distance 
of its outcomes from the boss’s ideal point. The boss 
need not prefer a lottery whose expected value is close 
to her bliss point over one whose mean is farther away. 
This may seem perverse (Why prefer a lottery with a 
more biased mean?) or risk-averse (she accepts more 
bias in return for less variance), yet in fact it is never 
the former and need not be the latter.’4 

Since agents’ competence may vary, ability and ide- 
ology may trade off. Thus the ally principle must be 
understood as a “ceteris paribus” rule: All else equal, a 
rational boss should choose her closest ally as an agent. 
If that ally is a dolt, delegating to him is silly. 

Though the meaning of ally is ambiguous in this set- 
ting, one could use a weaker sense of the concept—a 
partial ordering over three dimensions: quality, prox- 
imity of ideal outcomes, and proximity of ideal policies 
(when choosing under ignorance). Then a weaker ver- 
sion of the ally principle holds: The Boss never dele- 
gates to a dominated agent. 

However, she never delegates to outliers—agents 
outside her delegation set given q, = 1—even if they 
were perfectly competent and allies were fools. Thus, 
although some competence is necessary for receiving 
authority, it is not sufficient. (The same holds for 
preference similarity.) As in the old communist phrase, 
to receive authority an agent must be both “red” and 
Conclusion: (1) As agent competence decreases the dele- 
gation set contracts. (2) With homogeneous competence 
the ally principle may be violated as an ally’s igno- 
rant choice may be worse than a more distant agent's. 
(3) With heterogeneous competence the ally principle 
may also be violated as the boss trades off competence 
for preference similarity. (4) A dominated agent (re- 
garding competence, proximity of ideal outcomes, and 
Proximity of ideal policies) is never picked. 





14 Recall the unidimensional example m which ignorant choice pro- 
duces y or —y with equal likelihood. The expected outcome 1s zero— 
exactly the boss’s ideal point—but this “lottery” yields, with certainty, 
an outcome of distance y from her ideal. Hence, she prefers del- 
egating to a sufficiently competent agent whose ideal is x, where 
0 <x < y, even though that yields a biased lottery. 
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3. Costly Information-Gathering. In the basic model 
agents freely observe the state of nature. But becoming 
informed is usually costly. So consider a model similar 
to that of Gilligan and Krehbiel (1987): An agent, i, who 
pays a cost c, > 0, observes the true state; otherwise, i 
stays uninformed (unspecialized). Utility is additive in 
policy and the cost of getting informed. 

Allagents have cutoff rules: “Specialize if and only if 
c < c*,” where the threshold c* solves an indifference 
condition. Hence the boss may not select the closest 
agent: That one may have high specialization costs and 
may elect to remain uninformed, and an uninformed 
agent is useless. Instead, if she delegates, the boss will 
pick the closest agent of those who would specialize. 

Even if c, is a constant, agents may have differ- 
ent cutoff rules on the specialization decision because 
the benefits of acquiring information may vary. That 
is, given the generality of the F(x |p) formulation, 
different delegatees might face different amounts of 
uncertainty associated with their ignorantly selected 
policies. 

Obviously, if everyone’s c, is sufficiently large, then 
all agents prefer to stay ignorant. Anticipating this, the 
boss won’t delegate. But low specialization costs can- 
not make a preference outlier acceptable to the boss. 
Suppose that initially c, is so high that i would stay 
ignorant if given authority. Then the boss will not del- 
egate to i, regardless of his policy preferences. Now 
let c, fall, so that i would specialize if appointed. Nev- 
ertheless, if i is such an outlier that he is not in the 
delegation set, then this parametric shift cannot bring 
him into the boss’s favor. As with competence, “suffi- 
ciently low” information-gathering costs are necessary 
but not sufficient for an agent to be picked. The boss 
wants subordinates who are both spatially proximate 
and informationally efficient. The latter ensures that 
inducing specialization is feasible; the former, that it is 
valuable (to the boss). 


Conclusion: (1) As information-gathering costs rise the 
delegation set contracts. (2) Even with homogeneous 
costs the ally principle need not hold because the gains 
of acquiring information may vary across agents, mak- 
ing proximate agents less willing than distant ones to 
specialize. 


Contro! Systems 


4a. Ex Ante Controls and Ex Post Auditing. The 
boss may be able to exert some control over agents, 
by limiting their discretion ex ante and enforcing that 
with ex post auditing and punishments (e.g., Huber 
and Shipan 2002 and McNollGast 1987). We follow 
Huber and Shipan(H-S) in assuming that (1) reducing 
an agent’s discretion is costly for the boss and (2) this 
cost increases strictly and continuously the more dis- 
cretion is reduced. The cost affects the boss’s utility 
additively. 


15 Although these considerations affect agents directly only if the 
boss delegates authority, they also influence her decision to delegate 
m the first place. 


The sequence of moves is as follows. The boss moves 
first, either delegating or not. If she sets policy, every- 
thing is as it was in the basic model. If she delegates, she 
can limit the agent’s discretion: An outcome is accept- 
able if and only if it is within the “discretion set,” Le., the 
outcomes that are within d of her ideal point. (H-S as- 
sume that discretion pertains to policies, not outcomes. 
Both assumptions are reasonable; they describe differ- 
ent programmatic environments. Wilson [1989, 158-71] 
examines how policy and outcome observability varies 
empirically.) The bigger d is, the more discretion the 
agent has. 

The boss can choose to pay a cost c > 0; if she does, 
then the final outcome will be inspected with proba- 
bility q > 0. (For c = 0 auditing is a free “fire alarm” 
pulled by constituents [McCubbins and Schwartz 
1984].) The delegatee learns the mapping from policy 
into outcomes and sets a policy. The game ends here if 
the boss did not pay the auditing fee. If she did pay it, 
then with probability q she learns whether the outcome 
is in the discretion set. If the outcome is acceptable, the 
game ends; if not, the agent pays a fine f. If the boss 
catches the agent deviating, then she must ignorantly 
choose a policy. (This departs from the H-S model, in 
which an unspecified nonstatutory mechanism forces a 
detected shirking agent to implement the boss’s ideal 
point.'®) 

If she decides to use a control scheme, the boss must 
select an agent, a discretion level, and a detection prob- 
ability. For a fixed control scheme and random shock, 
the agent’s choice is simple: Either he can select, of the 
acceptable outcomes, the one closest to his ideal or he 
can get his own ideal outcome and risk detection. 

Despite their differences, in both the H-S and our 
model the subordinate’s optimal compliance decision 
may be state contingent: He may comply for some, but 
not all, of the shock’s values. In the H-S model, if the 
agent is caught disobeying, then the final outcome is 
the boss’s ideal. Hence for the agent the policy conse- 
quences of being caught do not depend on the shock’s 
value. However, the consequences of disobeying do de- 
pend on the shock if he goes free. 

In our model the boss chooses ignorantly, so a de- 
tected deviating agent does incur state-contingent im- 
pacts. Because the agent finds out the disturbance’s 
value before deciding whether to comply, the compli- 
ance decision is also state-contingent: Complying is 
optimal for some shocks but not others. (For exam- 
ple, disobeying is unattractive for an extreme perturba- 
tion: If the agent is caught, then the boss’s uninformed 
choice—insensitive to the shock—will produce an out- 
come that is awful for the agent. But if the shock would 
distort the boss’s policy selection toward the agent’s 


16 We drop their premise for three reasons. First, the nonstatutory 
mechanism ın H-S is a vital part of the model. Hence, it should be 
modeled explicitly, as a set of rational actors with specified moni- 
toring capacities Second, if catching a deviating agent means that 
the boss gets her ideal outcome, then she has a perverse incentive to 
induce agents to shirk. Third, if an auditor (part of H-S’s nonstatu- 
tory mechanism) sees the unacceptable outcome but not the policy, 
then agents may have an mcentive to distort policies, to confuse the 
auditor. 
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bliss point, then being caught is not so bad.) Hence in 
equilibrium the boss, having observed that the agent 
deviated, must update her beliefs about the shock." 
Thus, although the agent’s and Boss’s decision prob- 
lems differ in these two models, in either case com- 
pliance may depend on the shock. This helps explain 
why even optimal control schemes may only sometimes 
induce compliance: The agent’s private information af- 
fects the relative costs of submitting to a control system 
(and systems that always induce compliance may cost 
too much). In contrast, in other models compliance is 
not state-contingent and so is binary: The agent always 
complies or never does. (If the agent won’t comply, then 
installing a control system is stupid, so such models 
imply that if controls are used, then the agent always 
complies.) 


Conctusion: Even under optimal control schemes an 
agent may, depending on the shock’s realization, choose 
to comply but, at other times, intentionally disobey. 


Although selecting a system that induces compliance 
for all possible shocks may not be optimal, a tightest 
control system—that the agent will invariably obey— 
always exists. This system is characterized by a discre- 
tion level dm for which agent m is willing to enact an 
outcome exactly d, from the boss’s ideal, for every 
shock, rather than disobey and risk detection.!® Though 
the boss may not want to bind the agent so tightly, it 
is useful to analyze how this minimum discretion level 
is affected by the control variables q and f and the 
agent’s ideal point, Ym. Intuitively, as q or f increases 
the agent becomes more willing to comply: dm falls. 
Surprisingly, the ally principle may be violated here: 
Even if y is closer to the boss’s ideal, dı may exceed d. 
This can happen because being caught deviating might 
hurt an outlier more than an ally. Suppose, e.g., that 
the fine is small: A shirking agent is punished mostly 
by losing control to the boss. The (common) loss func- 
tion is piecewise concave: Almost-flat near the bliss 
point, it plunges steeply once a specific distance from 
the ideal. (For simplicity assume that the shock barely 
affects utility.) Thus, if y} is close to the boss’s bliss 
point, then losing authority won’t hurt agent 1 much, 
whereas if y is far away, then agent 2’s losses would be 
catastrophic. Then agent 2 will be more compliant than 
agent 1. 


Conclusion: (1) The discretion level, dm, is decreasing 
in the detection probability and the fine. (2) Allies may 
be less controllable than outliers: The discretion level 
need not be increasing in the distance between ym and 
the boss's ideal. 


17 Because compliance is shock dependent, the solution concept 
should be sequential or perfect Bayesian equilibna. A boss who 
chooses policy after learning that the agent shirked will opumize 
given updated beliefs about the shock, these beliefs must reflect the 
mformation mphed by the agent’s actions. 

18 Recall that we assume that if the agent complies, then he chooses 
the final policy But in H-S, nonstatutory mechanisms make policy 
revert to the boss’s ideal with positive probability even if the agent 
complies. 


H-S (2002) address similar issues in their proposition 
1: “Increases in policy conflict... and decreases in the 
reliability of nonstatutory factors [analogous to our de- 
tection probability] always make it easier to satisfy the 
conditions under which the Politician adopts a low dis- 
cretion law as opposed to a high discretion law” (248).!9 

Whereas H-S find that increases in policy conflict in- 
duce the boss to constrain agents more, we find that the 
boss may want to give more discretion to distant agents 
than to closer ones. The counterexamples to H-S’s intu- 
itive finding (i.e., more conflict results in less discretion) 
hinge on assumptions about the agents’ preferences: 
These influence the discretion levels that agents will 
accept. As the above example shows, nonlinear loss 
functions and a richer set of shock realizations can yield 
different conclusions. 

The two models differ still more in their analysis of 
detection and its impact. In ours, the detection proba- 
bility and discretion level vary inversely: If the chance 
of detection falls, to induce compliance the boss must 
give the agent more discretion. H-S obtain the opposite 
result because they assume that even if the agent com- 
plies, nonstatutory factors affect the chance that the 
boss will get her ideal outcome. This cannot happen 
in our model, given optimal compliance (and dm > 0). 
Thus the models differ crucially in their assumptions 
about when the boss gets her ideal outcome. Both as- 
sumptions seem to be reasonable descriptions of differ- 
ent institutions. 

Plausibly, in the short run the boss can select d and g 
but not f. In Bendor and Meirowitz 2003a and 2003b 
we show that, first, if f is sufficiently big and q > 0, 
then any (d, q) pair will produce full compliance and, 
second, for any f,q where fq > 0 there is a discretion 
level d that the agent will obey. The boss’s optimal con- 
trol scheme can be designed as follows: (1) For each 
agent m select the optimal (dm, gm); (2) given these op- 
timal (d, q) pairs for every agent, pick the optimal agent 
m, (3) compare controlling this agent versus ignorantly 
choosing policy. Setting d sufficiently large or g = 0 is, 
in effect, delegating without control. Given our models’ 
level of generality, many types of institutions can be 
optimal. For example, if the cost of monitoring and the 
marginal cost of reducing d are sufficiently small, then 
the boss will install the control system and use it to 
constrain an agent; if f is also sufficiently big, she will 
get her ideal outcome with certainty. But when control 
is very costly or fines are small the boss may prefer 
delegating without imposing constraints. 

These findings suggest that when reputations matter 
(big f) and monitoring is cheap we should observe con- 
trol schemes. But tight systems (small d) are not nec- 
essarily optimal even when some monitoring is cheap. 
For tight controls to be worthwhile, the marginal cost 
of reducing discretion must be small and/or fines for 
shirking must be large. Finally, assume that the optimal 
contro] scheme induces constant obedience. Then we 


19 The topics are similar but the approaches differ. H-S examine the 
decision to select low discretion, we focus on the optimal level of dis- 
cretion that mduces full compliance (even if inducing full compliance 
is itself suboptimal). 
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assert that if f is increased, then d or q or both must 
fall. Why? The intuition hinges on this: If f goes up 
by a little, there is a correspondingly small decrease in 
q and/or d that will keep compliance optimal for the 
agent. So when f rises the boss can do strictly better: 
She can get the same outcome for less (lower q) or a 
closer one at the same cost (lower d) or both and split 
the gain. Thus, subordinates are more useful to the boss 
when she has a rich set of controls than when her only 
options are ignorant control or open-ended delegation. 


Conctusion: (1) The optimal control scheme varies with 
the costs of monitoring and punishment. (2) Tight sys- 
tems (small d) are optimal when the marginal cost of 
monitoring is low and/or the fines for shirking are big. 
(3) In an optimal control system that induces full com- 
pliance, if f rises, then d or q or both fall 


Part 2 of this conclusion is consistent with the second 
part of H-S’s proposition 1: “A low discretion law is 
adopted in equilibrium only when legislative capacity 
is sufficiently high (a is sufficiently small)” (248). The 
rest of our conclusion makes predictions about the re- 
lations between institutional properties (the ability to 
punish disobedience) and observable control decisions 
(degree of discretion and monitoring effort). 

Comparing our model to that of H-S is complicated 
for two reasons. First, the models make assumptions 
about preferences, dimensionality, policy technology, 
and shocks at very different levels of generality. Sec- 
ond, they differ on two key structural dimensions: how 
detecting noncompliance affects the final outcome and 
whether discretion pertains to policies or outcomes. 
The two types of intermediate models—those differ- 
ing from either ours or H-S’s in only one structural 
dimension—are worth investigating. Though doing so 
is beyond this paper’s scope, we offer the following 
conjecture: The agent’s compliance decision is indepen- 
dent of his private information (regarding how policies 
map into outcomes) in only one type of model—when 
discretion is over outcomes and detecting disobedience 
leads to the boss getting her ideal outcome. 


4b. Reasserting Control. A simpler form of control is 
for the boss to reassert her authority if matters go awry. 
Of course, taking control is one thing; choosing wisely is 
another. If seizing the reins just puts the boss back in her 
original position—facing the tension between control 
and expertise—then she may have gained little. 

Given, however, imperfectly competent agents (ex- 
tension 2, above), the benefits of this simple control 
become clear. Consider the following time line. As al- 
ways, initially the boss can either set policy (ignorantly) 
or delegate. If she delegates, she waits to see whether 
the agent manages to become informed. If he does, the 
boss remains passive. If, however, the agent fails—stays 
uninformed—then the boss reasserts control and sets 
policy herself. (One can easily extend this analysis by 
assuming that retaking control is costly.) True, she will 
do so ignorantly, but better that than let an equally 
ignorant subordinate choose. 

This ability to (freely) retake the reins implies that 
the delegation set is once more decisive: The boss will 


delegate if and only if some agent’s ideal outcome is 
in the delegation set. Because she can reassert control, 
the boss need not worry about whether agent i is “suf- 
ficiently” competent: If i does not observe the random 
shock, then the boss will take over. 

This control process therefore restores clarity to the 
meaning of “ally.” Since the boss will set policy if the 
agent is uninformed, that agent’s induced ideal policy 
(when choosing in ignorance) is of no concern. This 
suggests the following prediction: The ally principle 
will hold more often in institutions that allow a boss, 
upon seeing that a subordinate is making uninformed 
decisions, to retake the policy reins. 

Although the ability to reassert control implies that 
“ally” has only one meaning (proximity in outcome 
space), agents may still vary on the dimension of com- 
petence. Of course, ceteris paribus the boss prefers 
more able agents. Hence she will only select an agent 
from the efficient set of subordinates. This set is defined 
by a dominance relation. (For example, if qı = q, and 
agent i is closer to the boss, then i dominates j.) Un- 
dominated agents form the efficient set, and the boss 
ignores any agent who is weakly dominated by another 
agent, regarding competence and outcome preferences. 
Because by assumption agents’ ideal points vary in their 
distance from the boss’s, if the efficient set contains 
multiple agents, then they must be strictly rank-ordered 
by competence: The nearest agent is the set’s weakest 
member; the farthest one, its most able.” 


Conclusion: (1) When the boss can reassert control, 
she prefers delegating to any agent in the delegation 
set (regardless of competence) over ignorantly choosing 
policy. (2) If the boss delegates and competence is 
homogeneous, then she will pick the closest agent. (3) If 
competence is heterogeneous, then the ally principle can 
fail but the chosen agent must be undominated. 


5. Agents Can Commit. In the basic model agents 
cannot commit to policies or outcomes. Hence the 
boss anticipates that any agent will, if given the 
opportunity—the authority—do what is in his best in- 
terest. Suppose, however, that agents can “join the 
boss’s team” by committing to an outcome.*! (We are 
not relaxing the requirement of subgame perfection; 
we are examining subgame perfect equilibria in a new 
game form.) In this extension the agents first learn 
the shock’s value and then announce (commit to) an 
outcome.” Each agent’s strategy space is the set of 
constant functions from the state space to the outcome 
space. The agents simultaneously announce this func- 
tion, then the boss chooses to delegate or not. If she 
delegates, she picks an agent who then selects a policy 
that yields his announced outcome. Otherwise the boss 
selects policy ignorantly. As we will see, in equilibrium 


20 In the special case of homogeneous competence (q, = q for all 1) 
the efficient set is a singleton, so we recover the ally principle. 

21 Smce we are modifying the basic model serially, here we continue 
to assume perfect shock absorption. Thus committing to outcomes is 
feasible. 

2 Allowing agents to commit before they learn the state yields the 
same results 
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the delegate commits to a policy that is suboptimal for 
him. #2 

Commitment by itself can benefit the boss, even if 
there is only one agent (m = 1). Not always, of course: 
If this agent is in the delegation set, then commitment 
is useless—the agent will implement his ideal outcome, 
which he could credibly promise to do even if unable 
to precommit. But if the agent is outside the delegation 
set, then commitment changes the equilibrium dramat- 
ically. Without commitment the boss won’t delegate— 
the distributional loss is too great—leading to a risky 
and possibly inefficient choice by the ignorant boss. But 
if the agent can commit to a policy before the boss’s 
delegation decision, then the outcome is efficient: The 
agent will offer to deliver an outcome in the delegation 
set, and the boss will accept.” 

Now consider the additional benefits of having mul- 
tiple agents who can commit. With m > 1, the situation 
resembles Wittmanesque competition among policy- 
oriented candidates. (Also related are agency models of 
competiton [e.g., Ferejohn 1986].) Here, however, the 
boss is the only voter. Thus she is always pivotal, regard- 
less of the outcome space’s dimensionality. Two results 
follow. First, if the outcome space is unidimensional 
and at least two agents are ideologically heterogeneous 
(their ideal points bracket the boss’s), then with com- 
mitment the boss gets her ideal outcome in equilibrium. 
Second, in multidimensional spaces the boss gets her 
bliss point in equilibrium for “almost every” configura- 
tion of ideal points. Intuition can be gained by studying 
just two agents, say 1 and 2, with ideal points that are 
not collinear with the boss’s. Consider any outcome 
proposed by agent 1 that is not the boss’s ideal. Agent 2 
can offer an outcome (on a line linking the boss’s bliss 
point to agent 2’s) that is closer to the boss’s ideal 
point than is agent 1’s proposal and that benefits both 
the boss and agent 2. Similarly, responding to any 
promise by agent 2, agent 1 can propose a policy closer 
to the boss’s ideal. Hence, the only pair of proposals that 
are best responses to each other is two commitments at 
the boss’s ideal outcome. 

If all ideal points are perfectly collinear, then in ef- 
fect the outcome space is unidimensional. Here, if all 
subordinates are on the same side of the boss, then 
an equilibrium exists where the outcome is the ideal 
point of the boss’s ally. Since agents are purely policy 
oriented, more distant ones would not “outbid” the 
favored agent; that would hurt them. But if they are 
even slightly noncollinear, then an outlying agent can 


B Thus, if one keeps the basic game and drops subgame perfection, 
this subsection’s equilibria are not supportable, as here we require 
commitment on as well as off the equilibrium path. 

24 Because the agents have private information their announcements 
might be considered signals. If so, the right model is a mgnaling game, 
with its associated complexity. Happily, this complication does not 
anse here: The uninformed boss picks an agent after hearing the 
agents’ (binding) offers; appealing to perfect Bayesian equilibria 
does not refine the set of equilibria Because informed agents can 
precommit and the boss's action set 1s sparse, interesting signalmg 
cannot occur 

25 The delegation set has many outcomes that both prefer to the 
boss’s ignorant policy choice—a spatial “surplus.” Hence, a bargain- 
ing model is needed to pin down the equilibrium allocations 
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always find an outcome that he and the boss prefer 
to the ally’s ideal point, and so on. If the policy space 
has at least two dimensions, then perfectly collinear 
bliss points have measure zero in the space of possible 
preference profiles, so this happens “almost nowhere.” 
Significantly, these results hold even if no agent is in 
the boss’s delegation set. 

Thus, because the principal’s ability to base person- 
nel choice on credible promises about outcomes wipes 
out the agents’ informational advantage, inducing com- 
petition among agents who can precommit is a very 
powerful tool for bosses facing delegation decisions. 


Conclusion: (1) If there is just one agent and he is out- 
side the delegation set, then the ability to commit helps 
the boss. (2) With multiple agents and a unidimensional 
outcome space the final outcome is the boss’s ideal as 
long as two agents have ideal points that bracket the 
boss's. (3) With multiple agents and a multidimensional 
outcome space the final outcome is the boss’s ideal for 
almost all preference profiles. 


6. Multiple Bosses. Many political settings have mul- 
tiple bosses and multiple agents. The bosses determine, 
by some procedure or institution, which subordinates 
(if any) are authorized to make policy. One simple in- 
stitution has a primus inter pares: an empowered boss 
bı who proposes a delegation decision. A stipulated 
number of other bosses must approve this proposal. If 
no winning coalition supports it, then b; selects a policy 
in ignorance. 

In aricher bargaining model, bosses sequentially pro- 
pose delegation options. After each proposal the bosses 
vote. The first alternative supported by a winning coali- 
tion is enacted. If none is accepted, then again, b, must 
select a policy in ignorance. 

No offer may be acceptable, in either extension, for 
either uni- or multidimensional outcome spaces. (How- 
ever, if all bosses are strictly risk-averse over outcomes, 
as in model A”, a “collective delegation set” always 
exists. See A” for details.) 

Both the simple and the rich models presume that 
delegation (not bı’s policy choice) requires approval. 
Consider the set of winning (decisive) coalitions, given 
a voting rule: e.g., under majority rule, coalitions with 
over half the voters. Compare rules a and b, where a 
is more restrictive (fewer decisive coalitions). Clearly, 
under rule a weakly fewer proposals will be acceptable 
to a decisive coalition, so reaching a decision to dele- 
gate will be harder. If the empowered boss is in every 
decisive coalition—she has a veto—then delegation will 
be even less likely. On the other hand, fixing the rule 
(and so the set of winning coalitions), as the bosses’ 
preferences become more alike it gets easier to find 
an agent backed by some winning coalition. Thus in 
this model delegation is more likely when rules are less 
restrictive or principals more similar ideologically. 

While analyzing these models fully is beyond this pa- 
per’s scope, we briefly compare two institutions, each 
with simple majority rule. In one, only boss b, can pro- 
pose to delegate authority; in the other, any boss can. 
Hence, delegation will occur under institution 1 if and 
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only if a majority coalition that includes b, is better 
off delegating than having bı choose ignorantly. But 
in institution 2 no one monopolizes agenda-setting, 80 
delegation will occur if any majority coalition is better 
off delegating. Therefore we predict that delegation will 
occur more often in institution 2. 


Conclusion: With multiple bosses and reversion to con- 
trol by an uninformed boss, delegation is more likely 
when (a) the rule is less restrictive or (b) the bosses’ 
preferences are more similar. 


Part a has implications for institutions such as the 
separation of powers. When multiple branches of gov- 
ernment have veto powers, the rule is more restric- 
tive: Fewer coalitions can decisively implement del- 
egation; more can block it. Hence, all else equal we 
should observe less delegation in separation of power 
systems. 

Part b may appear intuitively obvious. Yet it contrasts 
sharply with proposition 1 of Epstein and O’Halloran 
(1999, p 75). Their result concludes, “The closer the 
preferences of the committee to those of the median 
floor voter, the less likely Congress is to delegate au- 
thority to the executive,” implying that the more ide- 
ologically similar are the political superiors (the floor 
median and committee), the less likely they are to del- 
egate. Why the difference? In their model, the com- 
mittee knows more than the floor median but less than 
the agency. The committee and floor play a cheap talk 
signaling game before the floor decides whether to del- 
egate to the agency. So proposition 1 holds for two 
related reasons. First, the committee gives the floor 
less information when they are ideologically far apart 
(a robust result in signaling games.) Second, this in- 
duces the poorly informed floor to rely on the other 
informed decision maker, the agency, by giving it au- 
thority. 

Each model focuses on properties ignored by the 
other. Our model does not examine situations in which 
a subset of bosses (a committee) is informed. Thus, 
it ignores the possibility that informed principals and 
agents might compete, regarding giving information to 
uninformed principals. By examining only unidimen- 
sional outcome spaces, however, their model ignores 
situations in which many bosses—not just the (unitary) 
floor median—make delegation decisions. Hence, it 
cannot analyze how preference conflict among multiple 
bosses impacts such decisions. 


MODEL A’ 


Model A’ is strictly more specialized than A: It uses 
all of the latter’s assumptions and adds others. The 
main new features concern the policy technology. De- 
spite this greater specificity, A’ is more general than the 
canonical model regarding utility functions (it does not 
assume risk aversion), the distribution of shocks, and 
the issue space’s dimensionality. 

Policy technology here is conventional, i.e., additive: 
outcomes = policy + noise. The noise, €, is symmet- 
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rically distributed around the zero vector.” We again 
allow for N-dimensional outcome spaces. (Given the 
technology, policies and shocks must have the same 
dimension as outcomes.) 


Comparative Statics 


1. Increased Preference Conflict. Here we can defini- 
tively conclude that if the agents’ ideal points move 
away from the boss’s, then the expected equilibrium 
outcome in two cases move in opposite directions. If 
conflict increases enough to empty a once-occupied del- 
egation set, then the boss sets policy and the expected 
outcome moves toward her bliss point. Butif the change 
is not this large, then the expected outcome moves away 
from her ideal. 


Conclusion: (1) Small increases in preference conflict 
tend to move the expected outcome away from the boss’s 
ideal. (2) Big increases tend to move the expected out- 
come toward her ideal. 


2. More Variable Shocks. If the shocks become more 
variable, via a mean-preserving spread of e, will the 
boss’s behavior be affected here? Conventional wis- 
dom does not give a clear answer: Since A’ does not 
presume risk aversion, why would more risk matter? 
The following result may therefore be surprising: If 
the uninformed boss’s optimal induced policy equals 
her ideal outcome, then a riskier shock reduces the ex- 
pected utility of not delegating. But delegating’s value 
is unchanged, for the best agent will still be picked and 
will still get his ideal outcome. (Because he picks a pol- 
icy after seeing the shock, its distribution is irrelevant.) 
Hence the boss becomes less choosy: Her delegation 
cutoff moves away from her ideal point. Thus in this 
circumstance delegation’s relative benefits rise as the 
shock becomes riskier.” So the chance of observing 
delegation must be (weakly) increasing in this techno- 
logical uncertainty. 

This comparative static reveals the significance of 
spatial preferences, independently of risk aversion. As 
a key point, this is worth explaining. Consider the fol- 
lowing example. Suppose that the boss’s utility function 
is tent-shaped (piecewise linear) in a unidimensional 
space; 78 her bliss point is 0. The shock is —x or x with 
equal odds. Given model A’ (e distributed symmetri- 
cally around zero, etc.), if the boss does not delegate, 
then setting p = 0 is an optimal choice. Now make e€ 
riskier by letting it equal +y, with y > x. This lowers 


26 Assuming that the shock’s mean 1s the zero vector causes no loss 
of generality. The distnbution is common knowledge, so a rational 
(uninformed) agent could compensate for a non zero mean by shifting 
his policy appropriately. 

77 Model A’ also implies that if the shock 1s unimodal and continuous, 
then for all Euclidean utility functions the boes’s best induced policy 
is her ideal outcome. (Proof on request.) So the comparative static 
holds for this large class of shocks, including the canonical case of 
uniform noise. 

28 We could abuse terminology by referring to tent-shaped utility 
functions as locally risk neutral since their second derivatives are 
zero “almost everywhere.” 
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the value of not delegating: Whether the outcome is 
—y or y it is farther from her bliss point. This harms the 
boss simply because she has spatial preferences; that 
her utility function is locally linear nearly everywhere 
is irrelevant. 

Thus, what drives this effect is not the second-order 
property of risk aversion but the first-order one of 
preferring near outcomes to far ones. Indeed, in this 
example a riskier shock hurts the boss even though 
fundamentally she faces no uncertainty at all: If she 
(optimally) chooses her bliss point, then she knows, 
with certainty, that the outcome will be a distance of 
y from her ideal. That it could be left or right of her 
ideal is immaterial, by virtue of Euclideanism. So it is 
more accurate to say that the parametric change makes 
determining outcomes, ex ante, harder for the boss than 
to say that it increases uncertainty. 


Conclusion: If the uninformed boss's optimal induced 
policy equals her ideal outcome, then an increase in the 
shock’s variance (risk) makes the Boss less choosy: Her 
delegation set expands. 


Extensions of A’ 


1. Imperfectly Competent Agents, Revisited. Given 
model A”s simple technology, the boss orders the 
agents’ proximity in policy space just as she does in 
outcome space. Thus, again, “ally” has a clear mean- 
ing, so if the boss delegates, then she will pick from the 
efficient set of subordinates (as this set was defined for 
model A, extension 4b). 

Even some of the efficient agents may be unaccept- 
able. Model A tells us that every agent who would be in 
the delegation set if perfectly competent has a thresh- 
old level of competence that is necessary and sufficient 
for that agent to be acceptable. Further, in model A’ 
(but not A), this competence threshold has a simple 
monotonicity property: The closer an agent is to the 
boss, the lower the threshold. This captures a trade-off 
between ideological similarity and expertise. 


Conclusion: Greater preference similarity permits less 
competent agents to be acceptable. 


This trade-off often confronts, for example, a revolu- 
tionary regime’s choice of an officer corps immediately 
after a revolution. 


2. Reasserting Control and Costly Information- 
Gathering, Revisited. If the boss can reassert con- 
trol after seeing whether an agent gathered informa- 
tion, then agents know that delegation is reversible: 
An agent keeps authority only if he specializes. Thus, if 
acquiring information is costly, the boss’s ability to re- 
take control gives her another way to create incentives 
for agents. 

Consider the simple case: Becoming informed is 
equally costly for all agents. Then for moderate costs 
of specialization the boss will not use the ally principle. 
The reason: For moderate c’s, agents who are allies will 
not specialize. (Ideologically allied agents don’t mind 
the boss’s induced ideal policy, so they free-ride on her 
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uninformed choice.) But because the boss’s ignorant 
choice hurts moderate outliers more significantly, they 
preempt this outcome by lalizi 

Of course, the boss never delegates to extreme out- 
liers. Hence, agents who are either very similar to or 
very different from the boss will not be picked. (In two 
dimensions the delegation set is a doughnut—a ring 
around a hole.) In a further extension, the boss could 
specialize, at a cost co. Bendor and Meirowitz (2003a, 
2003b) show that the delegation set can have a hole 
(exclude allies) here too. But when the boss’s cost of 
specializing is less than the agents’, the delegation set 
is empty; the boss must get the information herself or 
go without. 


Conclusion: If the boss can reassert control and 
information-gathering is costly, then the ally principle 
fails: Nearby agents aren’t selected because they won’t 
specialize, 


3. Multiple Bosses, Revisited. Suppose, as before, 
that multiple bosses (e.g., legislators) must vote on 
whether to delegate and, if so, to whom. We assume 
that when voting on delegation versus control they use 
any weakly Paretian voting rule: The entire coalition of 
principals is decisive. We do not specify what agenda 
and voting system are used to select an agent (after 
delegating) or a policy (following control). 

The following matched pair of results shows that 
what really matters is the relative magnitudes of two 
fundamental parameters: the size of the shock and 
the distances among players’ ideal points.” The for- 
mer measures how hard it is, ex ante, for principals 
to control the outcome by themselves; the latter, their 
ideological conflicts. These results also show that the 
intuition underlying comparative static (2}-how a 
mean-preserving spread of the noise can reduce the 
expected utility of not delegating—generalizes to col- 
lective choice settings. 


Conclusion: (1) If utility is unbounded from below and 
preferences are held constant, then for a sufficiently big 
mean-preserving spread of the shock, every boss strictly 
prefers delegating, to any agent, over any choice of not 
delegating. (2) If the shock’s distribution is held constant 
and the bosses’ and agents’ ideal points are sufficiently 
close together, then every boss strictly prefers delegating, 
to any agent, over any choice of not delegating.» 


Part 2 of the conclusion has implications for the anal- 
ysis of divided government in separation-of-power sys- 
tems. It states that delegation is more likely the more 
similar are decision makers’ preferences (for a fixed 


® Several scholars working on the canonical model have noted 
this mtertwining of ideal point distances and the shock’s variance 
(Epstein 1998, Gilligan and Krehbiel 1987) However, they stressed 
how a risk-averse principal gains from msk reduction. In contrast, 
Model A’ makes no use of those properties; what counts is not un- 
certainty but distance from bliss points. 

® [Bendor and Meirowitz 2003a, 2003b] shows that both results can 
be strengthened: If the bosses’ strategies satisfy a simple weak dom- 
inance condition, then delegation occurs in every subgame-perfect 
equilibrium. 
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distribution of shocks). So on average one would expect 
unified governments to delegate more. 

These results are much stronger than merely saying 
that the bosses’ Pareto- set consists only of choices 
involving delegation. Under either condition, every 
delegation choice—authorizing any agent to imple- 
ment policy—is unanimously preferred to any choice 
in which the bosses retain control. (Typically, of course, 
the bosses will disagree about which agent should 
be picked. But Bendor and Meirowitz [2003a, 2003b] 
show that if a reasonable dominance criterion guides 
the bosses’ voting behavior, then they will vote unani- 
mously to delegate.) 

As with a single boss, what is central here is how 
far—in any direction—outcomes are from ideal points. 
When shocks are big, compared to the ideological bias 
produced by delegating, holding the policy reins is 
worse for all bosses than is delegating to a biased ex- 
pert who will know the state of the world before se- 
lecting a policy. This jibes with the traditional public 
administration view of delegation: When uninformed 
principals cannot control outcomes within “tolerable” 
limits, then giving authority to informed agents is best. 
Thus, that the classical view slighted risk aversion was 
appropriate: Al rational bosses prefer to delegate in 
the above circumstances. Risk aversion is not driving 
this response. 

The conclusion’s two parts are closely related to 
Epstein and O’Halloran’s (1999, 75) other two propo- 
sitions. Part 1 is similar to their, proposition 3, which 
states, “The closer the preferences of the president 
to those of the median floor voter, the more likely 
Congress is to delegate authority to the executive.” 
Part 2 of our conclusion is similar to their proposition 2: 
“The more uncertainty associated with a policy area 
the more likely Congress is to delegate authority to the 
executive.” 

However, our model does not merely reproduce their 
reasonable results; it generalizes them significantly. Be- 
cause their model presumes unidimensional policy and 
outcome spaces, the delegation decision is reduced to 
the choice of a unitary actor, the floor median. In con- 
trast, because our model allows for multidimensional 
spaces, it confronts the possibility of multiple bosses 
and ensuing collective choice issues. 


MODEL A” 


We have argued that the modern perspective on delega- 
tion has overestimated the importance of risk reduction 
and its partner, risk aversion. But we have not claimed 
that they do not matter. That would be silly. Just as 
most people want to reduce risk when preferences are 
insatiable (most humans prefer more money to less but 
they also buy insurance), so too it can matter for the 
satiable preferences represented by bliss point utility 
functions. To show how risk reduction and risk aversion 
affect delegation choices, we now assume that utility 
functions are strictly concave. To allow us to use the 
Arrow—Pratt measure of absolute local risk aversion on 
the loss function (and so to do comparative statics on 


the degree of local risk aversion), we assume that they 
are twice-differentiable.*! The literature’s standard— 
the quadratic—is just one example of this class of utility 
functions. 

Now that all parties are risk-averse an efficiency re- 
sult is clear. 


Conclusion: If everyone is risk averse, not delegating is 


Everyone would be better off if the boss delegated 
to an agent who implemented the boss’s ideal outcome, 
but promises to do that are not credible. The agents’ 
inability to commit harms all. 


Comparative Statics: A More Risk-Averse 
Boss 


Conclusion: As the boss becomes more locally risk 
averse everywhere (i.e. the Arrow—Pratt measure of ab- 
solute risk aversion increases), the delegation set ex- 
pands. 


This is intuitive: The more averse one is to risk, the 
more one likes certain outcomes—which an informed 
agent can provide. 


Extensions of Model A” 


1. Multiple Bosses with a Status Quo. Suppose that 
a status quo policy exists, and if the bosses cannot agree 
on an agent, then they (ignorantly) implement the sta- 
tus quo policy. Define the collective delegation set as 
the set of possible agents who could be picked (under a 
fixed voting rule) by some winning coalition, given the 
status quo. 


Conclusion: (1) If the rule is weakly Paretian and the 
status quo policy is in the bosses’ Pareto set, the collective 
delegation set always exists. (2) For any weakly Paretian 
voting rule the collective delegation set expands as the 
bosses become more locally risk-averse everywhere. 


These results extend a main finding of Gilligan and 
Krehbiel (1987)}—the informational rationale for del- 
egation, given risk-averse principals—to multidimen- 
sional policy spaces, arbitrary (symmetric, mean-zero) 
perturbations, all strictly concave Euclidean utility 
functions, and any number of bosses. Their intuition 
is simple. Delegating to an agent whose ideal point 
is the reversion policy’s expected outcome yields that 
expected outcome with certainty. Thus, all risk-averse 
bosses prefer delegating to agents with bliss points close 
to the reversion policy. Reducing uncertainty is then a 
collective good, just as Gilligan and Krehbiel argue, and 


31 Given a twice-differentiable loss function, A(e), the Arrow-Pratt 
coefficient 1s the negative of the ratio of the second and first deriva- 
tives of —A(e). For two utility functions on the same domain, one has a 
higher coefficient than the second if the first’s is larger for every value 
in the domain. So agent 1 is locally more risk-averse than agent 2 if 
the negative of agent 1’s loss function is more concave (Mas-Colell, 
Whinston, and Green 1995). 
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for exactly the reason they specify: Even if the delega- 
tion outcome is no closer to any principal’s ideal point 
than is the expected outcome under no delegation, all 
the bosses ex ante prefer the former to the latter be- 
cause they dislike risk. 

However, a collective delegation set does not exist 
in general in model A’. (It doesn’t, e.g., if the bosses’ 
utility functions are piecewise convex [local risk accep- 
tance], the shock’s support is sufficiently small, voting 
is by unanimity, and the status quo policy is in the 
Pareto set.) Hence, although risk reduction and risk 
aversion are not central to delegation, at times they are 
important—and our nested models’ sequence enables 
us to pin down when and how they matter. 


2. Increased Political Uncertainty. Principals rarely 
know agents’ ideal points with certainty. Suppose that 
the boss can only estimate them (albeit in an unbiased 
way). Then delegating entails a lottery (Bawn 1995), 
and the expected utility of delegating falls as this un- 
certainty increases. If this political uncertainty is suffi- 
ciently great, then the delegation set is empty. 


CONCLUSIONS 


We try to give the political theory of delegation a foun- 
dation wider than the canonical model’s, thereby link- 
ing modern theory to classical perspectives. Following 
the latter, we have argued that bosses delegate primar- 
ily to make good outcomes more likely and bad ones 
less so; reducing risk is a second-order concern. Hence, 
whenever possible we have analyzed delegation only 
through the lens of spatial preferences (models A and 
A’), assuming risk aversion only in the most special- 
ized model (A”). This nested approach lays bare which 
premises drive different results. 

To take an important example, the ally principle—an 
ancient rule of political delegation—holds in the very 
general setting of model A. Yet it does not always hold 
in extensions of the models. When subordinates are not 
perfectly competent the principle may fail at the level of 
model A, and if the imperfection is heterogeneous, then 
it may fail in any of the models. Further, if information 
is costly, precommitment is possible, or ex post controls 
are present, the ally principle need not hold in any of 
the models. 

Our nested approach also enables us to get a better 
grasp of the canonical model’s major premises and how 
much they matter. In particular, nesting the models 
showed that risk aversion (more generally, the shape 
of loss functions) is of only secondary importance; in 
contrast, the policy technology—especially its relation 
to uncertainty—turned out to be critical. 

Regarding extensions, the analysis of institutions 
with multiple bosses, various control schemes, and exo- 
genous commitment generates insights about the 
choices of political superiors and the relevance of in- 
stitutional devices. Surprisingly, these insights do not 
come at the price of generality: They stem from models 
that are Jess restrictive than the canonical delegation 
model. 
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At the end of the day, however, our analysis pro- 
vides good news for modern (game theoretic) studies 
of delegation: Much of the canonical model’s intuition 
survives in this paper’s more general settings, and many 
types of extensions turn out to be tractable. 

This project—trying to see the heart of delegation by 
stripping away inessentials—is not done; the political 
theory of delegation can rest on a still wider base. We 
have relied on Euclideanism (symmetric preferences) 
to keep the analysis simple and to make key notions 
(e.g., “ally”) intuitively meaningful. But this reliance 
springs not from substantive concerns but from model- 
ing’s craft demands. A model yet more general than A 
could be built, discarding Euclideanism and assuming 
only that preferences are (strictly) single-peaked. Such 
a model would help us see more clearly which proper- 
ties of delegation depend only on bliss points. Further, 
because such preferences, in contrast to “economic” 
(insatiable) ones, are quintessentially political, such a 
model would also lead us to a truly political theory of 
delegation. 


APPENDIX 


A full appendix with proofs of all the results ıs given by 
Bendor and Meirowitz (2003b). 


Model A 


Assume that preferences are representable by utility func- 
tions u(x) := h(lx — y,||), where y, in the intenor of X is 
agent i’s ideal point and A: R! > R! is a strictly decreasing 
continuous function. Without loss of generality, assume that 
h(0) = 0. We also assume that no decision maker other than 
the boss has yw = 0. Perfect shock absorption means that 
for all e’ €e S and x’ e X, there exists a unique p € P at. 
x = g(p, 6’). We use superscripts to denote coordinates of 
a vector. Conditional on a policy p the outcome x is a ran- 
dom variable with distribution F(x | p). We assume that the 
integrability conditions (1) f |x’|dF(x | p) < œo for all j and 
all p e Xand (2) f h(lx — y,||)dF(a | p) > —œ for all y, € X 
andall p € Xare satisfied. We also assume that for every yw € 
X a solution to the problem maXpar f A(x — Ymll)d F(x | p) 
exists. Finally, we assume that [| ywl] < || Ym ll. 


Extensions of Model A: (5) Agents Can Precommit. 
Preferences satisfy diversity if either (i) there does not exist 
a vectors € Xat. for all m € M, Xm = AS for some Àm E RI, 
or (1i) such a vector does exist then there must be two agents 
m,m E€ Mst. for the Àm and Ay described above Àw < 0 and 
Aw > 0. 


Proposition 1. If there are at least two agents in M and pref- 
erences satisfy diversity, then in every subgame-perfect equi- 
librium at least two agents in M offer the outcome 0 (the boss's 
ideal point) and the boss accepts one such offer. Moreover, all 
such equilibria yield the boss the same utility. 


Proof. By way of a proof by contradiction, assume that 
diversity is satisfied and the claim does not hold. This implies 
that either at most one agent promises xw = 0 or at least two 
agents make this offer and the boss does not delegate to one of 
these agents. If the latter occurs, then with positive probability 
the final outcome is not 0, but since a lottery that assigns 
probability 1 to the final outcome 0 maximizes the boss’s 
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expected utility and is available to the boss, this cannot occur 
in equilibrium. This implies that it must be the case that 
we have a subgame-perfect equilibrium in which at most one 
agent announces Xa = 0. There are two subcases to consider: 
(a) No agent offers x_ = 0 and (b) exactly one agent offers 
Xm = 0. We treat these cases in tum. 

(a) Let x/ = arg miny Xml. There are three separate sub- 
sub cases that we consider to derive the contradiction for case 
a. (1) If the boss does not accept x’, which is the best available 
offer, then 1t must be the case that no offer is in D, delegation 
does not occur, and the final outcome is nondegenerate. Buta 
deviation by any agent m to offer the policy (A (V) Nym) Yes 
which is ın the delegation set D and which he prefers to the 
lottery duced by the boss’s uninformed best response, dom- 
inates m’s offer in the conjectured equilibrium, contradicting 
the assumption that no offer is in the delegation set in an 
equilibrium. (2) If the boss does accept an offer (which must 
be x by subgame perfection) and part i of diversity is satis- 
fled, then we know that the set {z € X: {z — Yall < IX — yall 
and [izi < xI} 1s not empty. This is the set of points that 
both the boss and m prefer to the policy x’. Since this set 
is nonempty there exists a deviation by m that dominates 
her offer under the conjectured equilibrium. (3) If the boss 
does accept an offer (x’) and part i of diversity 1s not satisfied 
but part ii is, then the result follows from the median voter 
theorem. 

(b) If b attains, then we know that the boss will accept this 
offer. However, since Ya +% 0 and Minwaska Xal = y > 0, a 
deviation by agent m to, say, x = (y/2 |lymll) yn results in 
delegation by the boss to m but a final outcome that m prefers 
to the outcome 0. This contradicts the fact that x,, = 0is a best 
response for m when no other agent offers 0. E 


Model A’ 


Assume, additionally, that (1) S c P = X = R", (2) g(p, 6) = 
p +e, and (3) the distribution F’ (e7) of each coordinate of € 
is symmetric about 0. 


Co ve Statics for A’: (2) More Variable 
S Random variable a with distribution function A(-) 
is a mean- spread of b having distribution B (-) if 


3 preserving 
(i) f adA-— (a) = f b(dB(b) (we denote this mean z) and (ii) 
for any 6 > 0, Sive-t>8) dA(a) > fis aos) 4B (b) with strict 
inequality if the the left-hand side is not zero. Define the real- 


valued random vanables a* := ja — zl] and b* := Jb — zll . 


Lemma 1. If random variable a with distribution function 
A(.) is a mean-preserving spread of b having distribution B (-), 
then the distribution functions Æ (-) of a*and B*(-) of b* satisfy 
the ordering A*(-) < B*(-) wuh strict inequality on the intenor 
of the union of the supports of a* and b*. 


Proof. Assume that a 1s a mean-preserving spread of b. By 
definition we have fip,_.s) 4A(@) = fy goa, 4B (b) for any 
8 > 0 with strict inequality if the left-hand side is nonzero. 
But by definition A*(5) = 1 — fy, gq.) 4A(@) and B*(8) = 
1— fay asa) 4B (b) for ë > 0. Thus A*(-) < B*(-) with strict 
inequality on the interior of the union of the supports of a* 
and b*. a 


Proposition 2. Assume the assumptions of A’, that a is 
a mean preserving spread of b and that under either lottery —z 
is an optimal policy for the boss, then under random shock a 
the delegation set is a strict superset of the delegation set for 
shock b. 


Proof. By assumption pt = pr = — f ad A(a) = 
— f bdB(b) = —z. Thus, the expected utility to the boss 
from not delegating in each case is f h(la — zł) dA (a) = 
fh(at)dA(a*) and fh(ib-— zli)adB (b) = f h(b*)d B*(b*). 
But, by the previous lemma we have A*(-) < B*( ) with strict 
inequality on the interior of the union of the supports of 
a* and b*. Since A(.) is strictly decreasing it is well known 
that a (first order) stochastically dominated random variable 
is preferred to the random variable that dominates it, s0 
that we have fA(a*)dA*(a*) < f A(b*)d B*(b*). But, this 
means that the boss’s expected utility from not delegating 
is lower under a than under b. Since the utility to delega- 
tion is the same under a and b the result follows. | 


Extensions to A’: (3) Multiple Principals. We denote 
the set of bosses by L. Assume the following (1) In the ini- 
tial period the bosses simultaneously vote whether or not to 
delegate. A weakly Paretian voting rule is used to determine 
whether or not delegation occurs. (2) If the decision is not 
to delegate, then a finite period Baron—Ferejohn bargaining 
game ensues in which a policy p € P is chosen without knowl- 
edge of s. (3) If delegation is chosen, then a finite period 
Baron—Ferejohn bargaining game ensues over to which agent 
to delegate. 


Lemma 2. The subgame that begins with a decision to del- 
egate has a subgame-perfect Nash equilibrium in weakly un- 
dominated strategies. 


Proof. We establish the claim by backward induction. 
At any history in which delegation has occurred to agent 
m and e is revealed, the best response for agent m is to 
select policy ym —s. Given this, the decision to delegate 
to any m results in a degenerate lottery of attamning out- 
come yw with probability 1. Since the bosses have com- 
plete, reflexive, and transitive preferences over this set of 
outcomes, the finite bargaining game over these outcomes 
has a subgame-perfect equilibrium in weakly undominated 
strategies. | 


Given a multiple-boss game, a strategy profile satisfies 
condition a if at any information set in which any agent i 
has a binary choice between action a and action b in which 
the expected utility resulting from every profile of strategies 
that involve the choice of a by agent i is less than the ex- 
pected utility resulting from the play of best responses by 
agents at histories following i’s choice of b, agent i plays 
action b. 


Pro R 3. For fixed boss and agent ideal points, there 
exists a distribution F(-) of the shock s for which any strategy 
profile satisfying condition a results in unanimous votes to 
delegate by the bosses. 


Proof. The proof is constructive. Given fixed boss and 
agent ideal points and arbitrary distribution F(-) we con- 
sider L translated games constructed as follows For boss 
l € L the /-translated single-boss game has shock &' = ¢ — Jı, 
agent ideal points yh = ym — J foreach m € M, and boss ideal 
point 0. Clearly, each /-translated game satisfies the assump- 
tions of the basic game of model A’. In each /-translated 
game either all agent ideal points are in the delegation set 
or they are not. If not, then by the msk-comparative static 
we can take mean-preserving spreads of s until the dele- 
gation set covers all agent ideal points in the /-translated 
game. Thus, a mean-preserving spread of s’ exists for which 
all agent ideal points are in the delegation set. Denote this 
shock ¢'+. Now, in the multiple-boss game with random shock 
s!+ + y the boss with ideal point y prefers to delegate to any 
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agent than to select policy herself (facing uncertainty about 
shock s+ + y). Repeating these operations for each }' € L/I 
we can construct mean-preserving spreads of «/+ + y such 
that every boss in L prefers delegating to any agent in M to 
selecting policy herself (under uncertainty). Since the best 
expected utility that a boss can attain in the nondelegation 
subgame is from selecting policy herself, this implies that if 
a strategy profile satisfies condition a, then delegation must 
occur. E 
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Athenian Democracy and Legal Change 
MELISSA SCHWARTZBERG The George Washington University 


ancient Athenians regarded their ability to modify their laws as a fundamentally democratic 
i trait; indeed, the faculty of “pragmatic innovation” was well known throughout the Greek world 
and was widely viewed as a key advantage that Athens had over its rival, Sparta. The Athenian 
commitment to legal change endured despite disastrous consequences at the end of the fifth century, 
a comprehensive revision of the laws, and the complication of legal procedure in the fourth century. 
In an apparent paradox, however, the Athenians also used “entrenchment clauses” to make certain laws 
immutable. Through analysis of entrenched laws and decrees, itis shown that the innovativeness that made 
Athens enviable also made it a difficult ally; entrenchment enabled the Athenians to make its commitments 
more credible. Although today entrenchment is typically used to protect crucial constitutional provisions, 
such as rights, in the ancient world it served a strategic purpose. 


ecent work on ancient Athens has complicated, 

and even demythologized, our understanding of 

ancient democracy. We no longer view Athens 
as a simple “direct democracy,” but as a large soci- 
ety with a variety of political institutions and sophis- 
ticated legislative procedures. Yet Athens cannot be 
assimilated to a constitutional democracy any more 
than it can be reduced to a town hall meeting; because 
Athens defies easy categorization, though, the insights 
that Athens affords into the relationship between legal 
constraints and democratic decision making are richer 
and more immediately compelling than we might have 
anticipated. 

Ancient Athenians regarded the capacity to change 
laws, and, generally, to confront contingency with new 
institutional solutions, as a defining characteristic of 
their democracy. The Athenians’ ability to respond to 
problems by modifying their laws was a source of pride, 
and was widely known throughout the Greek world. 
Although the Athenians had access to a variety of in- 
stitutions by which they could have made their laws 
inflexible, they generally chose not to do so, even in 
the face of disastrous consequences at the end of the 
fifth century. The explanation for this choice is found 
in the Athenian attachment to the progressive ideology 
of pragmatic innovation, the desirability of modifying 
institutions in light of new information or changing cir- 
cumstances, a belief that proved remarkably resilient." 


Melissa Schwartzberg 1s Assistant Professor, The George 
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1 The language of pragmatic innovation is not intended to suggest 
that the Athenians were forerunners of Peirce or Dewey—although 
some may find commonalities—but instead to affirm the specific and 


In apparent contrast to this commitment to muta- 
bility, the Athenians occasionally used entrenchment 
clauses, provisions that make laws unamendable. Why 
would the Athenians, given their attachment to change- 
able law, choose to restrict themselves in such a strin- 
gent fashion, even prescribing death for those propos- 
ing modifications? Through systematic analysis of my 
collection of Athenian decrees and laws on this ques- 
tion, I demonstrate that the Athenians used entrench- 
ment in highly restrictive contexts: in certain financial 
decrees and in alliances and treaties? Although today 
entrenchment clauses are typically seen in the consti- 
tutional context, where they serve to make individual 
rights and other fundamental institutions unamend- 
able, the Athenians used these provisions exclusively 
for narrow, strategic purposes in both the international 
and the domestic contexts, and did not extend them to 
laws regulating the democracy. 

The flexibility of Athens’ laws generated problems 
for them in the international arena. Potential allies were 
skeptical of Athenian commitments, given its propen- 
sity to change course; indeed, international relations 
scholars continue to investigate the extent to which 
democracies are capable of credible commitments (e.g., 
Gartzke and Gleditsch 2003, Gaubatz 1996, Kennan 
1996, and Reed 1997). As a consequence, the Athe- 
nians required a device to signal their seriousness to 
prospective allies and to those who might seek to ex- 
ploit informational asymmetries, such as financial of- 
ficers. The use of entrenchment to enhance Athens’ 
credibility in the eyes of potential allies, however, 
should be distinguished from the potential use of these 
devices for “precommitment” more generally (Elster 
1984, 2000; Holmes 1995). The Athenians generally did 
not use entrenchment in the latter sense as a means 
of self-binding, so that a temporary passion would not 


situational nature of Athenian decision making. In Greek, pragmata 
(pL noun) means “things,” “matters,” or “deeds” and, in the political 
realm, “affairs” or “interests” 

2 To the best of my knowledge, the only extended treatment of the 
topic appears in a 1974 article by David Lewis, a late classicist, sum- 
marized in Rhodes and Lewis 1997. Rhodes and Lewis 1997 also 
offers a catalog of decrees of the Greek states, excluding Athens, 
which includes entrenchment clauses ın its index Alan Boegehold 
also briefly discusses the use of entrenchment clauses in “Resistance 
to Change in Athens,” ın Ober and Hedrick 1996. 
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overwhelm them; the single case of such an attempt, 
I will demonstrate, resulted in the suspension of the 
entrenchment clause. Entrenchment was instead used 
as a sign of Athens’ grave intent to retain a particular 
policy, given its propensity to modify other laws. The 
need for entrenchment in the international arena seems 
to have persisted into the fourth century, suggesting 
that Athens required this additional signal despite the 
presence of procedural checks designed to make legal 
change more difficult. 

The analysis here is broken up temporally, due to 
the importance of distinguishing between the legal sys- 
tems of the fifth and those of the fourth centuries. The 
account of fifth-century institutions focuses on the cre- 
ation of the ideology of “pragmatic innovation” and 
the crucial difference between Athens and Sparta in 
this regard; it is followed by a discussion of the use of 
entrenchment in the fifth century. After a brief discus- 
sion of the revision of laws at the turn of the century, 
fourth-century legal institutions are addressed, paying 
particular attention to the philosophic conceptualiza- 
tion of legal change in the period, notably by Plato and 
Aristotle. This consideration provides the context for 
a discussion of entrenchment’s use in the fourth cen- 
tury, followed by the conclusion, which highlights the 
importance of Athens for democratic theory and for 
political science in general. By reexamining Athens, I 
suggest, two key insights for contemporary democracy 
come to the fore: the distinctively democratic quality of 
the capacity to modify law and the limited and strategic 
purpose that entrenchment may serve for democracies, 
even in the constitutional context. 


FIFTH-CENTURY INSTITUTIONS AND 
“PRAGMATIC INNOVATION” 


Although we know significantly less about fifth-century 
institutions than those of the fourth century, the general 
institutional role of the bodies is reasonably well estab- 
lished (Hansen 1991, 27-52; Ostwald 1986). Legislation 
was purely a function of the Assembly and the Council 
of 500. The Council deliberated on legislation prior 
to the Assembly and had important agenda-setting 
power. Although the council could enact decrees of 
its own on minor matters (Rhodes and Lewis 1997, 13), 
the major legislative function of the Council, known 
as “probouleusis” by modern classicists, can be sepa- 
rated into two types: open and specific probouleumata 
(pl. noun). In an open probouleuma (sing. noun), the 
Council instructs the Assembly to debate a certain issue 
and, should it be deemed necessary by the Assembly, to 
pass a decree. A specific proboulewma was, in essence, a 
bill, subject to amendment, if necessary, and to ratifica- 
tion by the Assembly. The creation of probouleumata 
was passed by a single simple-majority decisive vote. 
The 500 members were composed of 50 from each of 
10 tribes, chosen by lot from those who put themselves 
forward within the tribe (ho boulomenos, or “the one 
who wishes”). These groups of 50 took turns serving 
as the prytany (prytaneia), the committee of the coun- 
cil that performed the actual function of probouleusis. 


312 


May 2004 


Each day one member of this prytany would serve as 
chairman (epistates), and a third of the prytany would 
remain on duty for the entire 24-hour period. From the 
time of Ephialtes (c. 463/2) until early in the fourth 
century, the prytany and the chairman would preside 
at meetings of the Council and of the Assembly. Both 
citizens and noncitizens could apply to the prytany to 
bring a motion before the Council, but in general the 
Council was independent with respect to its decision- 
making process (although the strategoi served as advi- 
sors, notably at the time of the Peloponnesian War). 

The Assembly met at various points during the pe- 
tiod of a given prytany, although the precise number 
of meetings is controversial (Rhodes and Lewis 1997, 
13). All citizens—i.e., all free, adult males satisfying the 
current Athenian descent requirement—could attend 
the assembly, and any probouleuma approved by a sim- 
ple majority of the Assembly, even if it contradicted an 
existing provision, became a nomos-—psephisma (law- 
decree), a fully valid law: The hierarchical distinction 
between the two, crucial to fourth-century legal history, 
did not exist in the fifth century, and the two terms 
were used synonymously (Todd 1993, 57). Although 
the laws were publicly available on stone slabs (stelai), 
the stelai were scattered throughout the city in the fifth 
century, and the revision of prior law by enactment of 
new legislation was therefore virtually undetectable; 
to the extent that a procedure for resolving incoher- 
ence even existed conceptually, lex posteriori seems to 
have prevailed. Moreover, the absence of jurists was 
extremely important for the development and practice 
of law at Athens, especially in the fifth century: Law 
was exclusively the stuff of amateurs, and the reforms 
at the turn of the century were designed to remedy 
some of the incoherence that resulted from the lack of 
systemization (Todd 1993, 54). This aversion to profes- 
sionalism in law endured, however: Even in the fourth 
century, litigants regularly appeal to their amateurism 
(as in Demosthenes 1935, 54.17). 

The difficulty in ascertaining what the law actually 
was at a given moment was deeply problematic for lit- 
igants, and for the jury courts as a whole, naturally; 
the absence of a police force, and the reliance on 
private individuals for prosecution, lent an additional 
dimension of whim to adjudication (Todd 1993, 79). 
Pericles introduced pay for those serving in the jury 
courts (dikasteria), which Aristotle recognizes as a 
democratic move inasmuch as it includes those who 
would otherwise lose wages and would attract the im- 
poverished more generally (Aristotle 1996, XXVI, 
1998b, VI, 2); members of the Assembly were not paid 
in the fifth century, but magistrates and Council mem- 
bers were. The jury courts heard dokimasiai, prospec- 
tive reviews of one’s eligibility to hold office in the polis 
(but not one’s competence), and euthynai, retrospec- 
tive renderings of accounts, of magistrates. 

The flexibility of the law in the fifth century had the 
consequence of unpredictability, because the laws were 
changed so frequently that it was difficult to anticipate 
the consequences of actions. However, this appears to 
have been an unintended result of regular modifica- 
tions. The public placement of stelai with inscribed rules 
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would seem to have been intended to enable individ- 
uals to determine whether their behavior, or the be- 
havior of others, complied with the law and, therefore, 
to anticipate the consequences, positive or negative, 
of their actions. Certainly, there may have been var- 
ious reasons for the display of inscriptions. Through 
analysis of “formulae of disclosure,” statements on the 
inscriptions addressing the political reasons for the in- 
scriptions, Charles Hedrick (1999) demonstrates that 
the erection of honorary decrees, for example, was in- 
tended to serve as an incentive for others to emulate 
the honorees’ actions, whereas other stelai, including 
financial documents, included a phrase indicating that 
the inscription was offered “so that anyone who wishes 
can see it” (411). Although one might suggest that pub- 
licity could have been window-dressing on the part of 
the Assembly, signaling publicity as it acts in a covert 
fashion, this explanation seems at odds both with the 
enormous size of the Assembly (Hansen suggests that 
6,000 attended on average in the fourth century, al- 
though perhaps fewer during the Peloponnesian War) 
and with the Athenians’ general ideological commit- 
ment to transparency, as demonstrated by institutions 
such as the euthynai. 

The charge of graphe paranomon, by which a pro- 
poser of a decree could be prosecuted for a proposal 
deemed at odds with the legal standards of Athens— 
even if these principles were not always transparent— 
also suggests that the Athenians were not interested 
in instability for its own sake. Although graphe para- 
nomon was frequently used as a political weapon, the 
Athenians nevertheless had a strong enough belief in 
the continuity and stability of their laws over time— 
and that their laws ought to cohere with the legal and 
social order as a whole (Ostwald 1986, 135-36)—1o give 
sense to the charge of unlawful amendment. It does 
seems clear that the public display of inscriptions had 
at least the purpose of making their contents generally 
accessible to the population, and given the costliness 
of erecting stelai, there is little reason to suspect that 
the Athenians actively preferred unpredictable rules. 
So if the Assembly did not wish to make its laws unpre- 
dictable, then why did it boast such a simple mechanism 
for legal change change in the fifth century? 

Especially in contrast with the conservative Spartans, 
Athenians took great pride in their ability to con- 
front the unexpected with modified rules and institu- 
tional novelty: innovation, in the pragmatic sense of 
creating novel institutions and tools to confront new 
challenges? The dichotomy between the innovative 
Athenians and the conservative Spartans was widely 
known, and was a source of pride for Athens. Athens 
claimed to have invented virtually everything. “Even 
the method of sowing grain was supposed to have been 
passed on from Eleusis,” ancient historian Christian 


3 Saxonhouse (1996) offers a thoughtful account of the ability to 
change policies as characteristic of democracy in Thucydides in Athe- 
nian Democracy: Modern Mythmakers and Ancient Theorists (59- 
86), and Meter’s recent narrative history of Athens from the seventh 
through the end of the fifth century, Athens: A Portrait of the City 
in Its Golden Age (1993), emphasizes the importance of innovation, 
both institutional and technical, for the development of Athens. 


Meier (1993, 411) writes, and cites a late fifth-century 
song by Timotheus that highlights this boast: “I do 
not sing old songs; my own are much better. Young 
Zeus is in charge now; Cronos is dethroned” (407). 
The innovatory ways of Athens, however, were some- 
times regarded as a liability: the selection of a Spartan, 
Eurybiades, instead of an Athenian as commander of 
the Greek forces at Salamis in 480 was due to what 
was perceived as Athenian instability, which “gen- 
erated anxiety if not outright distrust among their 
allies. ... With a Spartan commander they could at least 
expect courage and decisiveness ...” (21). 

Thucydides highlights the Athenian attachment to 
innovation and the democratic nature of that commit- 
ment, contrasting it with the Spartans’ conservatism. 
The Mytilenian debate, for example, offers a look at 
the democratic character of the ability to change course. 
Although this is slightly distinct from innovation, part 
of what the Athenians understand as democracy, on 
Thucydides’ reading, is the freedom to be unbound by 
prior decisions and, given new information, to redirect. 
Encouraged by Cleon, the Athenians decide to pun- 
ish the revolt of Mytilene by slaughtering the entire 
adult male population and enslaving the women and 
children. However, by the following day, the Atheni- 
ans are struck by the cruelty of the decision, and an 
assembly is called to debate the motion. Cleon ad- 
monishes the Athenians to remain undeterred, arguing 
that the greatest weakness in a democracy is “the con- 
stant change of measures” (Thucydides 1996, 3.37.3). 
Instead, citizens should strive, in accordance with their 
rejection of expertise, to stick with good laws, which 
they know to be wiser than intellectuals: “[B]ad laws 
which are never changed are better for a city than good 
ones that have no authority, ...; [More gifted fellows] 
are always wanting to appear wiser than the laws, and 
to overrule very proposition brought forward, thinking 
that they cannot show their wit in more important mat- 
ters and by such behavior too often ruin their country” 
(3.37.34). 

For Cleon, decisiveness is crucial, whereas debate 
is both vain and ruinous; as a result, he believes 
that democracies are incapable of governing others 
(Thucydides 1996, 3.37.1). In contrast, Diodotus argues 
that reopening deliberation permits a better answer to 
be reached, assuming that debate is not hindered by 
charges of corruption. On substance, Diodotus claims 
that the decision ought to depend on expected future 
benefits, rather than on desire for revenge for earlier 
deeds; as the Mytilenean democrats did not support the 
decision to revolt, by killing them, the Athenians would 
deprive themselves of potential allies. Diodotus, and 
the ability to change course, narrowly carries the day, 
and only 1,000 members of the upper class are killed. 

Similarly, the Corinthian speech at Sparta in 432, 
which leads to the declaration of war against Athens, 
contrasts the competing modes of decision making, 
emphasizing Athenian innovation and Spartan conser- 
vatism. The Corinthians, attempting to persuade the 


4 Translations are from the revised Crawley edition (Thucydides 
1996). 
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Spartans to act against Athenian aggression, portray 
the Spartans as excessively cautious and the Athenians 
as foolhardy in their daring: 


The Athenians are addicted to innovation, and their de- 
signs are characterized by swiftness alike in conception and 
execution; you have a genius for keeping what you have 
got, accompanied by a total want of invention, and when 
forced to act you never go far enough. Again, they are 
adventurous beyond their power, and daring beyond their 
judgment, and in danger they are sanguine. (Thucydides 
1996, 1.70.2-3) 


The Corinthians describe the Athenians as persis- 
tent, and as remaining undeterred by even a series of 
defeats as they try to achieve their ends. However, they 
are restless and desirous; as soon as they accomplish 
their goal, they set another before themselves. The 
Spartans’ reluctance to act, rooted in their desire for 
peace and stability, places them in jeopardy as they con- 
front the ingenuity of the Athenians, the Corinthians 
maintain: 

It is the law, as in the arts so in politics, that improvements 

ever prevail; and though fixed usages may be best for undis- 

turbed communities, constant necessities of action must be 
accompanied by the constant improvement of methods. 

Thus it happens that the vast experience of Athens has 

carried her further than you on the path of innovation. 

(Thucydides 1996, 1.71.3) 


Conservatism is the best response to stability: The 
Corinthians do not praise innovation for its own sake. 
When circumstances remain constant, the sedentary 
Spartans may enjoy the predictability of their lives, 
marked by rigid adherence to custom. But the Atheni- 
ans have learned how to confront contingency with cre- 
ative action. The diversity of their obstacles has proved 
edifying: Progress, on the Corinthians’ reading, derives 
from encounters with the unknown and the need to 
develop new tools to conquer misfortunes. 

The belief in Spartan stability was both well estab- 
lished and enduring: one of the few good sources on 
Spartan institutions, Plutarch’s Life of Lycurgus, was 
written in the beginning of the second century of the 
common era, more than 500 years later (MacDowell 
1986, 14~22).° Although, as Plutarch ([1864] 2001) him- 
self acknowledged, “There is so much uncertainty in the 
accounts which historians have left us of Lycurgus, the 
lawgiver of Sparta, that scarcely any thing is asserted 
by one of them which is not called into question or 
contradicted by the rest” (52), the legend of Lycurgus 
was undoubtedly known to the Athenians® and affected 


5 Because of the time gap, Plutarch may not be the best source for 
the fifth- and fourth-century Athenian beliefs about the system, but 
his reliance on Thucydides, Xenophon, and Plato allow us to draw 
some infe ; moreover, MacDowell emphasizes, contra Gomme 
(1945), A Histoncal Commentary on Thucydides (1.84), that his tex- 
tual evidence was rather good. 

6 There are certainly problems with attempting to ascertain the 
knowledge of the average Athenian, but Herodotus, Thucydides, and 
Xenophon all visited Sparta, and given the context of the Pelopon- 
nesan War, it is reasonable to assume that the average Athenian 
knew something about the enemy (MacDowell 1986, 15). 
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their perceptions of the Spartans. The dating of Ly- 
curgus’ creation of the Spartan institutions is uncer- 
tain, but Thucydides (1996, 1.18.1) places it at more 
than 400 years since the end of the Peloponnesian War. 
The structure of Spartan institutions is not important 
here, but the laws to which the Spartans, as the enemy, 
adhered was a matter of considerable interest to the 
Athenians and served as a competing conception of 
legislation. 

Herodotus (1987), for one, believed that Lycurgus 
“took care” to ensure the endurance of his laws (1.65): 
In fact, Plutarch ([1864] 2001) reports that when Ly- 
curgus died in Crete, his Cretan friends followed his 
request and scattered his ashes there for fear that “if his 
relics should be transported to Lacedaemon, the peo- 
ple might pretend to be released from their oaths, and 
make innovations in the government” (80). Perhaps 
the plan worked, because the Spartans were thought 
to have been profoundly law-fearing (Herodotus 1987, 
7.104);’ in Plato’s The Laws (1980) (written in the 
350s and 340s, some hundred years later), the Athe- 
nian Stranger comments that one of the best laws of 
the Spartans is that “which does not allow any of the 
young to inquire which laws are finely made and which 
not” (634d-e). Archidamus, in the debate at Sparta, 
maintains that their steadfastness is attributable to their 

ining: “And we are wise, because we are educated 
with too little learning to despise the laws, and with 
too severe a self-control to disobey them” (Thucydides 
1996, 1.84.3). Whether these laws were unwritten is 
uncertain, but Plutarch ([1864] 2001) writes that there 
was even a rhetra against writing them down (63);° it 
appears safe to say that although the bulk of Lycur- 
gus’ laws remained unwritten, some laws were codified, 
and later laws certainly were. But that the Spartans 
had an aversion to changing their institutions, and a 
fear of their corruption, is well documented (Plutarch 
[1864] 2001, 80), extending to a ban on foreign travel 
and to the expulsion of aliens, for fear that foreigners 
might have a pernicious effect on the citizens or the 
laws. 

Whereas the Spartans wished above all to protect 
their institutions from change and undue influence, the 
Athenians prided themselves on their ability both to 
revisit their decisions and modify their institutions to 
fit their present needs and to innovate where neces- 
sary. Rather than remaining bound by custom or tradi- 
tion, the Athenians viewed change as an affirmative 
good, not as a weakness; the Athenians’ prior deci- 
sions are valuable only insofar as they help them to 
address new problems. Yet in this context the deci- 
sion to make certain laws unamendable is surprising: 
Why would they inhibit their ability to modify their 
institutions, given the importance of this mechanism 


7 For Aristotle (1988), the Spartan constitution’s stability is owed 
to its status as a mixed regime (politera) (IV, 9, 1294b13-1295a1) 

8 The “Great Rhetra,” asit is known, is generally accepted as an early 
written law. Gagarin (1986) asserts that “Sparta appears consciously 
to have rejected the use of written laws and to have relied on an 
increasing degree of control over the educational system to achieve 
[. J] authority over its citizens” (140) 
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for their self-conception? One answer may be that the 
Athenians were well aware of the costs associated with 
their innovativeness, both in the domestic context and 
in the international arena. Thucydides suggests that al- 
though the creativity of the Athenians served them well 
in war-making, the tendency toward inconstancy ulti- 
mately led to domestic strife. Moreover, he suggests 
that the Athenians were blind, perhaps willfully so, 
to the risks associated with their flexibility and were, 
therefore, surprised when their changes of direction 
led to poor consequences (Thucydides 1996, 8.1.1-4).? 
However, the use of entrenchment indicates that the 
Athenians were conscious of the pitfalls of their com- 
mitment to flexibility, particularly with respect to their 
alliances. The Athenians used entrenchment clauses 
strategically, it appears, as an occasional corrective to 
their propensity to redirect in light of new information, 
and to appease allies who might view Athens as un- 
trustworthy. 


FIFTH-CENTURY ENTRENCHMENT 


The beginning of Athenian democracy is marked by en- 
trenchment and apparent attempts at self-binding, if in 
a time-limited form: The Athenians “bound themselves 
by great oaths that for ten years they would live under 
whatever laws Solon would enact,” and Solon left the 
country for 10 years so that he could not be persuaded 
to modify the laws (Herodotus 1987, 1.29). A closer 
look at entrenchment clauses in the fifth century shows 
that entrenchment typically served a strategic purpose, 
at odds with our contemporary conception of entrench- 
ment as a device designed to protect the most salient 
or salutary provisions. In addition, the liberal usage of 
these devices today is starkly at odds with the penalties 
attached for introducing a proposal in contrast to these 
clauses, ranging from atimia (loss of citizen rights) to 
death. 

Identifying the provisions that remain flexible is 
somewhat simpler than categorizing the entrenched." 
Neither honorary nor dedicatory decrees, which tended 
to recognize a particular good deed or set of good 
deeds to the city, were ever entrenched, perhaps be- 
cause of a recognition that an honoree could commit 
an offense and deserve punishment severe enough to 
nullify the prior honors. Rules regulating offices were 
not entrenched, neither in Athens nor in other parts 
of the Greek world, although the actual case number 
is extremely small and therefore somewhat less reli- 
able; the absence of entrenchment clauses attached to 


9 I am grateful to an anonymous reviewer for drawing my attention 
to this point. 

10 The mayor sources for the decrees and laws are standard refer- 
ence works of inscriptions in translation, Fornara’s (1983) Archaic 
Tunes to the End of the Peloponnesian War and Harding’s (1985) 
From the End of the Peloponnenan War to the Battle of Ipsus, to en- 
sure comprehensiveness and consistency of translation (the author's 
Greek is not expert level), the sources were checked against Tod 1948 
(vols 1, 2) and Meiggs and Lewis 1969. The complete data set is 
available from the author, along with a concordance table. 


“constitutional provisions,” however, might hint that 
these positions, and their duties, were among those in- 
stitutions that the Athenians, at least, wished to keep 
flexible. Decrees relating to sacred matters were also 
not entrenched; whether or not they were actually 
changeable, however, is not clear. 

The most commonly entrenched provisions are those 
relating to alliances and treaties, both in Athens and 
throughout the Greek world."! Non-Athenian decrees 
offer other forms of entrenchment; for example, an al- 
liance between Elis and Heraea is bounded by a sunset 
clause, indicating that it will last for 100 years, but ad- 
ditionally prescribes that “if anyone does harm to this 
writing, whether private citizen or official or commu- 
nity, to the sacred penalty [a talent of silver] shall he be 
liable which is here written down” (Fornara 1983, 25, 
p. 29). The use of sunset clauses in this period is some- 
what ambiguous, because all peace treaties included 
sunset clauses in the fifth century (Ryder 1965, 5) and 
because the period of 100 years is often taken by schol- 
ars to mean forever. 

The explanation for the entrenchment of alliances 
and treaties may well be located in the belief by the 
Greek world that Athens could not be depended on to 
keep its promises. A passage from the Old Oligarch’s 
Constitution of the Athenians, perhaps written toward 
the end of Pericles’ career, illustrates the claim that 
democratic cities have a tendency to defect from al- 
liances: 


Further, for oligarchic cities it is necessary to keep to al- 
liances and oaths. If they do not abide by agreements or 
if injustice is done, there are the names of the few who 
made the agreement. But whatever agreements the popu- 
lace makes can be repudiated by referring the blame to the 
one who spoke or took the vote, while the others declare 
they were absent or did not approve of the agreement 
made in the full assembly. If it seems advisable for their 
decisions not to be effective, they invent myriad excuses 
for doing what they do not want to do. And if there are 

any bad results from the people’s plans, they charge that a 

few persons, working against them, ruined their plans; but 

if there is a good result, they take the credit for themselves. 

(Old Oligarch 2.17; in Xenophon 1984) 

Because assessing responsibility in oligarchies is 
easy—the number of agents is relatively small—the in- 
centive to fulfill obligations is high, lest one be punished 
personally by proalliance forces or by the wronged ally. 
However, in large democracies like Athens, determin- 
ing who is culpable is more difficult: The composition 
of the assembly changes constantly, and decisions are 
made by simple majority, allowing a substantial minor- 
ity to claim that they did not support defection and 
that they are therefore blameless. As a result, the Old 
Oligarch claims, Athens is a less reliable ally. Because a 
simple majority of the Council and the Assembly could 
revoke any decree, including those specifying relations 
between Athens and another state, potential allies were 





11 Ag Rhodes and Lewis (1997, 16) confirm. They argue that the only 
entrenched decrees in Athens are ones of alliance and treaty, which 
seems incorrect given the evidence here. 
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fearful that the Athenians might violate a treaty. Al- 
though the benefits to an ally might have made entering 
into a treaty worthwhile, even if the risk that the Athe- 
nians would later defect was relatively high, the Atheni- 
ans may have needed to signal an intention to comply, 
at a minimum. This attempt at credible commitment 
may explain the presence of entrenchment clauses in 
treaty-decrees. 

Which treaties were entrenched? Given the prob- 
lems of epigraphic evidence (Finley 1975, passim, Todd 
1993, 30-44), it is impossible to know for certain, but 
investigation of the extant decrees indicates that al- 
liances with Rhegium and Leontini almost certainly 
were; although modification of the law is not subject toa 
penalty, the references to the alliance lasting “forever” 
are intended to be taken literally and ought, therefore, 
to be counted as entrenched (Rhodes and Lewis 1997, 
524). 

The decrees read as follows, as translated by 
Fornara. Athenian Alliance with Rhegium (433/2) 
(Fornara 1983, 124, pp. 124-125): 


[Alliance shall be made] between the Athenians and [the 
Rhegians. The oath] shall be sworn by the Athenfians 
so that everything will be] trustworthy and guileless and 
s[traightforward on the Athenians’ part,] forever, in rela- 
tion to the Rhegians. [They shall swear the following:] ‘As 
allies we shall be trustworthy [and just and] steadfast and 
reliable [forever to the Rhegians and] we shall provide 
them with aid if [.] [- - -]’. 


Athenian Alliance with Leontini (433/2) (Fornara 1983, 
125, pp. 125-126): 


Alliance shall be made between the Athenians and 
Leontinians and the oath shall be given and taken. [The 
oath shall be sworn] by the Atheni[ans as follows:] ‘As 
allies we shall be [to the Leont}inians forever [guileless] 
and reltable’. [The Leontinians likewise shall] swear: [‘As 
allies we shall forever be to the Athenians] guileless [and 
reliable.]’ 


Why were some treaties left without entrenchment 
clauses? In the first place, again, as the treaties are 
in generally poor condition, it is difficult to determine 
in several cases if they do lack entrenchment clauses. 
However, for example, the decree establishing a treaty 
between Athens and Egesta is in reasonably good 
condition (except for a lacuna), and does not have 
an entrenchment clause. The decree reads as follows 
(Fornara 1983, 81, p. 81): 


That it [be] sworn [by everyone shall be the] generals’ 
responsibility [- -] [-14-] with the oath-commissioners so 
that [- -] [-11-] This decree and the [oath shall be inscribed 
on a marble stele on the] Akropolis by the Secretary of 





12 T follow the symbol usage of the Fornara (1983) volume [] “enclose 
letters or words that no longer stand in the text as it survives, but have 
been restored by modern scholars”, [. .] “indicates by the number 
of dots the exact number of missing letters where no restoration 1s 
attempted”; [- - -] “mdicates an indeterminate number of missing 
letters”, vacat “mdicates that an entire line or space between entire 
lines were left vacant”, lacuna “indicates that a portion of the docu- 
ment is missing”; and italics “indicate that only a part of the onginal 
word 1s extant on the document” (xxi-xni) 
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the Boule. [The Poletai are to let out the contract.] The 
Kolakretai are to provide [the money. Invitation shall be 
offered for] hospitality to the embassy of the E[gestaeans 
in the Prytaneion at the] accustomed time. Euphe[mos 
made the motion. Let all the rest be as (resolved) by the] 
Dai butin future, when [-9-} [-19-] herald shall introduce 
-14-] [- - -}. 


If, in fact, this decree lacks an entrenchment clause 
(as is commonly considered) and if, in fact, the dat- 
ing of the decree to 418/417 is correct, instead of an 
earlier alliance at 457/8 in the time of Laches,? why 
might this be? One explanation might be that the Eges- 
tacans were extremely eager to secure Athenian sup- 
port in their war with the Selinuntines and their allies, 
the Syracuseans (Thucydides 1996, 6.62), and as a re- 
sult, they were comparatively weak, from a bargaining 
perspective. The Athenians wished to conquer Sicily, 
“though they had also the specious design of aiding 
their kindred and other allies in the island” (6.6.1), and 
therefore were willing to ally themselves, although the 
need was not nearly so immediate. Since the Athenians 
preferred to keep their alliances flexible, as the discus- 
sion of the Athenian interest in flexibility showed, they 
would, in general, like to operate without the use of 
entrenchment clauses. When the Athenian bargaining 
position was weaker—a more pressing need to reach 
agreement, a more powerful potential ally—and they 
needed the allegiance of a recalcitrant state, the en- 
trenchment clause might serve to signal Athenian com- 
mitment. But when the advantage was overwhelmingly 
on the side of the Athenians, as may have been the 
case with the Egestaeans, they could frame the treaty 
as they preferred and, thus, could refuse to include an 
entrenchment clause. 

Procedures regulating tribute payment by the allies 
to the Athenians, additionally, were entrenched and 
even have a penalty attached to their modification. 
In the case of the decree relating to the appointment 
of tribute collectors (Fornara 1983, 133, pp. 149-150) 
(423 B.C.), the person conspiring to invalidate the de- 
cree (or to thwart the payment of tribute) is charged 
with treason. The decree relating to the reassessment 
of the tribute of the Athenian empire (136) (425/4 B.C.) 
prescribes the loss of citizen-rights (atimia) and the con- 
fiscation of property for those raising a motion to nullify 
the decree. Given that the allies were aware of Athe- 
nian inconsistency (based on both the simple-majority 
decision procedure itself and the unstable composition 
of the assembly), it is possible that they might have 
hoped to evade their tribute payments either by the 
nullification of the decree by a later majority or by fail- 
ure to enforce the decree. On this reading, the penalty 
provision is perhaps best read as emphatic—pay the 
tribute or else—but the purpose of entrenchment itself 
is to signal commitment to the allies. 





D That ıs, as Mattingly (1996) argues against the more conservative 
dating of Lewis and others Lewis, among many other classicusts, 
is typically inclined to date decrees including a three-bar sigma as 
earlier, but this 1s one of the most controversial points in epigraphy 
Given the Thucydidean context, the preponderance of the evidence 
(with the exception of the sigma) is ın favor of the later date 
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Some financial decrees were also entrenched, and 
even prescribe the penalty within the decree itself— 
and the penalty for proposing a change of the law of- 
ten takes the form of the death penalty. One is the 
famous Athenian coinage decree (Fornara 1983, 97), 
which bars the use of foreign coins, weights, or mea- 
sures; it entrenches a section stipulating that surplus 
from a minting operation is to go into a special fund, 
and one who proposes to do otherwise is subject to the 
death penalty.!* This might suggest a legislative check; 
given that demagogues performed the bulk of detailed 
financial work (Hornblower 1992, 122), entrenchment 
here could have served to thwart efforts at siphoning 
from the fund. Entrenchment, then, arises in a case of 
informational asymmetry that may easily lead to ex- 
ploitation: the demagogues have access to resources 
and to information that may be inaccessible to the peo- 
ple as a whole, and the use of entrenchment emphasizes 
the consequences of taking undue advantage of this 
position. 

Another source, Kallias’ second financial decree 
(Fornara 1983, 119), is somewhat less fraught with tex- 
tual ambiguity and lends itself more readily to inter- 
pretation. The decree addresses the completion of por- 
tions of the Acropolis and specifies that no other use 
of Athena’s money is permitted and proposals to do so 
will be penalized, unless an immunity vote (adeia) is 
passed. Moreover, the decree says that both the need 
for an immunity vote and the penalty are identical to 
those governing the proposal of a property tax. So it 
appears that matters related to the usage, and collec- 
tion, of funds are governed by a delaying procedure, 
indicating some general concern about risks of abuse 
in financial matters. The argument, common in pre- 
commitment models, that time delays permit cooling 
off could explain the entrenchment in these provisions, 
especially given the risks associated with financial mat- 
ters, as the next example, relating to the case of the 1,000 
talents, demonstrates. (Note, however, that time delays 
may permit publicity, which may have the perverse ef- 
fect of heating up debate.) The use of entrenchment to 
regulate matters relating to property is not restricted to 
Athens: In Halicarnassus (Fornara 1983, 70) and Locris 
(33), also, provisions relating to disputed property and 
to the settlement of new territory, respectively, are en- 
trenched. 

The final case of financial entrenchment is one that 
was actually abrogated. At the beginning of the war, 
around 431, the Athenians enacted a decree preserving 
a special fund of 1,000 talents from the money in the 
Acropolis and 100 superior triremes, which were to be 
used only in case of an enemy attack by sea on Athens. 
The decree prescribed the death penalty for anyone 


14 Traditionally, the decree has been understood as stating that those 
who propose to use foreign coins are lable to the death penalty 
However, Lewis repeatedly argued that the death penalty instead 
apphed to proposals against the fund (most notably, m Lewis 1997, 
116-30); his reading ıs probably to be trusted The dating 1s also 
controverstal, because of the use of the three-bar sigma, but the later 
dates (425/4 or before 414, instead of 450-446) seem have found 
adherents recently 


who suggested, or put to the vote, a proposal to spend 
the money for any other purpose (Thucydides 1996, 
2.24.1). This appears to be a reasonably straightforward 
case of precommitment; that is, the Athenians recog- 
nized that in the heat of war, they might be inclined 
to spend recklessly and, as such, decided to create an 
emergency fund for a genuine threat on their land, with 
a penalty of death for those suggesting to touch it. How- 
ever, not even the threat of capital punishment could 
dissuade the Athenians from modifying the Jaw in a 
moment of crisis, and as a consequence, the extent to 
which we should believe that the Athenians engaged in 
self-binding is in question. 

After the failure of the Sicilian expedition in 413, 
the Athenians saw that their enemies were prepared to 
redouble their efforts against them and, perhaps worse, 
that their allies were becoming restless). When Chios, 
the best among the allies revolted, and it seemed clear 
that the other allies were likely to follow, the Athe- 
nians panicked: “In the consternation of the moment 
they at once canceled the penalty imposed on who- 
ever proposed or put to the vote for using the thou- 
sand talents which they had jealousy avoided touch- 
ing throughout the whole world, and voted to employ 
them to man a large number of ships [. . . ]” (Thucydides . 
1996, 8.15.1). The entrenchment of this provision could, 
in fact, have exacerbated their fearfulness: The lack 
of immediate access to the funds sharpened their hys- 
teria. Had the money been readily available, they 
might have engaged in a more rational, less anxious 
deliberative process (if they had not already spent 
the talents, an admitted risk). Although the threat of 
revolution would in any event have elicited a highly 
emotional response, the feeling of being constrained, 
perhaps even trapped, might have heightened the 
response. 

Given the Athenian commitment to legal change, 
some strong motivation must have been present for 
entrenchment clauses to appear. Athens wanted to be 
capable—and to be regarded as capable—of making 
binding agreements with allies. The belief that democ- 
racies were incapable of committing themselves in- 
hibited their treaty-making ability, and so the Athe- 
nians had to send a signal to potential allies that they 
would not abrogate their agreements, via the use of en- 
trenchment clauses. Although Athens’ allies must have 
known that the possibility of abrogation remained— 
that entrenchment was “only words,” given the absence 
of penalties—the presence of entrenchment clauses in 
certain key cases suggests that they were meaningful 
both to Athenians and to the allies as an emphasis on 
their intention, given the Athenian tendency toward in- 
constancy. The Athenians also may have feared abuse 
by demagogues, and entrenching the punishment en- 
abled them to affirm the severity of the crime. Great 
ease of legislation could render them vulnerable by per- 
mitting the depletion of necessary resources, especially 
in the face of foreign threats. 

Although the Athenians believed that their strength 
derived in no small part from their ability to confront 
contingency with innovative solutions, their tendency 
to change law could have lethal consequences. The 
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capacity to reverse course took a famously tragic form 
in 406, at the trial of the generals for the battle at 
Arginusae. The Athenians had amassed what remained 
of their resources, 10 generals and 150 ships, to rescue 
their fleet at Samos, which had been captured in the har- 
bor of Mytilene. However, the rescuers, having stopped 
at the Arginusae islands for dinner, were met by the 
Peloponnesian fleet, and the Athenians were heroically 
victorious in the ensuing battle. Despite the victory, 
some ships had been lost, and so two generals and 
47 ships had been left behind to collect the shipwrecked 
sailors, while the remaining generals and ships headed 
to Mytilene to rescue the fleet. A storm derailed both 
missions: The eight generals informed the Council and 
the Assembly that they had been unsuccessful, and the 
two generals left at Arginusae returned to Athens. The 
eight generals were immediately recalled to Athens for 
the process of euthynai. 

A probouleuma was called for an immediate vote on 
the guilt of the generals, in violation of decrees govern- 
ing due process, specifying that each accused person 
was to defend himself separately and that a trial be- 
fore a jury court was required. Despite the presence 
of these laws, “the masses shouted that it was mon- 
strous for anyone not to let the people do whatever 
they wanted,” which was to summarily execute the gen- 
erals (Xenophon, Hellenica, 1.7.12; trans. in Ostwald 
1986, 444). Although the Assembly was temporarily 
persuaded to try the generals separately, after an ob- 
jection was raised by one member, they changed their 
minds and executed the generals. In part, it was this 
tension on the part of the Athenians—on the one hand, 
to enact whatever they wished whenever they chose 
and, on the other, to restrain themselves from their 
worst impulses—that the revision of laws at Colonus 
was intended to resolve. 


REVISION OF THE LAWS 


The series of revisions began in 411 following the deci- 
sion (under compulsion) by the Assembly to turn the 
democracy over to oligarchs, which Aristotle (1996, 
29.2) claims was intended to win the support of 
Persia. Despite the pressures that the Athenians were 
under, itis unsurprising that they might wish to stabilize 
the laws, given the turbulence of the late fifth century. 
The reason for the choice to inscribe the centuries-old 
laws of Dracon and Solon, rather than drafting new 
laws, is less obvious, although, again, both oligarchs 
and democrats were in agreement that their rules con- 
stituted the ancient constitution (Hansen 1991, 162). 
Inasmuch as there was debate between the democrats 
and the oligarchs over the content of these laws, for 
both informational and ideological reasons, it is rea- 
sonably clear that the motivation for the use of these 
laws was less their prescriptions than the legitimizing 
force that references to the “ancient constitution” pro- 
vided. For the oligarchs, the ancient constitution was an 
explicitly conservative institution, designed to return 
Athens to its state before the “populism” of the late fifth 
century. 
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The 30 men,” known as syngrapheis,'® were elected 
at least in part to draw up proposals derived from 
the laws of Solon—demonstrating the force of the ap- 
peal to the “ancient constitution” (patrios politeia)— 
which was perceived as the true form of Athenian 
government prior to its corruption by fifth-century 
legislation (Ostwald 1986, 371). The proposals were 
presented to the Assembly, which had been gathered in 
Colonus, about an hour and a half outside of Athens, 
and surrounded by armed guards. The distance from 
the turbulence of Athens could have served to insu- 
late the assembly from pressure of the people as a 
whole or to allow secrecy; in any case, the lower classes 
would have been unlikely to travel this far (Meier 1993, 
557; Ostwald 1986, 374). However, the presence of the 
guards could have served both to protect the assembly 
and to ensure the performance of their duties, the lat- 
ter thereby constituting an implicit threat. The remote 
location also freed them from having to address the 
pragmatic concerns of an unstable city. Toward that 
end, perhaps, the graphe paranomon, the charge that 
one had proposed an illegal decree, was immediately 
suspended for the duration of the meeting at Colonus 
(Thucydides 1996, 8.67.2). The rationale for the sus- 
pension was a return to the ideology of innovation, 
or perhaps an appeal to deliberation for informational 
purposes, in contrast, perhaps, to the political wrangling 
of the late fifth century. 

The Assembly ratified all of the proposals, which 
were generally oligarchic. They included the abolition 
of payment for public service, the election of archons 
with an appointed Council of 400, and the limitation of 
the franchise to 5,000 citizens (Thucydides 1996, 8.67.3). 
After a final proposal for the appointment of 100 ana- 
grapheis (publishers of laws) (Aristotle 1996, 30.1),!” 
the Assembly was then dissolved. After the support of 
the proposals at Colonus, mercenary foreign hoplites 
policed the streets of Athens, and the Council of 400 
needed weapons to evict the democratic Council; thus, 
it appears that the measures did not enjoy popular sup- 
port, but whether that was a cause or an effect, or nei- 
ther, of the remote location of the meeting is unknown. 

After the overthrow of the oligarchy and the return 
of democracy in the spring of 410, the Athenians em- 
barked upon a comprehensive restoration of the laws, 
conducted by appointed anagrapheis. Given the lati- 
tude that this board would have in determining the 
content of these laws, and the subsequent importance 


15 Although Thucydides writes of only 10 syngrapheis (8 67), which 
Meer (1993, 557) follows Aristotle’s account of the 30 men is con- 
sidered to be more reliable, as by Ostwald (1986, 369). 

16 Syngrapheis is derived from the verb which means 
“to compose in writing.” Ostwald (1986) writes that “we should, 
therefore, expect a syngrapheus to collect facts and materials from 
various quarters and then weld them into a coherent whole in his 
written report, and what little we know of the activities of syngrapheis 
confirms this” (415-16). 

17 Anagrapheis is derived from anagraphern, or “to write up for dis- 
play m public,” and Ostwald (1986) suggests that their responsibility 
was to prepare the final texts of laws and publish them, moreover, it 
appears they had considerable discretion (416-18). Harnson (1955) 
notes that although anagraphein means “to publish,” ıt does not nec- 
essarily mean “engrave on stone” (30). 
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of these laws, the accountability of the anagrapheis was 
essential, and as a result, they were subject to the retro- 
spective review of euthyna.'® The job of the anagrapheis 
was originally intended to take only four months, but 
the project quickly expanded from the republication of 
the Solonic laws still in force and Dracon’s homicide 
law to a broader investigation of laws in force which 
were perceived as part of the “ancient constitution.” 
As a result, the revision and the inscription of the laws 
on the Stoa Basileios (“the portico of the king archon” 
in the Agora [Hansen 1991, 163]) took until 404, when 
a second oligarchic interlude took them out of force. 

In 403, a final revision specified that the laws were 
again to be in accordance with Solon and Dracon but 
noted that “anyone who wishes” may suggest revi- 
sions; this may remind us of the suspension of graphe 
paranomon prior to the meeting at Colonus. Both the 
generally oligarchic Assembly in the earlier revision 
and the democrats engaged in the later process feared 
that the institution of graphe paranomon quashed the 
ability to improve one’s institutions, although even 
the apparently conservative institution of the graphe 
paranomon enabled the democracy—as embodied in 
the jury courts—to reconsider the decisions that the 
democracy—as embodied in the Assembly—had en- 
acted (Ober 1996, 119). Innovation had an ambiguous 
position during this period, as the rhetoric of the “an- 
cient constitution,” and the desire to preserve the laws 
of Solon and Dracon, was conservative in the restora- 
tive sense. However, a call for “good suggestions” in the 
decree enabling the revision had the clear purpose of 
introducing new institutional alternatives into the pool 
of available mechanisms. 

Two elected bodies were in charge of the revision; 
the Assembly had no role whatsoever in the creation 
of the law which was to be in force subsequently in 
Athens (Hansen 1991, 163; Ostwald 1986, 512-13). The 
laws were to be inscribed and an uninscribed law had 
no force, as the decree cited in Andocides, On the Mys- 
teries, makes clear: “A law which has not been inscribed 
shall not be employed by officials on any matter whatso- 
ever” (Antiphon and Andocides 1998, I.87). But after 
Andocides, there are no more references to the Stoa 
Basileios; instead, when a source for a law is cited, it is 
either a stele or a state archive. Hansen (1991) writes 
that “the explanation is no doubt that the new revised 
corpus of laws did not stay unchanged for many years 
after 400, and corrections became so extensive that the 
Athenians had to give up continually republishing them 
on stone. The original laws were, thereafter, written 
on papyrus and kept in the archive; some were also 
copied and published on stone, but the idea of a law- 
code stable enough to be worth engraving in marble 
was abandoned” (164). 

Although the process became more complicated, 
the ability to change law was retained. Given the ar- 
duousness of the revision process that the Athenians 
had just undertaken, one might have expected that the 





18 Harrison (1995, 30) seems to imply that this ıs yearly, but Ostwald 


(1986, 417) argues that the yearly euthyna was suspended for ana- 
graphas and were reviewed only when their job was finished. 


Athenians would have permanently entrenched their 
inscribed laws. But they did not, and the commitment 
to pragmatic innovation again provides the reason. 


FOURTH-CENTURY INSTITUTIONS 


A distinction between nomoi and psephismata shaped 
the fourth-century legislative process, called nomothe- 
sia, by which laws were both enacted and amended. An- 
docides reports a law separating nomoi, or general and 
time-unlimited laws, from psephismata, decrees that 
were specific, and/or of limited duration: “No decree 
of the Council or Assembly shall prevail over a law” 
(Antiphon and Andocides 1998, 1.87). Nomoi were 
higher norms in hierarchical relation to decrees. A 
psephisma could never contradict an existing nomos, 
and in fact a psephisma was automatically repealed if 
it conflicted with a new nomos (Hansen 1978, 324); the 
proposer of a new psephisma, if it contradicted a nomos, 
would be subject to graphe paranomon. 

The procedures for creating and changing nomoi 
were distinct from those governing psephismata. A po- 
tential psephisma was put forward by probouleuma by 
the Council (as seen in the fifth century), and the As- 
sembly had the choice of altering, rejecting, or accept- 
ing the psephisma.!® Nomoi were subject to a more 
cumbersome procedure, in which the Assembly had, 
in essence, only an originating role. Moreover, the cre- 
ation of nomoi always took the form of amending the 
existing code. The modification of a given law, then, was 
not considered a piecemeal adjustment, but a change 
to the body of legislation as a whole, with the implica- 
tion that even a small amendment altered in a way the 
nature of the corpus. 

First, a proposal was made by ho boulomenos, any- 
one who wished, to modify the code, and the Assembly 
decided whether a revision might be necessary. If so, 
five defenders of the law, who would argue that the 
law was adequate as written, would be selected. The 
institutional bias is conservative, with a strong aversion 
to novelty in legislation; this is reflected both in the 
epigraphical evidence and in the political thought of 
the time. However, the conservatism was tempered by 
the progressiveness of the mechanism; a law mandated 
that one who proposed amending an existing law was 
required to simultaneously suggest an alternative pro- 
posal (Demosthenes 1935, 24.44).” Although the pref- 
erence was against change, the mechanism encouraged 
the introduction of competing solutions. 

Once accepted by the Assembly, the proposal was 
posted before the statues of the heroes in the Agora, 
publicizing the measure and allowing ho boulomenos to 
say what he wished. The Council exercised its agenda- 
setting power to fix the program at the Assembly, in- 
cluding the submission of a probouleurma, at which a 





19 Comprehensive discussions of these ures can be found in 
Sealey 1982, passım, and m Hansen 1991, 165-77, especially Good 
pmmary sources include Demosthenes’ (1935) Aganst Tımocrates 
4.21, 24.33) and Aeschmes (2000) 3.38-3.39 
In the classical literature, this is known as the “repeal law”; see 
Hansen 1991, 166, and MacDowell 1975, 64. 
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decision to set up nomothetai (law-makers), including 
their number and salary, was made. On the day of 
the hearing, the nomothetai were selected by lot from 
among the 6,000 who had taken the oath of jurors for 
that year (the “Heliastic Oath,” as it is known by classi- 
cists, in which the jurors promised to render verdicts 
in keeping with nomoi and psephismata, or, if none 
existed, with their own sense of justice) (Hansen 1991, 
357; MacDowell 1978, 64). The number of nomothetai 
appears to have varied with the importance of the mat- 
ter: either 501, 1,001, or 1,501, at least (Hansen 1991, 
168). 

At a tnal-like meeting chaired by a nine-person 
board (proedroi and a foreman, known as epistates 
ton proedron), the law was interrogated. The pro- 
poser speaks first, followed by the five defenders, and 
the nomothetai vote on the measure. Thus, whereas 
psephismata begin with the phrase, “It was decided by 
the people” or “It was decided by the council and the 
people” (Hansen 1991, 167),”1 nomoi have instead, “It 
was decided [resolved] by the nomothetai.” 

With the new distinction between law and decree in 
the fourth century came a separation of the procedures 
for the charges of “unconstitutionality.” The graphe 
paranomon was used exclusively for proposers of de- 
crees, in the case that a decree contradicted a law or its 
enactment was procedurally invalid; the graphe nomon 
me epitedeion theinai addressed proposers of laws, in 
the case that the law violated substantive legislative 
principles or its enactment violated proper procedures. 
Both sorts of accusations could also address concerns 
about the broader democratic character of the law. The 
punishment for graphe paranomon for those convicted 
was a fine, which could lead to atimia, or loss of citizen- 
rights. The punishment for conviction of graphe nomon 
me epitedeion theinai was even more severe; in one 
case from Demosthenes (1935, 24.138), the penalty was 
death. 

The rectitude and continued validity of the laws were 
affirmed at regular intervals by two laws, known as the 
“inspection law” and the “review law.” The inspection 
law, cited by Aeschines 2000 (3.38-3.40) in Against 
Ctesiphon (Hansen 1991, 166) requires that the thes- 
mothetai, a board of six archons, review the law an- 
nually to determine whether there are inconsistencies, 
duplications, or invalid laws in force and, if so, to post 
them for the people, who may call nomothetai to rectify 
the situation. Under the review law, at the first Assem- 
bly meeting of the year, the entire law-code was put 
forward for evaluation. If the people chose to reject 
any section, any citizen could propose a change, and 
the mechanism for nomothesia, with the appointment 
of nomothetai, operated as above. The review law, in 
contrast to the institution of the defenders of the law, 
is strikingly progressive. By opening the laws to whole- 
sale revision on a yearly basis, and piecemeal revision 
as needed, the Athenians reaffirmed their commitment 
to legal flexibility. 

MacDowell (1978) notes, “Presumably it was felt that 
the old method, by which a simple majority vote at one 





21 The formulation in Harding 1985 is “Resolved by ....” 
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meeting of the Boule and one meeting of the Ekklesia 
was enough to abolish any existing law, however fun- 
damental, or to make a new one, however drastic, was 
one of the things which had made revolution easy in 
404. What was needed was a more careful check on 
legal changes before they took effect” (48). Although 
too-frequent changes were certainly destabilizing, not 
even Plato argued that legislation ought to remain un- 
changed. Given the comprehensive revision of laws 
that had just been completed, the Athenians might 
well have wished to forbid their laws to be amended. 
The process was turbulent, begun by oligarchs and in- 
terrupted via another coup, and the Athenians must 
have relished its conclusion. Nevertheless, they chose 
to preserve the ability to modify their laws, and the 
commitment to flexibility in the fourth century again 
derives from the concern that law ought to be able to 
contend with changing circumstances. 

The belief that law ought to be flexible on the grounds 
of improvability (and overgenerality) is not simply im- 
plicit institutionally but is addressed explicitly in the 
philosophical accounts of law in this period. Although 
Plato and Aristotle both reject regular modifications 
on “legitimacy” grounds, each defends the capacity 
to change law, although in general Plato is far more 
skeptical than Aristotle about the advantages of law. 
Ideally, for Plato ([1957] 1992), as argued in Statesman, 
the true statesman rules without a need for law, but 
given the difficulty of finding such a person, the second- 
best option is rule by good laws in the form of a written 
code or by “laws that are unwritten but embody ances- 
tral customs” (295a, 297e). The dissatisfaction with law 
derives from its necessarily wide scope, and its con- 
sequent inability to properly specify what is best for 
each member of the community (294a-b). Moreover, 
the unstable nature of human existence is at odds with 
permanent rules (294b). This does not mean, however, 
that the laws should keep apace with these changes, or 
that anyone who wishes ought to be able to persuade 


. the city to adopt new laws: The penalty for acts contrary 


to the laws, or for modification of these laws, ought to 
be death or other severe penalties (297e). The change 
of the laws weakens the force of the laws as a whole 
and, as such, is a step toward vice. 

The Athenian Stranger offers a scathing satire of de- 
cision making under the Athenian Assembly, in which 
a rule is made permitting anyone who wishes to advise 
the Assembly on navigation and medicine, two typical 
Platonic examples of spheres in which expertise is cru- 
cial. The decrees that the Assembly enacts with respect 
to these matters are to be inscribed, “and some of the 
Tules so resolved are ordained as unwritten ancestral 
customs” (Plato [1957] 1992, 298e), perhaps a jibe at 
the efforts by democrats and oligarchs alike to high- 
light certain preferred provisions as part of the ancient 
constitution. Further, each year magistrates are chosen 
by lot to navigate and tend to the ill and, at the end 
of the term, are subject to euthyna, or the obligation 
to render accounts (299a). Young Socrates is properly 
horrified at the picture before him, and as a result the 
Athenian Stranger suggests that instead laws against 
inquiring into the legislator’s laws and influencing 
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people against the laws ought to be established, with 
strict penalties.” 

This is not to say that changes may not be introduced. 
However, they may only be enacted by the legislator 
or by a similarly enlightened small group (as no large 
group is capable of acquiring any art) (Plato [1957] 
1992, 300d-e). The laws are improvable, as the legis- 
lator takes into account new information or “changing 
winds” (295d), but only the enlightened will be able 
to discern the correct legislative response. To rule a 
mass of people, then, the second-best course of action 
is rigid law, modifiable when necessary only by a legis- 
lator, ideally, though, flexible and specific guidance by 
a true statesman is the preferred solution. 

In Laws, the Athenian Stranger refines the argu- 
ment, reiterating that the generality of legislation will 
leave certain gaps in the law, which the administra- 
tors of the law ought to repair, with the assistance of 
the original legislator, if he is alive (Plato 1980, 772b- 
c). Experience will demonstrate where these holes ex- 
ist, and the administrators should settle these matters 
yearly, until they are satisfied. (For matters of sacrifices 
and dances, the Athenian Stranger suggests that this 
period of experimentation—the tinkering with law to 
ensure a good fit—might be 10 years [772b-c].) After 
this point, the law ought to be considered immutable, 
but the Athenian Stranger cannot entirely eliminate the 
possibility for change. If the “force of circumstances” 
is compelling enough, the administrators must con- 
sult the officials, the citizens, and the oracles of the 
gods, and if the verdict is unanimous, then amend- 
ment is permitted (772d). Thus, specification of the 
laws ought to take place and, in fact, is encouraged 
to occur by the regularity of the process, but only in 
the first few years. After that time, if the state of the 
world changes profoundly, laws may be modified, if 
by a cumbersome process, to keep up with circum- 
stances. 

In Politics, Aristotle emphasizes his concern that 
laws not be changed too regularly. Weighing the ad- 
vantages of small improvements over the risks of ha- 
bitual modification, which may lead to disobedience, 
he argues that small errors should probably be left 
alone. The strength of the law, Aristotle (1998b) argues, 
is in the habit of obedience supporting it; whereas a 
particular art, like medicine or navigation, may be im- 
proved without risks—other than a potentially harm- 
ful outcome if enacted by the unenlightened, as the 
Athenian Stranger claimed—regular innovation in law 
serves only to weaken law itself (II.8, 1269a12—a20). As 
Bernard Yack (1993) explains, “The acceptance of legal 
limitations rests more on habit than on instrumental 
rationality” (184). 

However, Aristotle (1998a) echoes the Athenian 
Stranger’s concerns about the generality of law, asin the 
famous passage on equity in The Nicomachean Ethics 


2 As noted above, in the discussion of Sparta, the Athenian Stranger 
in The Laws praised the Spartans for the laws forbidding the young 
from inquiring into the relative merits of the law (Plato [1957] 1992, 
634d-e). 


(V.10, 1137b12-23). The problem with law is that it is 
overly broad and, therefore, likely to err in its applica- 
tion to specific cases. This is not a default of law, nor 
does it imply the fallibility of the legislator, but rather a 
problem due to the disjunction between the particular- 
ity of practical matters and the universality of rules. As 
a result, where correction is needed for the sake of eq- 
uity, a modification of the law is required, by reference 
to the legislator’s intention, if he would have known 
of the particular case.” In Politics, Aristotle (1998b) 
offers a similar point, noting that it is inadvisable to 
leave written laws unchanged, because whereas law is 
concerned with generality, “actions are concerned with 
particulars” (I.8, 1269a5). 

In the fifth century, the Athenians ruled mostly by 
particular decrees, rather than general laws, which Aris- 
totle (1998b) describes as the worst kind of democracy 
(IV.4, 1292a). Moreover, these decrees were in some 
cases entrenched and then abrogated, in what might 
be viewed as Aristotle’s worst-case scenario, as in the 
case of the 1,000 talents. Yet neither Plato nor Aris- 
totle rules out the modification of laws; in Statesman, 
although there is a strong preference to leave laws by 
an enlightened legislator untouched, modifications are 
ultimately permitted. Although the legal institutions of 
the fourth century also reflect a commitment on the part 
of the Athenians to stable law, by permitting modifica- 
tions to occur, the Athenians reaffirmed their prefer- 
ence for flexibility, although in a fashion distinct from 
the simple-majority changes of the fifth century. This 
point is often neglected in favor of the “sovereignty of 
law” conception of the fourth century. Sealey (1986), 
in particular, argues that demokratia meant “rule of 
law,” rather than “popular sovereignty” (146-48). Yet 
as Ober argues, the concept of sovereignty is anachro- 
nistic in the ancient world, and in any case ought not 
to be viewed simply as the institutional locus of legal 
power. Instead, Ober (1996) argues, “The concept of 
sovereignty can usefully be applied to democracy only 
by replacing the idea of ‘sovereignty as located in insti- 
tutions’ with ‘sovereignty as the ability to change insti- 
tutions” (121). Drawing on Ober’s insight, the logic of 
pragmatic innovation is able to provide a foundation 
for democratic agency as directed toward the modifi- 
cation of laws in light of new technical knowledge or 
circumstantial change. The Greek world’s relationship 
to Athens in the fourth century, as demonstrated in the 
use of entrenchment, reflected both the institutional 
changes in Athens and its enduring commitment to 
mutable law. 


B This nod toward intentionalism 1s echoed in Rhetoric at 137410. 
The distinction between changes made by judges and legislative 
amendment 1s certainly contested. Judicial lawmaking 1s in this con- 
text read as amendment. 

24 Josiah Ober (1989) pomts out this “constitutional law trap” into 
which those seeking an extenor rule of law in Athens tend to fall (22) 
The argument here 1s not designed to capture the understanding of 
law on the part of the Athenians in the fourth century, but only to 
point out that the nsttutions of the Athenians in the fourth century, 
and the criticisms offered of legal institutions by elites, may have 
permitted more flexibility than 1s ordinarily believed. 
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ENTRENCHMENT IN THE FOURTH 
CENTURY 


As in the fifth century, the epigraphic record is some- 
what spotty, and it is somewhat difficult to make general 
claims as a result: Whereas there are 488 extant psephis- 
mata —although many are so fragmented as to make in- 
terpretation difficult—there are only eight nomoi. This 
is likely due in part to the infrequent publication of 
laws on stone, but Hansen (1991) argues that the cre- 
ation of nomoi was substantially less common (176). 
This does not make analysis impossible, but it does sug- 
gest that great care is needed in drawing conclusions. 
This is not solely a problem of nonspecialists; classicists 
tend to assume that fourth-century provisions included 
entrenchment clauses while citing very few cases in 
which they actually appear (Hansen 1991, 165; Lewis 
1974, 88) and identify only one entrenched nomos at a 
maximum. 

Alliances continued to be entrenched, in general, 
until the middle of the fourth century, and the cases 
in which they do not appear to be are decrees that are 
in generally poor condition (Harding 1985, 43, p. 60) 
and, thus, inconclusive. Again, this is a somewhat 
weaker form of entrenchment, a “for all time” clause, 
analogous to the “forever” provision seen in the fifth 
century. But classicists tend to take seriously both “for- 
ever” requirements and specifications that the alliance 
should endure for a particular period; given this, the 
inclusion of decrees specifying that alliances ought to 
endure “for all time” should count as entrenched. The 
fact that alliances continue to be entrenched is likely 
attributable to the perception that because alliances 
were still made by the Assembly and Council, the abil- 
ity for them to commit themselves continued to be 
rather shaky. The most famous case of entrenchment in 
a fourth-century alliance is certainly that of the Second 
Athenian Confederacy (Harding 1985, 35, pp. 48-52.), 
which enabled Athens to become leader of the Greeks. 
The decree provides a penalty for one who proposes 
a contrary decree in the form of atimia (loss of civil 
rights), loss of property, death, and burial neither in 
Athens nor in allied lands. The remarkably severe form 
of entrenchment provided in this decree may empha- 
size the extent to which Athens was perceived as being 
likely to deviate. 

As in the fifth century, honorary decrees remain un- 
entrenched, but the fourth century does offer a non- 
Athenian case, from Iasos, of an entrenched honor 
(Harding 1985, 114, p. 142). Gorgos and Minnion, sons 
of Theodotus, apparently did good deeds for the city, 
not the least of which was the recovery of an inland lake, 
and were rewarded with an exemption from taxation 
and the front seat at public festivals for all time. The 
presence of such an entrenched honor suggests that the 
decision to leave honors without entrenchment clauses 
in Athens was intentional; by leaving the honor flexi- 
ble, it recognizes that the recipient of the honors may 





Z An eighth law has been published recently in Stroud (1998). The 
bulk of the evidence provided in Harding (1985) is overwhelmingly 
1n the form of decrees, and the general claims derived from decrees 
are more reliable. 


later injure the city, and the honor should be a special 
recognition at a given time (often in the form of a 
golden crown). 

Laws, in general, do not appear to have been en- 
trenched. A particular good example of this phe- 
nomenon is a law specifying the procedures for silver 
coinage (Harding 1985, 45, pp. 61-64.), roughly analo- 
gous to the coinage decree in the fifth century. Unlike 
the fifth-century coinage decree, which is entrenched, 
the coinage law is flexible. Note also that it has been 
elevated to the status of a nomos and is, therefore, not 
subject to the potential abuses of the Assembly. Ad- 
ditionally, the law provides. as in accordance with the 
notion of nomoi as hierarchically superior to psephis- 
mata, that “if there is any decree that has been inscribed 
anywhere on a stele (that is) contrary to this law, let it 
be destroyed by the secretary of the Boule” (Harding 
1985, 45, p. 63). Even Eukrates’ law against tyranny of 
337/6, which held that revolutionaries against the peo- 
ple could be murdered with impunity and prescribed 
disenfranchisement (atimia) for magistrates who de- 
liberated during a tyranny—perhaps thereby lending 
legitimacy to the tyrants—refrains from entrenching 
the democracy. 

It may be argued that Demosthenes, in Against Aris- 
tocrates, offers evidence of an entrenched law, because 
it prescribes atimia for modification of an aforemen- 
tioned homicide statute (Hansen 1991, 165). However, 
Lewis (1974, 88) notes that this is actually Drakon’s 
homicide law (cited differently, and incompletely, in 
Fornara 1983) and, as such, perhaps ought not properly 
to be considered within the scope of fourth-century leg- 
islation per se. Moreover, Demosthenes is extremely 
supportive of unamendable law: Note, for example, 
Demosthenes’ (1935) praise for the Locrians, who 
heard arguments for changes of law with a noose 
around the proposer’s neck (which was tightened if the 
proposal was defeated), and who thus changed only one 
law in 200 years, is another indicator of his tendency to 
reject modification (24.139-143).6 

Despite the sparseness of the evidence, what can we 
make of the possibility that laws, in the fourth century, 
did not include entrenchment clauses, and, moreover, 
that only a single category of decrees, alliances, tended 
to use them? The comparatively deliberate procedure 
of the fourth century might well lend itself to a dimin- 
ished usage of these provisions, at least for domestic 
matters. After all, the procedure for enactment of legis- 
lation, and for its revision, is lengthy and cumbersome, 
as shown above. Moreover, the enactment of nomoi 
was a relatively infrequent act. As such, if entrench- 
ment clauses once served to flag provisions that were 
especially crucial, in the context of a single, simple- 
majority procedure covering all legislation, the need 
for this sort of identification could be replaced by the 
use of nomoi. The need for a cooling-off process, simi- 
larly, was diminished when a slower lawmaking process 





26 For example, as seen above, Hansen (1991) argues that Demos- 
thenes wrongly characterizes the propensity of the Athenians to en- 
act nomoi and marvels at scholars’ willingness to “put uncritical trust” 
in Demosthenes (176). 
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was developed, as was any possible desire for precom- 
mitment. Externally, though, the Athenians were still 
perceived as untrustworthy democrats, and as such the 
entrenchment clause continued to serve as a means by 
which they could signal their commitment to potential 
allies. 

Although the legislative procedure became more 
complicated in the fourth century, it is essential to keep 
in mind the enduring commitment to flexibility, which 
could easily have been jettisoned after the disasters 
of the late fifth century and the subsequent revision 
of the laws. The reason for why they did not may be 
found in the enduring Athenian commitment to prag- 
matic innovation. In the cases in which entrenchment 
appears, it appears to derive from the concern that the 
Athenian tendency to adjust its institutions in light of 
new information may leave both Athens and its allies 
vulnerable. 


DEMOCRACY AND LEGAL CHANGE 


The Greeks, we have seen; both envied and feared the 
Athenian ability to innovate. Although, as Thucydides 
suggests, this quality enabled Athens to succeed in 
the military arena, it nevertheless undermined Athens’ 
ability to create alliances with other city-states and en- 
sure that its choices could be sustained in even the short 
term. Yet by the time of the revision at Colonus, the 
Athenians were well aware of the pernicious potential 
of flexible law and, nevertheless, decided to reaffirm 
the use of mutable law, if with some procedural checks. 
Here, I have offered an account of Athens as motivated 
by the ideology of pragmatic innovation, which initially 
appeared to contradict my findings from an analysis of 
entrenched decrees in fifth- and fourth-century Athens. 
The effort at entrenchment should not be viewed gener- 
ally as an attempt at precommitment, I have suggested, 
but as a check on a body with the potential to exploit 
informational asymmetries, the fifth-century dema- 
gogues, and as a sign of seriousness to prospective allies 
who had little confidence that Athens would be faithful. 
To all this, however, some political scientists may ask, 
Why should the details of Athenian institutions inter- 
est us, other than mere antiquarianism? One reason 
is that Athens has occupied a distinctive place in po- 
litical science as the exemplar of classical and “direct” 
democracy. The authors of the Federalist Papers repeat- 
edly reflected on the “turbulent democracies of ancient 
Greece” (Hamilton, Madison, and Jay [1788] 1961, 14, 
101); Madison famously wrote, “Had every Athenian 
citizen been a Socrates, every Athenian assembly would 
still have been a mob” (55, 342). Today, we still tend to 
project both our aspirations and our misgivings about 
democracy on Athens, while simultaneously relying on 
our conceptions of Athens to provide us with source 
material for normative accounts of democratic decision 
making. It therefore remains important to try to get 
Athens right, to accurately portray its institutions and 
its ideologies, lest it distort our models of democracy. 
A second reason, closely related to the first, is the 
“critical: distance” that reflection on ancient cultures 


affords us (Yack 1993, 18). The potential risks of le- 
gal flexibility are obviously much graver when people’s 
rights and lives are at stake. Yet this is no reason to 
dismiss theoretical inquiry grounded in historical re- 
search as dangerously indulgent. Instead, Athens pro- 
vides us with an opportunity to investigate a link be- 
tween democratic innovation and legal flexibility with 
some insulation from the language and preconceptions 
of contemporary debates over constitutionalism. Al- 
though we cannot be insulated from the distortions of 
our contemporary perceptions, we can at least attempt 
to unpack the institutions and ideas enveloping Athens 
as carefully as possible and, having done so, reevaluate 
our assumptions in light of our findings. 

Two key insights about democracy result from this 
analytical process. The first is the fact that although 
at least some institutional commitments are certainly 
necessary for democratic agency, the ability to modify 
any institutions has long been viewed as the preroga- 
tive and as even a defining characteristic of democracy. 
Since this capacity leaves prospective allies nervous, 
however, democracies may occasionally have recourse 
to hyperconstraining devices, such as entrenchment, to 
signal that they at least intend to take seriously a par- 
ticular commitment. Yet the faith that allies ought to 
have in a democracy that views change as fundamen- 
tal is, of course, limited, and as a result, a prospective 
ally’s decision to embark on a particular alliance may 
be made on grounds other than trustworthiness, such 
as the relative power of the partner. 

The concern that Athens would amend ostensibly 
entrenched clauses was not restricted to prospective 
allies: The Athenians understood that the law would be 
unable to restrain a determined demos. The Athenians’ 
prescription of death, in certain cases, for proposing an 
entrenched law’s amendment suggests that they knew 
well that an effort to change the law might succeed, 
so that an additional, individualized barrier to modi- 
fication was necessary. Yet even this constraint could 
be overcome, as the discussion of the thousand talents 
suggests, in light of changed circumstances. As a conse- 
quence, Athens could not have reasonably anticipated 
that entrenchment would prove an infallible barrier to 
efforts at change; in fact, they could not have believed 
that any such barrier could possibly exist. Entrench- 
ment constituted a concerted effort on the part of the 
Athenians to signal both to themselves and to others 
the importance of a particular norm or treaty but could 
not credibly have been regarded as an effective means 
of self-binding. 

The second implication of the discussion of Athens 
is that the use of entrenchment in the contemporary 
world may be both more strategic and less effective 
than constitutional designers and theorists have tended 
to believe. The most famous use of entrenchment is 
the protection of the human dignity clause in the post- 
World War II German Basic Law. However, we should 
not believe that entrenchment can only be used to pro- 
tect individual rights or fundamental institutional ar- 
rangements. Rather than shielding these types of laws 
from modification, entrenchment may serve a narrower 
instrumental purpose, securing instead the outcomes 
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of controversial bargains or even illiberal provisions. 
Bracketing the question of the genuine immutability 
of the laws, we might note that the two laws explicitly 
restricted from amendment in the American constitu- 
tion in Article V are those regulating the slave trade 
(through 1808) and the equal suffrage of states in the 
Senate, respectively. These two matters were the most 
contentious of the convention, comprising the most del- 
icate bargains and, in the case of the former, resulting 
in a morally heinous outcome. Although we can still be- 
lieve that entrenchment has value as a “gag rule,” taking 
divisive issues off the table in order to enable ordinary 
politics to unfold (Holmes 1995, 202-35), it is difficult to 
argue that entrenchment exists purely as an additional 
protection for virtuous constitutional norms. An Athe- 
nian would view the faith contemporary constitution- 
makers and theorists display in entrenchment’s abil- 
ity to protect key norms, such as individual rights, as 
poignantly naive. 

This is not to suggest that democracy rests in op- 
position to the rule of law, either empirically or theo- 
retically. As Jean Hampton and many others have ar- 
gued, rule of law provides a stable set of procedures 
through which democracies may operate and offers a 
predictable and egalitarian means for the resolution of 
disputes. Yet Athens reminds us that as crucial as these 
institutions may be for the functioning of the democ- 
racy, they are nevertheless the product of collective de- 
cision making by the citizens and subject to reconsider- 
ation at any time. The Athenians drew on the ideology 
of pragmatic innovation each time they enacted legis- 
lation, creating law without appeal to constitutionalism 
or revelation, and their prior decisions had little nor- 
mative weight over their current judgments. Improv- 
ability and novelty, especially with respect to institu- 
tions, exemplified the democratic character of Athens, 
especially in contrast to the fabled conservatism of the 
Spartans. The inventiveness of the Athenians, indeed, is 
their greatest legacy: Even today, attention to the hoary 
example of Athens yields remarkably fresh answers to 
enduring questions. As the Athenians subjected their 
democratic institutions to ongoing scrutiny, we are re- 
warded with insights into democracy through perpetual 
reflection on Athens. 
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Political Science and Political Understanding: Isaiah Berlin on the 


Nature of Political Inquiry 
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saiah Berlin is remembered for his positive/negative liberty distinction and his value pluralism, but he 

was also an active participant in the debate over the nature of political inquiry. This essay argues that 

his neglected contribution to this debate is central to his thought and a valuable resource in today’s 
debate over political science’s methods and ends. I first show how Berlin understood the relationship 
of empirical science to humanistic study. I then demonstrate that his conceptions of political judgment 
and the “sense of reality” were intended as alternatives to the scientific pursuit of political knowledge. 
Finally, I argue that his Churchill and Weizmann essays present exemplars of the moral excellence 
Berlin considered necessary to ennoble liberal society and the political understanding indispensable to 
comprehensive political inquiry. I conclude by noting how Berlin’s critique of scientific political inquiry 
informs his liberalism and his own methods of political inquiry. 


gence of an old debate. The question of whether 

the study of politics is better regarded as an art or 
a science is again on the minds of political inquirers. In 
part this new debate but continues a familiar one over 
the place of political theory within political science.! 
Yet the fundamental question at issue today—namely, 
“Can we know what is worth knowing about politics 
through scientific research methods alone?”—is hardly 
limited to theorists (Grant 2002, 578). Recently po- 
litical scientists in other subfields have also called at- 
tention to the unique nature of their inquiry; witness 
Rogers Smith (2002), who has observed that political 
science is distinct from the other sciences insofar as its 
subject is human beings and its conclusions affect its 
subject in ways those of other sciences cannot. Put dif- 
ferently, if political science aspires to afford its subjects 
a better understanding of themselves as well as improve 
their lives, it is humanistic in its ends even if scientific 
in its means, and it has even been said that a “student 
of society, for the very sake of being a scientist, must 
first and above all be a humanist” (Cook 1955, 272). 
But what does this mean? Precisely what—if any—sort 
of “humanism” should science in general and political 
science in particular embrace? 

Among the last century’s most careful students of 
this question was Isaiah Berlin. Today Berlin is re- 
membered above all for his delineation of two types 
of liberty and his value pluralism (e.g., Galston 1999, 
Gray 1996, and Riley 2001). Recent work has also done 
much to recover his significance as a historian of ideas 
(e.g., Cracraft 2002 and Wokler 2003). Yet his legacy 
deserves further reconsideration. In the 1950s, the same 
decade that saw the publication of his most famous 


P olitical scientists are currently witnessing a resur- 
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1 For the current debate over the place of political theory in political 
science, see, ¢ g~, Shapiro 2002; Skanner et al. 2002, 14-15; and Smith 
2002 For the antecedents of this debate in the 1950s, see, e.g., Easton 
1951, Eckstein 1956, and Friedrich 1958 and, in Britain, Greaves 1960. 


essays on liberty, his was also a principal voice in the 
quarrel between the sciences and the humanities? In 
part his contribution to this debate took the form of an 
attempt to define the proper place of political theory 
in political science (Berlin [1962] 1999; cf. White 2002). 
Yet on the whole Berlin was less interested in the prac- 
tice of professional political theorists than in political 
practice itself, and two of his most urgent concerns were 
to illustrate the dangers posed by applying the meth- 
ods of natural science to political practice and political 
inquiry and to promote the recovery of an alternative 
and nonscientific approach to political understanding. 

Below I focus on Berlin’s largely neglected work 
on political judgment and the methods of the so- 
cial sciences to present his conception of the promise 
and the limits of scientific political inquiry.” These 
essays—particularly those on the political judgment of 
Winston Churchill and Chaim Weizmann—are central 
to Berlin’s political thought, I argue, as they provide the 
natural culmination to an inquiry that began with his 
study of Marx’s and Tolstoy’s philosophies of history. 
The paper’s next section examines Berlin’s essays on 
scientific history, focusing on their discussion of the 
place of empirical science in humanistic study. I then 
turn to Berlin’s contemporaneous essays on judgment 
and the “sense of reality” to show how his study of 
scientific history’s methods informed his evaluation of 
the methods of political inquiry. Here I argue that his 
claim against scientific history—that misapplying the 
methods of natural science in historical research pre- 
cludes moral judgment under the guise of objectivity 
and neutrality—was the basis of his claim that scientific 
political inquiry encourages a positivism that denies in- 
dividual human agency and, by extension, individual 


2 This quarrel reached its peak ın Britain in the 1950s with Leavis 
1962 and Snow [1959] 1998. On the manrfestations of this quarrel! 
in contemporary political science owing to the challenges posed by 
behaviorahsm, see, e.g, Behnegar 2003, 9-27, and Gunnell 1986, 10- 
42 

3 The present essay thus does not aspire to intervene in the debate 
over the compatibility of value pluralism and liberalism but to il- 
luminate Berim’s critique of rationalism and to respond to the call 
for further study of his understanding of statesmanshrp and political 
judgment (see Lilla, Dworkm, and Silvers 2001, xi) 
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excellence. The paper then turns to the essays on 
Churchill and Weizmann to argue that Berlin intended 
these to provide models both of the sort of moral ex- 
cellence that he believed to be threatened by scientific 
determinism yet indispensable to modern politics and 
of a particular approach to political understanding that 
he considered indispensable to any comprehensive po- 
litical inquiry. By so doing J aim to shed new light on this 
forgotten side of Berlin’s project as well as point to its 
implications for the current debate over the scientific 
study of politics. 


SCIENTIFIC DETERMINISM AND THE 
HUMAN SCIENCES 


Berlin’s inquiry into the effects of the incursions of 
natural science methodology on political and historical 
study began in his Karl Marx (KM; 1939),4 was further 
developed in “The Hedgehog and the Fox” ([1953a] 
1994) and “Historical Inevitability” ([1954a] 2002), and 
received final, conclusive statements in “The Concept 
of Scientific History” ([1960a] 1999) and his essay on 
Vico ([1960b] 2000). Taken together, these essays de- 
fine and critically assess an emerging development in 
social science and suggest an alternative approach to 
the study of history. 

The target of Berlin’s criticism in these essays is, 
broadly, scientific determinism. Scientific determinism, 
on his definition, is comprised of two elements: first, 
“monism”—the belief that there exist single, discover- 
able, final solutions to all conceptual problems (polit- 
ical problems included)—and, second, a belief in the 
existence of inexorable forces to which all human af- 
fairs, historical and political, are subject. “Historical 
Inevitability” ([1954a] 2002) trains its sights on both 
phenomena. It particularly aims to expose the dangers 
of a social science dedicated to the systematic pursuit 
of “one complete and all-embracing pyramid of scien- 
tific knowledge; one method; one truth; one scale of 
rational, ‘scientific’ values” —a pursuit Berlin does not 
hesitate to castigate as a “naive craving for unity and 
symmetry at the expense of experience” (L, 96; cf. KM, 
30). Students of his political theory will recognize here 
the foundations of his value pluralism. Yet it is crucial 
to an understanding of his intentions to see that his ob- 
jection to monism was not simply to its idea of a single 
truth, but to its notion of a single truth scientifically 
discovered (see also Hausheer 1983, 51-57). 

As both a historian of ideas and a political theo- 
rist Berlin indeed was fascinated and worried by those 
“hypnotised by the magnificent progress of the natural 
sciences of their day” (CC, 106-7). He knew that the 


4 In referrmg to Berlin's works I use the following abbreviations: 
AC—Against the Current, CC—Concepts and Categories, CTH—The 
Crooked Timber of Humanity, FB—Freedom and Its Betrayal, KM— 
Karl Marx, L— Liberty, PI—Personal Impressions, POI—Power of 
Ideas, RT—Russian Thinkers, SR—The Sense of Realty, TCE— 
Three Critics of the Enlightenment, WE—“Weizmann as Exilarch”; 
and ZPWW— Zionist Politics in Wartime Washington. These abbre- 
viations are given m brackets after the appropriate references in the 
References 
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extraordinary successes of scientific methods in certain 
fields of inquiry were likely to seduce those in other 
fields seeking comparable successes: hence the quar- 
rel of the natural sciences and humanities launched 
by Descartes and continued by those who applied the 
methods of the former to the subjects of the latter 
(TCE, 9, 28-29, 37-40, 112). Berlin also foresaw the 
consequences of pursuing historical and political in- 
quiry in accord with the Cartesian preference for the 
“principles and rules which alone guarantee scientific 
validity” (CC, 103). His own empiricism led him to be 
particularly wary of the distorted perspective that nec- 
essarily follows when such methods are used to inter- 
pret historical data. Marx and Tolstoy revealed this to 
him most clearly. Both thinkers, he claims, “saw clearly 
that if history was a science, it must be possible to 
discover and formulate a set of true laws of history 
which, in conjunction with the data of empirical ob- 
servation, would make prediction of the future... as 
feasible as it had become, say, in geology or astronomy” 
(RT, 32). The target of Berlin’s attack is precisely this 
misguided faith in history’s discoverable laws—“the be- 
lief that everything is caused to occur as it does by the 
machinery of history itself’ or by “impersonal forces” 
(L, 103). Thus he insists that the quest to discover “in- 
exorable, all-pervasive historical laws” alone capable 
of revealing a “unitary pattern” blinds the scientific 
historian to a better way of seeing historical connec- 
tions (L, 55, 155; cf CC, 104-7; KM, 115). Berlin in- 
deed considers anathema that conception of inquiry in 
which understanding is reduced to the ability to see 
patterns, and wisdom reduced to a knack for seeing in 
which direction the world is inexorably moving (L, 104, 
113). 

Berlin then presents two objections to the scien- 
tific approach to history: one on the grounds of its 
impracticality and another on the grounds of its con- 
sequences. His first objection rests on his insistence 
that it is fundamentally misguided to claim that “all 
that exists is necessarily an object in material nature, 
and therefore susceptible to explanation by scientific 
laws” (L, 108). Berlin hardly denies science its suc- 
cesses and readily grants its utility when limited to its 
proper objects. His fear is rather that science is too 
often led in its enthusiasm to overreach. The histori- 
cal record, he insists, is too complex to be adequately 
treated by its methods. This too is noted by Tolstoy. 
The second appendix to War and Peace reveals the 
folly of attempting to subject the infinite complexity 
of the “uninspectable” elements of the moral and po- 
litical and spiritual worlds to fixed rules (RT, 73; cf. 
SR, 33). Berlin’s agreement with Tolstoy on this point 
is evident in his own claim that no single human be- 
ing can grasp the unfathomable array of factual com- 
plexity necessary for a truly comprehensive and “scien- 
tific” understanding of that “larger scheme of things” 
of which we are a part (RT, 74; SR, 15, 34n). Berlin of 
course was no reactionary, and he was quick to place 
the label of “an absurd nostalgic delusion” on the be- 
lief that natural science and technology “prevents us 
from direct contact with reality—‘being’—which pre- 
Socratic Greeks or medieval Europeans saw face to 
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face” His claim is more modest: “My argument is 
only that not everything, in practice, can be—indeed 
that a great deal cannot be—grasped by the sciences” 
(SR, 48). i 

As the sciences cannot grasp everything, Berlin con- 
cludes that there must exist a sphere in which “hu- 
man categories” rather than scientific categories ap- 
ply: “namely the world of human beings” (L, 19; cf. 
TCE, 43). History—what Aristotle describes as the 
record of what human beings have done and thought 
and suffered—is the preeminent account of this sphere 
(AG, 95; PI, 10; CC, 103; Cracraft 2002, 296-97). Such 
a view of history contains within itself a critique of 
scientific history. Historical study is distinguished by 
the unique nature of the inquirer’s relationship to his 
subject matter. As history and politics are human activ- 
ities and reflections of human beings themselves, their 
study is necessarily self-referential and, unlike natural 
science, demands a certain intimacy between inquirer 
and subject: “In history we are the actors; in the natural 
sciences mere spectators” (TCE, 88; cf. 161). For this 
reason, students of “human affairs” must neither begin 
with nor aspire to an objective or neutral analysis of 
data: “Here J am not primarily an external observer, but 
myself an actor; I understand other human beings, and 
what it is to have motives, feelings, or follow rules, be- 
cause I am human myself” (CC, 129). Thus “wholly de- 
personalised history” is “a figment of abstract theory,” 
as historians will find it impossible “to suppress even 
that minimal degree of moral or psychological insight 
and evaluation which is necessarily involved in viewing 
human beings as creatures with purposes and motives” 
(L, 140-41). On these grounds Berlin calls historians to 
abandon all aspirations to the perspective of detached 
spectators and calls them, instead, to strive to under- 
stand their subjects “from the inside” (AC, 95). Berlin 
took this lesson in part from his study of Vico, who 
impressed on him the necessity of “entering into” the 
past and understanding men and their actions internally 
(TCE, 18-19, 31, 47-50, 52, 65, 129; CC, 137). The lesson 
was reinforced by his study of Herder’s conception of 
Einfithlen, that natural and irrepressible sympathy that 
binds inquirer to subject and enables each to recognize 
the uniquely human status of the other (TCE, 14, 211- 
12, 236, 318). To privilege “some infallible scientific 
key” over Vico and Herder’s imaginative sympathetic 
method Berlin considered “one of the most grotesque 
claims ever made by human beings” (SR, 21; cf. Cracraft 
2002, 292). 

To understand why Berlin uses the language of the 
grotesque to describe what seems to be a mere method- 
ological error requires examining his thoughts on the 
political consequences of scientific determinism. In one 
sense, these consequences are well known to students 
of his political theory. The slippery slope from monism 
to totalitarianism is a prominent feature of his most 
familiar work: that faith in the “inexorable” must lead 
to “demands for human sacrifice” (L, 106n; cf. 212). Yet 
Berlin rejects determinism not only on the grounds of 
its political consequences, but also for its moral conse- 
quences, Thus his more subtle charge is that determin- 
ism claims that “individual responsibility is, ‘in the end, 


an illusion,” as “the individual’s freedom of choice (at 
any rate here, below) is ultimately an illusion” (L, 107, 
110). Determinism aspires to bring a “happy release 
from responsibility” and to eradicate all standards for 
moral judgment, and it is this claim that Berlin rejects 
and counters by reestablishing the reality and necessity 
of free choice to a fully human life (L, 160; cf. FB, 32). 
Indeed, Berlin’s famous defense of free choice via neg- 
ative liberty is itself an element of this larger project to 
reestablish an appreciation of the moral responsibility 
and agency of individual political actors (L, 27, 115, 
131). 

The crux of Berlin’s argument against determinism 
is then that it inhibits the natural judgments of the 
inquirer—the very judgments encouraged by the sym- 
pathy that binds inquirer to subject. The detached in- 
quirer rejects these; to suppose nothing, to propose 
nothing, to impose nothing, and only to explain—such 
is his ostensible goal (L, 125, 127). But even if such 
a degree of detachment could be achieved, would it 
be desirable? Berlin thinks that it would permit us 
to say nothing more than that “Alexander, Caesar, 
Atilla, Muhammad, Cromwell, Hitler are like floods 
and earthquakes, sunsets, oceans, mountains; we may 
admire or fear them, welcome or curse them, but to 
denounce or extol their acts is (ultimately) as sensible 
as addressing sermons to a tree” (L, 115-16). So con- 
ceived, determinism is not merely an assault on agency, 
but a concerted attempt “to overthrow some of the 
most deeply rooted moral and intellectual habits of hu- 
man beings” (L, 163). In particular, it assaults decent 
common opinions about what is “morally praisewor- 
thy or blameworthy” and what constitutes “worth and 
desert” (L, 116n). Thus Berlin’s most forceful charge 
against scientific determinism is that its surface ob- 
jectivity conceals a decided prejudice against the re- 
ceived common morality it seeks to displace, as its goal 
is to demonstrate that “moral responsibility is a pre- 
scientific fiction” or “delusion” and that “praise and 
blame are subjective attitudes put to flight by the ad- 
vance of knowledge” (L, 138, 154; cf. 6, 21, 103, 118, 
122, 268-69; RT, 69). It is precisely this subjectivism that 
he challenges in reestablishing against determinism the 
possibility that men in fact aspire to “deserve (and 
not merely elicit or respond to) praise and blame, ap- 
proval or condemnation” (L, 110). Hence his repeated 
insistence on “the assumption of the reality of human 
choices” and “the sense of mission and dedication, the 
voice of duty” that he thought the twentieth century 
must recapture if it was to survive itself (L, 121, 127). 
Put differently, Berlin calls for a recovery of a sense of 
honor derived from acting honorably and an appreci- 
ation of the difference between what is praiseworthy 
and what is praised. 

The positivist assault on moral judgment, Berlin con- 
cludes, is nothing less than an assault on the “central 
values... common to human beings as such, that is, 
for practical purposes, to the great majority of men in 
most places and times” (L, 25). Indeed far from reject- 
ing these, Berlin, like Hume, calls political inquirers 
to respect “the normal thoughts of ordinary men” and 
“the language of common sense” (L, 138, 142, POI, 
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20-21).* Berlin in fact was deeply sympathetic to 
Tolstoy’s claim that “only by patient empirical observa- 
tion could any knowledge be obtained,” but also to his 
claim that “simple people often know the truth better 
than learned men, because their observation of men 
and nature is less clouded by empty theories” (RT, 55). 
Better then that historians join to their empirical labors 
this prereflective form of judgment than an enthusiasm 
for methods or laws. For historians, being themselves 
men, can never expect to “escape from having to adopt 
some position about what matters and how much”—an 
inescapability that, he insists, “alone is enough to ren- 
der the notion of a ‘value-free’ history, of the historian 
as a transcriber ipsis rebus dictantibus, an illusion” (L, 
23). To have values is to be human, and for this rea- 
son Berlin could never subscribe to the “morally ‘neu- 
tral’ attitude of natural scientists” (L, 141). To accept 
their “imaginary scientific canon which distinguishes 
between facts and values very sharply” would be to 
“do violence to the basic notions of our morality” and, 
thereby, fail “to face the fact of human responsibil- 
ity.” Thus Berlin’s final word on the scientific histori- 
ans: “They throw dust in their own eyes as well as in 
ours, obstruct our vision of the real world, and further 
confuse an already sufficiently bewildered public about 
the relations of morality to politics, and about the na- 
ture and methods of the natural sciences and historical 
studies alike” (L, 163-65). 


POLITICAL JUDGMENT AND THE SENSE 
OF REALITY 


Berlin’s critique of scientific history rests on the claim 
that its idealism glosses over the complexity of histor- 
ical and political reality. In this respect his critique of 
idealism in political inquiry mirrors his contemporane- 
ous critique of idealism in political theory and polit- 
ical practice. Berlin of course presented his response 
to political idealism in his lecture on two concepts of 
liberty (Skinner 2002b, 239-43). But what solution does 
he offer to the former problem, that of idealism in in- 
quiry? His answer is to be found in his exposition of the 
“sense of reality,” the idea at the heart of his thoughts 
on historical and political judgment and understanding. 
Berlin develops his thoughts on the sense of reality in 
another set of essays of the 1950s, concurrent with and 
intimately tied to his critique of scientific determinism. 
Taken together, these essays—“Realism in History” 
([1953b] 1996), “Realism in Politics” ([1954b] 2002), 
and “Political Judgement” ([1957] 1966)—further de- 
velop and push toward completion the critique of sci- 
entific determinism examined above.‘ 





5 For Berlin’s debt to Hume, see, eg, Gray 1996, 9 (cf. 81-82), 
Hampshire 1991, 129; and Wollheim 1991, 78. Also, Berlin’s clam 
here and elsewhere that human affairs depend upon “the moral 
categones and concepts which normal language incorporates and 
expresses” (L, 163) suggests the residual influence of the Oxford 
philosophy m which he was tramed, and particularly the understand- 
ing of “ordinary language” advanced by J L Austin among others 
sce PI, 139-42), 

The essay reprinted as “The Sense of Reality” in the book of the 
same title was originally delivered as a lecture at Smith College in 
1953 under the title “Realism in History” 
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What then is the “sense of reality,” and what is its 
relationship to Berlin’s critique of determinism? We 
have already seen him allege that determinism assaults 
the decent moral sentiments and judgments of common 
life. But precisely that which science rejects he reclaims 
for the sense of reality. “What in ordinary life we call 
adequate explanations often rest not on specific pieces 
of scientific reasoning, but on our experience in gen- 
eral, on our capacity for understanding the habits of 
thought and action that are embodied in human atti- 
tudes and behaviour, on what is called knowledge of 
life, sense of reality” (CC, 128). The proper antidote 
to the myopic monistic understanding of human be- 
havior is thus not philosophic wisdom but wisdom of a 
different sort. The “knowledge of life” that is synony- 
mous with the sense of reality is discovered not through 
solitary contemplation—to which Berlin is consistently 
hostile, though often more content to caricature than 
repudiate—but instead “springs from interaction with 
others and with the surrounding environment” (CC, 
134). To depart from scientific methods is thus not to 
abandon science for philosophy, or to embrace that 
which “in some way ‘transcends’ or is ‘beyond’ nor- 
mal experience”—“some special act of magical divina- 
tion not describable in the language of ordinary experi- 
ence.” Berlin instead recommends a process that enters 
“intimately into our most normal experience, and is a 
kind of automatic integration of a very large number 
of data too fugitive and various to be mounted on the 
pin of some scientific process” (SR, 24). The sense of 
reality is perhaps best understood as a refined common 
sense, a skeptical alternative to the claims of both social 
science and philosophy. 

Neither scientific nor philosophical, the sense of re- 
ality is essentially practical and political, and Berlin 
presents it in language reminiscent of Aristotle’s ac- 
count of practical wisdom. Thus he explains that the 
sense of reality lies neither in “some transcendent sense 
of the inexpressible oneness of life to which poets, mys- 
tics and metaphysicians have in all ages testified” nor in 
a deeper awareness of “the ‘iron laws’ of the sciences.” 
The sort of understanding he admires is instead that 
of “the permanent relationships of things, and the uni- 
versal texture of human life, wherein alone truth and 
justice are to be found by a kind of ‘natural’—somewhat 
Aristotelian—knowledge.” And like that of Aristotle’s 
phronimos, such knowledge rests on “an awareness, not 
necessarily explicit or conscious,” or even one capable 
of being articulated by its owner (RT, 69-71). But where 
are such wise men to be found, if not among scientists 
or philosophers? At times Berlin points to Tolstoy (RT, 
50); at other times, to his characters (RT, 74). But to see 
it at its peak, he turns to two other groups: “historians 
and novelists and dramatists and ordinary persons en- 
dowed with understanding of life,” and statesmen (SR, 
25, 47; POI, 138, 188; cf. Cracraft 2002, 287). 

Berlin’s account of the historian’s sense of reality 
in “Realism in History” begins with the now familiar 
critiques of inexorability, the search for patterns, and 
the reverence for the “machinery of determinism” (SR, 
1-2). Yet for all this he never calls the historian to aban- 
don the epic theorist’s attempt to understand human 
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experience as a whole; the “Wirkungszusammenhang, 
the general structure or pattern of experience,” is in 
fact “absolutely indispensable to the historian” (CC, 
139). His point is rather that a means of pursuing com- 
prehensive understanding other than those offered by 
scientific methods must be discovered. The precision 
afforded by such methods, he insists, is achieved at the 
cost of a more comprehensive understanding of the en- 
tirety of experience. Again like Aristotle, Berlin insists 
that inquirers should strive only for that level of preci- 
sion that is appropriate to the nature of their subjects: 
“to preach mechanical precision, even in principle, in 
a field incapable of it is to be blind and to mislead oth- 
ers” (SR, 53). In historical inquiry the reduction of the 
complexity of experience to precise patterns of elegant 
simplicity misses the “total texture” of human history, 
compounded “of literally countless strands” (CC, 114). 
What is required is rather a way of seeing and under- 
standing that, in charting patterns, captures rather than 
denies that “‘thick’ texture” of conflicting ideas, beliefs 
and events of which the historical record is comprised 
(CC, 139; SR, 30-31). 

In rejecting determinism Berlin thus calls for a re- 
covery of judgment. In response to detractors skeptical 
of the “mysterious capacity” of “the so-called faculty 
of judgment” (CC, 116), he explains that historians 
require 


a capacity for integration, for perceiving qualitative sim- 
jlarities and differences, a sense of the unique fashion in 
which various factors combine in the particular concrete 
situation... The capacities needed are rather those of as- 
sociation than of dissociation, of perceiving the relation of 
parts to wholes, of particular sounds or colours to the many 
possible tunes or pictures into which they might enter, of 
the links that connect individuals viewed and savoured as 
individuals, and not primarily as instances of types or laws. 
(CC, 140) 


The “gifts that historians need” thus differ markedly 
from the gifts required for progress in the natural sci- 
ences: not a mere facility with “idealised models,” but 
“something at the opposite end of the scale, namely 
an eye for what is unique and unrepeated”—an eye 
for what renders an idea, individual, or culture distinct 
as opposed to an eye for how it might be assimilated 
into an ideal framework (SR, 22). Rather than force 
the discrete elements of experience to fit a model, the 
best historians create syntheses of meaning by allow- 
ing unique and idiosyncratic facts to speak for them- 
selves. Meinecke is a favorite model of this art; it was 
his achievement “to avoid constricting and distorting 
formulae, fanatical faith in laws that social change must 
obey, into which all the facts must be compressed.” His 
“unbroken sense of reality” consisted in a sensitivity to 
the individual and particular. 

Berlin’s admiration of Meinecke is profound. Indeed 
he might well have been describing himself when he 
describes Meinecke’s central questions as “the relation 
of values (both of historians and of men in general) to 
objectively established facts and to the conclusions of 
the natural sciences” and “the apparent incompatibility 
between the methods of the natural sciences and those 


of humane studies, and the implications of this for po- 
litical and individual morality.” But most telling is his 
account of the source of Meinecke’s concern with such 
problems: “These problems arose for him not merely 
as a historian or as a student of historical method, but 
as a German and a human being” (POI, 209-10). At the 
heart of both Berlin’s and Meinecke’s struggles against 
determinism is the conviction that what it means to be 
human is revealed only in being and acting with other 
human beings. Understanding thus begins with sym- 
pathy and the moral appreciation of others it affords, 
and it is for this reason that Berlin numbers among 
the talents “indispensable to historians, but not (or not 
to such a degree) to natural scientists,” a “capacity for 
understanding people’s characters, knowledge of ways 
in which they are likely to react to one another, abil- 
ity to ‘enter into’ their motives, their principles, the 
movement of their thoughts and feelings” (CC, 133). 
Meinecke thus reinforced Vico in reminding students 
of history and politics of the necessity of “a capacity 
for sympathy and imagination beyond any required by 
a physicist” (CC, 136). 

Such, in brief, is Berlin’s critique of and remedy for 
historical inquiry. But what might such an understand- 
ing of history imply for a student of politics? Quite a bit, 
Berlin thinks: “History is the account of the relations 
of humans to each other and to their environment; con- 
sequently what is true of history is likely to be true of 
political thought and action as well” (SR, 28). Berlin’s 
own approach to political inquiry was decisively shaped 
by his approach to history. Just as he began his study of 
historical inquiry with a question concerning the utility 
of scientific history, his study of political inquiry be- 
gins with a question concerning the nature of political 
knowledge: 


What is this knowledge? Is it knowledge of a science? Are 
there really laws to be discovered, rules to be learnt? Can 
statesmen be taught something called political science— 
the science of the relationships of human beings to each 
other and to their environment—which consists, like other 
sciences, of systems of verified hypotheses, organised under 
laws, that enable one, by the use of further experiment 
and observation, to discover other facts, and to verify new 
hypotheses? (SR, 40)’ 


Berlin clearly answers in the negative: “There is no nat- 
ural science of politics any more than a natural science 
of ethics” (SR, 49). But what route did he take to this 
conclusion, and what alternative does he provide? 
Berlin’s engagement with the French Enlightenment 
laid the foundation for his own “suspicion of intellectu- 
als in politics” (SR, 52). His antipathy to the disastrous 
political consequences of philosophical rationalism is 
obvious in the BBC Radio talks reprinted under the 
unsubtle subtitle Six Enemies of Human Liberty (FB). 





7 Berlin’s cntique of the language of verification here and elsewhere 
in his political writings recalls his 1939 essay on the verification princi- 
ple (CC, 12-31) and again points to hus training m philosophy. Certain 
elements of his political thought also anticipate other philosophi- 
cal debates; compare the treatment of determinism and agency in 
“Histoncal Inevitability” to, e.g , the treatment of determinism and 
responsibility offered in Strawson 1962 
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Here he developed his view of the rationalistic excesses 
of Holbach, Helvétius, and La Mettrie (among others), 
the fruit of their insistence on the calculability of human 
behavior and their belief that politics is in fact capable 
of being reduced to a science (SR, 41; CC, 153; AC, J; 
L, 108-9; TCE, 276-78; FB, 5-10; KM, 31-34, 66). This 
project came of age with Plekhanov, he further insists, 
whose careful study of the French materialists inspired 
the program that earned him the title “the father of 
Russian Marxism” (Berlin [1956] 2002, 129-30). Yet 
Berlin’s criticism goes deeper than the familiar notion 
that the scientific enthusiasm of the eighteenth-century 
philosophers laid a foundation for modern totalitari- 
anism (POI, 141-42; L, 278). For not only does Berlin 
fear the consequences of the positive program of the 
Philosophes, but also he fears the loss of that which 
they sought to replace. The philosophes taught that 
“political judgement need never again be a matter of 
instinct and flair and sudden illuminations and strokes 
of unanalysable genius; rather it should henceforth be 
built upon the foundations of indubitable knowledge” 
that emerge from the “rational study of human nature.” 
The “mysterious art of government was to be myste- 
rious no longer,” having been reduced to “a matter 
of professional competence and specialisation.” Thus 
their hope: that “social engineers” or a “despotism of 
an élite of scientists” would replace statesmen (SR, 43; 
FB, 24; c£ POI, 135; SR, 28; L, 85-88). 

In his own writings Berlin sought to rescue the pos- 
sibility of “political genius” from the philosophes and 
their modern utilitarian disciples who “substitute coun- 
terfeit science for individual judgement” (SR, 40, 52). 
To do so he first examines the nature of this genius. Just 
as he earlier distinguished the historian’s “gift” from 
the scientist’s, so again does he distinguish the states- 
man’s “gift” as “wholly incompatible with faith in the 
supremacy of some idealised model” (POI, 140). His 
gift too cannot be taught: It “cannot be wholly learnt 
from books or professors” but, rather. “requires consid- 
erable personal experience and natural aptitude” (SR, 
41). Like the phronimos who knows how to act but 
cannot articulate the principles by which he does, these 
“statesmen of genius” have “understanding rather than 
knowledge—some kind of acquaintance with relevant 
facts of such a kind that it enables those who have it 
to tell what fits with what: what can be done in given 
circumstances and what cannot, what means will work 
in what situations and how far, without necessarily be- 
ing able to explain how they know this or even what 
they know” (SR, 32). Like artists who know their me- 
dia, political geniuses chart courses of action “which 
they find it difficult if not impossible to explain in clear 
theoretical terms” (POI, 139). 

In describing the statesman’s gift, Berlin also appro- 
priates the metaphor of sight that characterizes the 
historian’s gift. Thus in rejecting the notion that the 
ever-changing data of historical and political reality can 
“be caught and pinned down and labelled like so many 
individual butterflies,” Berlin insists that in order to 
construct patterns true to life, “one needs to see”— 
one must have a “direct, almost sensuous contact with 
the relevant data.” It will not be enough merely “to 
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recognise their general characteristics, to classify them 
or reason about them, or analyse them, or reach con- 
clusions and formulate theories about them” (SR, 46). 
Thus like the historian, the statesman must abandon 
theory and instead return to a more common way of 
seeing. Berlin has no patience for obscurantism and re- 
jects the notion “that there is some virtue in darkness 
as such, that the most important things are too deep 
for words”—there is no defense of any sort of “magic 
eye” in Berlin’s work (SR, 48, 46). Instead he calls for 
a return to “empiricism,” though not the empiricism 
of the scientists, to be sure. Rather than begin with a 
model, proper empiricism begins by surveying the data 
and then constructs a view of the whole from a synthesis 
of the unique and discrete, preferring the atypical to 
the typical (SR, 45). No mysterious power, this art is 
merely common sense and common apprehension re- 
fined to extraordinary levels, for the “power of integrat- 
ing or synthesising the fleeting, broken, infinitely vari- 
ous wisps and fragments that make up life at any level” 
is the common property of “every human being” who 
must do so “if he is to survive at all” (SR, 47). The states- 
man’s vision is best understood as a profoundly more 
acute version of the apprehension or common sense 
that enables ordinary people to navigate ordinary life, 
the fullest and most refined type of “that ‘sense of real- 
ity’ which largely consists in semi-conscious integration 
of a large number of apparently trivial or unnoticeable 
elements in the situation that between them form some 
kind of pattern which of itself ‘suggests’—‘invites’—the 
appropriate action” (POI, 139; cf. SR, 51). 

At its best the sense of reality is then a sense “of what 
fits with what, of what cannot exist with what” (RT, 
69). In politics, as in the study of history, this manifests 
itself as an appreciation of the natural limits of human 
beings. Statesmen, no less than historians, require an 
intimate and sympathetic understanding of the unique 
claims of their particular human subjects. In his essay 
on Einstein and Israel, Berlin insists that “those who 
deal with human beings and their affairs need some 
awareness of the essential nature of all human expe- 
rience and activity, a sense of the limits of what it is 
possible for men and women to be or to do” (PI, 75). 
This grasp of human limits ultimately distinguishes the 
“non-scientific, non-generalising” political understand- 
ing of man from the scientific understanding, as it is the 
duty of “historians or men of action” to paint “a picture 
of men as free, sometimes strong, and largely ignorant 
that is the precise contrary of the scientific view of them 
as weak, determined and potentially omniscient” (SR, 
38-39; cf. POI, 141). 

Better and safer then to be a skeptic than a determin- 
ist. To take the latter route—to believe that all human 
activity can be graphed or modeled—can lead, at best, 
only to a dehumanized utilitarianism or, more likely, 
to the belief that “Lenin, Hitler and Stalin and their 
minor followers elsewhere” sought to put into practice: 
“that human beings are a good deal more plastic than 
was hitherto thought” and that “almost anything, at 
any rate far more than was hitherto thought possible, 
can be altered” (SR, 11; cf. POI, 204). Merely to cast 
off the shackles of inexorability is thus not enough. To 
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avoid a reign of destruction even more devastating than 
positivism, the individual who can “cause vast changes 
to occur” and put into motion plans of “vast extent,” 
capable of affecting “the fortunes of mankind to a rad- 
ical degree,” must be tethered by a recognition of what 
human beings can and cannot be lest he become a tyrant 
himself (POI, 137-38). True political greatness is then 
not only a greatness that liberates rather than enslaves, 
but one founded on an appreciation of both the promise 
and the limits of human nature that a sense of reality 
affords. Put differently, political greatness properly un- 
derstood depends upon a particular approach to po- 
litical understanding, one that prioritizes the sense of 
reality’s comprehensive insights over the specific 
knowledge attained through specialized inquiry. Berlin 
paints this greatness in his essays on Churchill and 
Weizmann. In these portraits he illustrates not only 
the indispensable role of political greatness in political 
life, but also the indispensable role of a humanistic or 
nonscientific approach to political understanding. 


POLITICAL ACTION AND POLITICAL 
INQUIRY 


Berlin most clearly defines this greatness in three 
essays: “Winston Churchill in 1940” ([1949] 2001), 
“Chaim Weizmann’s Leadership” ({1954c] 2002), and 
“Chaim Weizmann” ([1958] 2001). Far from mere ha- 
giographic éloges, these essays bring to completion 
Berlin’s critique of the scientific study of politics. The 
conception of political greatness they offer represents 
their author’s most concentrated and sustained attempt 
to recover praise and blame, honor and responsibility, 
human agency and choice, from determinism’s assaults. 
In so doing, the essays present a challenge to a certain 
received understanding of his project. Berlin’s thoughts 
on value pluralism in liberal society have received care- 
ful attention, but lost has been his conviction that the 
continued endurance of liberal values (including plu- 
ralism) requires individuals who possess virtues that 
exceed those values traditionally promoted by liberal 
society. Berlin’s first claim in these essays is then that 
the cultivation of a type of nobility to which positivism 
is hostile and that pluralism alone can neither create 
nor sustain is indispensable to democracy’s preserva- 
tion. But Berlin’s thoughts on greatness are important 
not merely for what they reveal of his understanding 
of political action, but also for what they reveal of his 
understanding of political inquiry. His essays on polit- 
ical genius or greatness also aspire to reform and to 
rehabilitate political inquiry itself. 

Berlin’s unfashionable and unabashed admiration 
for greatness is clearly meant in the first instance as a 


8 For Berln’s most vivid description of the darker side of this 
sort of individual greatness, see Berlm 1952 (published under the 
pseudonym “O. Uts”). The article’s principal claim is that without 
Stalin the Soviet regime must collapse, but in establishing this darm 
Berlin paints Stalin as an example of the continued persistence and 
necessity of individual leadership even in a regime and “an age when 
the social sclences claim to be able to predict more and more ac- 
curately the behavior of groups and individuals, rulers and ruled” 
(197). 


challenge to his colleagues and contemporaries. “Social 
theorists of various schools,” he explains, “sometimes 
try to convince us that the concept of greatness is a 
romantic illusion—a vulgar notion exploited by politi- 
cians or propagandists, and one which a deeper study 
of the facts will always dispel.” This view he calls a 
“deflationary theory” and claims that it can only be 
defeated by “coming face to face with an authentic in- 
stance of greatness and its works” (PI, 34). Now Berlin 
was no naïve partisan of great-man theories of history 
or politics, to be sure, and he is clearly sympathetic 
to Tolstoy’s assault on the folly of simple souls who 
think that in Napoleon alone lies the explanation of 
his world. Yet for all this he never denies—and rather 
insists upon—the efficacy and necessity of individual 
political actors. In his own writings Churchill and Weiz- 
mann play especially conspicuous epic roles, and his 
portraits of them are suffused with the spirit of John 
Stuart Mill’s warning (which he quotes directly) that 
the “individually small” men of today would do well 
to remember that “it was men of another stamp than 
this that made England what it has been; and men of 
another stamp will be needed to prevent its decline” 
(L, 239). Reading them we cannot miss the delight 
Berlin took in celebrating the magnanimity of his sub- 
jects and challenging the scientific predilection toward 
the classificatory and microscopic. Thus his Churchill: 
“a man larger than life,” a “gigantic historical figure 
during his own lifetime,” the “largest human being of 
our time” (PI, 23). So too his Weizmann: a man “of 
vast historical magnitude,” a giant figure “of more than 
human size, an immortal hero” who “overshadowed 
his contemporaries”—indeed “not an inhabitant of the 
twentieth century” (POI, 194; PI, 61). Together they 
represent to Berlin the possibility that the war may not 
have extinguished a certain sort of human excellence. 
The bond that unites Churchill to Weizmann runs 
deep.? Yet Berlin would also have us attend to a crucial 
distinction between them and the two types of great- 
ness and approaches to political understanding that 
they represent. Berlin’s renowned division of hedge- 
hogs from foxes has been put to so much use and abuse 
alike that one hesitates to use it as an explanatory 
device, yet the construct appropriately illuminates his 
careful distinction between his two model statesmen. 
The categories originally referred to two incommen- 
surate approaches to the comprehensive understand- 
ing of reality: hedgehogs, distinguished by a monistic 
understanding defined by a single discoverable princi- 
ple; and foxes, who deal in multiplicities of data, gifted 
alike at the study of the discrete and at the science that 
unites them (RT, 22). But only a year after the initial 
publication of “The Hedgehog and the Fox,” precisely 
the same dichotomy reappears in Berlin’s delineation 


9 The romantic love of human potential and self-realization not only 
drew nm to his two great-souled men, but also drew them to each 
other, Berhn explains. Their “natural bond” was due “to the immense 
love of life ın both.. they were both on the side of whatever ex- 
panded, grew, was likely to stand up, to animate and to quicken vital 
processes, and against all that tended towards contraction, stillness, 
everything which sprang from caution, hugged the shore, wished to 
conserve and not transform” (POI, 192). 
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of “two types of political greatness, incompatible with, 
and indeed sometimes opposed to, each other.” On 
the one side stand those dedicated to “the simplicity 
and nobility of the central principle to which they ded- 
icate all that they have,” who “impose some pattern 
so clear, so uncomplicated, upon the manifold diver- 
sity of life” and devote themselves to the realization 
of one “unadorned central doctrine.” Such we might 
call political hedgehogs. But another type of greatness 
also exists, that of political foxes. These too are distin- 
guished by their approach to political understanding: 
“So far from ignoring the infinite complexity of the 
life which surrounds them, they have an unanalysable 
capacity for integrating the tiny fragments of which it 
is composed into some coherent, intelligible pattern.” 
They have “antennae” that are “extremely sensitive 
and record half-consciously a vast variety of experi- 
ence; but instead of being overwhelmed by so much, 
their genius consists precisely in the fact that they are 
able to integrate it—not by any conscious process, butin 
some semi-instinctive fashion—into a single coherent 
picture” (POI, 186-88; cf. PI, 28-29). 

Berlin greatly admires the political hedgehog, yet his 
admiration hardly prevents him from actively doubting 
whether his is in fact the sort of greatness that ought to 
be encouraged in a world still recovering from two great 
wars. Thus he repeatedly recalls us to a troubling side of 
the political hedgehog: He is “sometimes fanatical” and 
“somewhat inhuman” and even sometimes exhibits a 
“somewhat inhuman fanaticism” (POI, 186-88; cf. RT, 
22). Elsewhere Berlin speaks of his “single principle 
and fanatical vision” and his “blindness and stubborn 
self-absorption” (PI, 28). But Berlin most pointedly 
criticizes this sort of political greatness for its incom- 
patibility with the values and ideals of the age in which 
he himself lives. This greatness, he insists, belongs to 
another world; at its best it rises to “the noble grandeur 
of the great and simple heroes of classical antiquity.” 
How different, and how much more appropriate to the 
age of democratic egalitarianism, is the greatness of the 
political fox, he suggests—the greatness of those “who 
possess the gifts of ordinary men, but these in an almost 
supernatural degree,” who “do not merely stand for but 
understand the aspirations of many humble persons,” 
and who elicit from others “a delight in their acces- 
sibility, their democratic quality, their human failings” 
(POI, 186-88). Berlin’s twofold division of excellence is 
then at once a referendum on the sort of political great- 
ness appropriate for democratic as opposed to aristo- 
cratic ages as well as a referendum on the merits of two 
different approaches to political understanding. Insofar 
as his portraits of Churchill and Weizmann represent 
these two types of greatness, we find in them both his 
understanding of modern magnanimity and his defense 
of the approach to political Knowledge likeliest to lead 
to genuine political understanding. 


10 Roosevelt ıs another worthy candidate for the ttle of Berlin’s 
quintessential political fox Certainly his accounts of Roosevelt’s 
political genius bear important resemblances to the qualities he em- 
phasizes in his studies of Weizmann (see especially PI, 11-12, 21, 
28-29, and also the companson of Roosevelt and John F Kennedy 
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Churchill, Berlin’s quintessential political hedgehog, 
represents both a final incarnation of classical political 
virtue and a potentially dangerous approach to polit- 
ical wisdom. More than once Berlin calls attention to 
Churchill’s “Periclean reign,” “his glorious Periclean 
tule,” and his role as “a mythical hero who belongs to 
legend as much as to reality” (PI, 16; ZPWW, 53; PI, 23). 
But what principally distinguished Churchill was his ab- 
solute commitment to that same all-encompassing and 
singular vision that distinguishes the political hedge- 
hog, his “stubborn faith and a single-minded, unchang- 
ing view of the public and private good” (PI, 7). Able 
to conceptualize and articulate a vision far nobler than 
that of any other man of his age, Churchill painted 
that vision “in primary colours, with no half-tones”; 
the monist’s love of elegant simplicity animates his 
“heroic, highly coloured, sometimes over-simple and 
even naive, but always genuine, vision of life” (PI, 3, 
5). But Berlin finds this vision particularly fascinating 
because it was alive, active, and creative rather than an- 
tiquarian and reconstructive. Prescriptive rather than 
descriptive, this sense of historical reality transformed 
political reality, and this “magical power to transform” 
to which our attention is repeatedly called is meant to 
appear at once enrapturing and terrifying (PI, 9, 14, 
16). He profoundly admires Churchill’s capacity to en- 
noble Britons through words that “idealised them with 
such intensity that in the end they approached his ideal 
and began to see themselves as he saw them” (PI, 14). 
He is awed by Churchill’s ability to lift Britons “to an 
abnormal height in a moment of crisis,” the product 
of an imagination that in envisioning what men might 
become refused to be limited by what they are (PI, 
16). But such disregard of what men can and cannot 
be Berlin finds both admirable and unsettling. Thus 
“Churchill’s strength” is precisely “what is most fright- 
ening in him”: that he was utterly devoid of “number- 
less sensitive antennae” that bring home the oscilla- 
tions of the external world. Churchill “does not reflect 
a contemporary social or moral world”; instead “he cre- 
ates one with such power and coherence that it becomes 
a reality and alters the external world by being imposed 
upon it with irresistible force” (PI, 12-13; cf. L, 196-97). 
In other words: The political hedgehog lacks the virtue 
of the political fox. 

“Churchill is preoccupied by his own vivid world, 
and it is doubtful how far he has ever been aware of 
what actually goes on in the heads and hearts of others” 
(PI, 13). On this point Berlin distinguishes Weizmann’s 
magnanimity and his approach to political understand- 
ing from Churchill’s. Churchill succeeded precisely 
because he refused to be persuaded by—indeed he 
refused even to attend to—human weakness or lim- 
itations. In contrast, Weizmann’s sympathy with the 
suffering, his keen attentiveness to the human elements 


that Berim offered in his 1965 interview with Arthur Schlesigner Jr. 
as published in the New York Review of Books, 22 October 1998). 
Yet the Weizmann essays are infused with an emotional investment 
in their subject wanting in Berlin's treatment of Roosevelt, perhaps 
because the two (on Berlin’s own admission) never met (PI, 25). In 
the same vein, Berlin’s Herz] sometimes to rival Churchill 
as the consummate political hedgehog (see POI, 189) 
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of his political world, was his preeminent feature. His 
sense of reality too was shaped by his appreciation of 
history; just as Churchill’s image of the ancient con- 
stitution was the principal source of his vision of what 
Britons might become, Weizmann’s grasp of the his- 
torical experience of Jewish exile was the source of his 
vision of what Israel might become (PI, 43, 45; WE, 
17). Yet Churchill’s historical sense was born of an 
affair with its heroes and not its downtrodden—with 
Marlborough, not the masses. But it was this that distin- 
guished Weizmann, the one Jews recognized as “flesh 
of their flesh, a man of the people” (PI, 47; POL, 190; 
Cocks 1999). Weizmann, we learn, 


stood near the centre of the consciousness of his people, 
and not on its periphery; his ideas and his feelings were, as 
it were, naturally attuned to the often unspoken, but always 
central, hopes, fears, modes of feeling of the vast majority 
of the Jewish masses, with which he felt himself, all his 
life, in deep and complete natural sympathy. His genius 
largely consisted in making articulate, and finding avenues 
for the realisation of, these aspirations and longings; and 
he did this without exaggerating them in any direction, or 
forcing them into a preconceived social or political scheme, 
or driving them towards some privately conceived goal of 
his own, but always along the grain. (PI, 46; c£ PI, 29; POI, 
188) 


It was Churchill’s genius to deny the limits imposed on 
man by both nature and history, but it was Weizmann’s 
to place these at the center of his project. Weizmann’s 
understanding of history is thus not merely a rejec- 
tion of the determinists’ fanatical faith in “historical 
‘laws’ or formulae” in favor of a “sense of the cor- 
relation of real historical forces,” but also counters 
monists like Procrustes who would impose fixed pat- 
terns onto society (POI, 192; L, 216). His sense of re- 
ality afforded him precisely what Churchill’s approach 
to political understanding did not, namely, awareness 
of “the strengths and weaknesses, the purposes and 
characters of the human beings and institutions with 
which he was concerned” (POI, 192). Both intellectual 
and political hedgehogs are susceptible to fanaticism, 
but Weizmann, “above all things an empiricist,” pos- 
sessed a historical imagination that led him to deny 
utopia and, instead, afforded him an “almost infallible 
sense of what cannot be true, of what cannot be done” 
(PI, 45; cf. 57). “On good terms with reality,” he was 
free “from fanaticism and intolerant, utopian idealism,” 
even amidst “intense vision and singleness of purpose” 
(WE, 18-20). Churchill was animated by an imagina- 
tion that favored “ideal models in terms of which the 
facts are ordered in the mind” and imposed this model 
onto reality so as to render its multiplicities more uni- 
form. But Weizmann’s sense of reality and antennae 
constructed a vision of reality more true to life from 
a comprehensive synthesis of its diverse elements and 
suggested political arrangements “that embody decent 
respect for the opinions of mankind” (PI, 9; WE, 21). 
Berlin’s distinction of Churchill from Weizmann, the 
political hedgehog from the political fox, returns us 
to our original question regarding the nature of po- 
litical inquiry. That modern political stability depends 
on the cultivation of a particular type of democratic 


political leader is only the first of the lessons that he 
means to teach in his essays on statesmanship. Among 
his other aims is that of restoring to political inquiry 
an appreciation of the ways in which individual actors 
shape political life. For not only does Berlin argue that 
great political actors are indispensable to politics, but 
also he insists that an appreciation for “the efficacy of 
individual initiative” is indispensable to any political 
science that aspires to a comprehensive understanding 
of political life (SR, 12). Just as no inquiry in the natural 
sciences would be considered complete if its methods 
could not account for the most powerful forces that 
shape its subject, so does Berlin suggest that no polit- 
ical inquiry is complete if it fails to account for “the 
influence on public events of individual character and 
individual behaviour” (ZPWW, 8). Greatness itself he 
measures as a capacity to shape political life to an ex- 
treme degree: “To call someone a great man is to claim 
that he has intentionally taken (or perhaps could have 
taken) a large step, one far beyond the normal capaci- 
ties of men, in satisfying, or materially affecting, central 
human interests” (PI, 34; L, 97n1). His own motive for 
writing on Churchill is to remind us of this greatness: “I 
thought, and still think, that his part in 1940 in saving 
England (and, indeed, the vast majority of mankind) 
from Hitler had been insufficiently remembered and 
that the balance needed to be restored” (PI, ix; CC, 
138). So too he reminds us that “[w]ithout Weizmann’s 
policies during the war the State of Israel could scarcely 
have come into existence” (ZPWW, 57). In ascribing 
the rescue of Britain and the birth of Israel to the efforts 
of individual actors, Berlin suggests that study of them 
is required of any inquirer who aspires to understand 
the whole of the events they shaped. Berlin’s claim is 
thus that it is no less essential for political inquirers than 
for political actors to recover a sense of what individual 
human beings can and cannot do. His own attempt to 
restore to political inquiry an appreciation for human 
agency represents one way he points to the place of 
“humanism” in a comprehensive political science, 

But Berlin would not have political science merely 
recover an appreciation for the actions of Weizmann 
or Churchill. In describing their approaches to political 
knowledge he also calls political inquirers to recover a 
certain side of their approach. His sharpest critique of 
the scientific approach to political inquiry concerns its 
specialization. Specialists seeking knowledge of their 
world are inevitably limited by both their interests and 
their methods to acquiring precise knowledge of spe- 
cific elements, Berlin thinks. Moreover, these elements 
present themselves as objects of study not because we 
are impressed by their political import, but because of 
a “specific interest” we have in these areas owing to 
“our practical needs or theoretical interests” (SR, 14). 
Again, Berlin grants that such focus can bring spectac- 
ular results in particular fields of inquiry: “There are 
vast regions of reality which only scientific methods” 
illuminate, and there “what science can achieve must be 
welcomed” (SR, 48). Yet he doubts that politics should 
be counted among these fields, as the methods of the 
specialist he judges to be incompatible with the peculiar 
character of the subjects of political inquiry. Even if 
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the individual could be sufficiently isolated so as to be 
treated in the manner that specialization requires, he 
claims that as soon as we see “how many notions, eth- 
ical, political, social, personal, go to the making of the 
outlook of a single person,” we “begin to realise how 
very small a part of the total our sciences”—and the 
humane sciences included—“are able to take in” (SR, 
15). Similarly, just as the complexity of the individual 
renders it impossible to isolate the elements of which 
he is comprised, so too does Berlin claim that the very 
complexity and nature of political life itself renders its 
component phenomena similarly resistant to isolation. 
To claim otherwise, he insists, would be to do violence to 
the distinguishing characteristic of political life, namely, 
the “complicated network of relationships involving 
every form of human intercourse” that grows “more 
and more insusceptible to tidy classification, more and 
more opaque to the theorist’s vision as he attempts to 
unravel their texture” (SR, 30). Herein lies the danger 
of those who “take their knowledge of a small portion 
of the scene to cover the entire scene,” believing that 
“methods or models which work well in particular fields 
will apply to the entire sphere of human action” (SR, 
37, 49). In political inquiry, as in political action, an 
appreciation of reality’s complex multiplicity affords a 
more accurate and more comprehensive grasp of the 
whole of political life than either a specialized inquiry 
that focuses only on its specific parts or a systematic 
inquiry that, in aspiring to uncover “patterns,” over- 
looks the distinct elements that make a political world 
unique. This is the core of Berlin’s critique of all reduc- 
tive approaches to political understanding, whether of 
monists or determinists or hedgehogs or eighteenth- 
century rationalists: “The truth they ignored was the 
existence of too great a gap between the generalisation 
and the concrete situation—the simplicity of the for- 
mer, the excessive complexity of the latter” (SR, 34). 
Berlin’s ultimate claim then is that the “excessive 
complexity” of political reality cannot be penetrated 
by a political inquiry itself excessively complex in its 
methods and language. His rhetoric begins to outrun 
him when he insists that a statesman “faced with an ag- 
onising choice of possible courses of action” is unlikely 
to “employ a team of specialists in political science”; 
such rhetoric obscures a more subtle point. Specialists 
are gifted at generalizing and comparing, but in politics 
“what matters is to understand a particular situation in 
its full uniqueness” (SR, 44-45). Full appreciation of 
this uniqueness or distinctness in turn cannot be gar- 
nered through fact-gathering, but only by “a form of 
understanding and not of knowledge of facts in the or- 
dinary sense” (SR, 23; c£ CC, 125). Ultimately Berlin 
thinks that this “form of understanding” is missed by 
those “objective” inquirers who “regard facts and only 
facts as interesting and, worse still, all facts as equally in- 
teresting” (PI, 17). In contrast to their approach, Berlin 
seeks to recover “those ways of assessing and analysing 
facts which are intrinsic to our normal daily experience 
as human beings in relation to each other,” by aspiring 
to grasp “the whole intellectual, imaginative, moral, 
aesthetic, religious life of men” in ways that admittedly 
“may not pass the scrutiny of a purely fact-establishing 
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inquiry” (SR, 27). Understanding politics requires at- 
tending not merely to concrete facts or to hidden pat- 
terns, he insists, but to seemingly more prosaic phe- 
nomena: “the half noticed, half inferred, half gazed-at, 
half unconsciously absorbed minutiae of behaviour and 
thought and feeling which are at once too numerous, 
too complex, too fine and too indiscriminable from each 
other to be identified, named, ordered, recorded, set 
forth in neutral scientific language” (SR, 23; cf. 15). 

Berlin’s aim in making such a claim is to encourage 
political inquirers to recover precisely that apprehen- 
sion of reality’s “thick texture” that distinguished the 
historical inquiry of Vico and Meinecke and the po- 
litical judgment of Weizmann. Their ambition to un- 
derstand society from within led not to relativism but 
to an attempt to grasp human reality as a whole, and 
their genius lay in their “capacity for taking in the to- 
tal pattern of a human situation, of the way in which 
things hang together” (SR, 50). This attention to the 
“total pattern of a human situation” points to the most 
important sort of “humanism” that Berlin means to 
restore to political inquiry. As we saw above, he insists 
that the sense of reality that makes historical and polit- 
ical understanding possible is founded on an apprecia- 
tion of the concepts and categories of “commonsense 
knowledge” (CC, 115). Yet scientific inquiry, insofar as 
it is “free from control by common sense,” is unable 
to afford such an appreciation (CC, 141). What is re- 
quired of political inquiry is a return to the claims of 
ordinary life in order to guarantee that such inquiry 
will remain animated by a focus on the most important 
elements of human life. Political understanding, Berlin 
claims, requires not simply sophistication in methods 
of revealing either facts or patterns, but appreciation 
of the “most pervasive, least observed categories, those 
which lie closest to us and which for that very reason 
escape description” (SR, 16). 


CONCLUSION 


Half a century ago Berlin declared that the “quarrel 
between these rival types of knowledge—that which 
results from methodical inquiry, and the more impal- 
pable kind that consists in the ‘sense of reality, in 
‘wisdom’—is very old” (RT, 78; c£. AC, 80; POI, 220; 
L, 141; CTH, 1-2). Here I have sought to demon- 
strate that Berlin himself was an active participant in 
a particular instantiation of this quarrel that erupted 
among political scientists in the 1950s. His contribu- 
tion to this quarrel, moreover, intersects with aspects 
of other contemporary positions. Leo Strauss (1961) 
criticized Berlin for his relativism, yet from the pre- 
sentation above it would seem that Berlin anticipates 
Strauss’s own critique of the reduction of politics to 
“social engineering,” his warning against attempting to 
understand politics “from without, in the perspective 
of the neutral observer,” his claim that political under- 
standing begins with “prescientific awareness,” his cau- 
tion against organizing political inquiry around a search 
for “patterns,” and his critique of scientific political in- 
quiry for its “break with common sense” (Strauss [1962] 
1995, 206, 211, 212, 219; cf. Behnegar 2003, 141-206, 
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and Pippin 2003). Even stronger is Berlin’s connection 
to Michael Oakeshott, with whom he also shares a cri- 
tique of the “assimilation of politics to engineering,” 
a distinction of “technical knowledge” from “practical 
knowledge,” a dismissal of the philosophe to whom “all 
knowledge appears equally significant,” a rejection of 
scientific history and its quest for “general laws,” a dis- 
taste for the modern intellectual world’s “addiction to 
‘practice’”, an appreciation of the danger of academic 
“specialization” for political inquiry, an insistence that 
the language of politics is “the language of every-day, 
practical life” including talk of “praise and blame,” and 
a belief that what “the statesman requires” are “the 
ordinary ‘faculties’ and ordinary knowledge that ev- 
eryone (even the convinced rationalist) uses every day 
in the conduct of his life” (Oakeshott [1962] 1991, 9, 
12, 138, 153, 181, 190, 206; 1993, 107). Paul Franco 
has recently noted certain of these similarities in the 
course of comparing Oakeshott’s critique of rational- 
ism to Berlin’s. Franco’s comparison also prompts him 
to wonder whether Berlin’s critique “remains bound 
up with the ideological wars of the fifties, attacking an 
enemy that is now no longer recognizable.” Thus he 
asks a crucial question: Is Berlin’s target in fact “some- 
thing of a straw man at the beginning of the twenty-first 
century?” (Franco 2003, 490) This question deserves an 
answer. What relevance indeed does Berlin’s critique 
of rationalism have for political inquiry today? 
Berlin’s critique of scientific political inquiry, I have 
sought to show, focuses on two goals: first, recovering 
within political inquiry an appreciation for the great- 
ness or nobility of individual political actors and, sec- 
ond, restoring to political inquiry an appreciation for 
aspects of political life better apprehended by the sense 
of reality than by scientific methods. That the first goal 
originated as a response to the political conditions of 
the 1950s is beyond doubt. Berlin’s essays on Churchill 
and Weizmann clearly seek to resuscitate an appreci- 
ation for excellence in the face of the profound pes- 
simism of the postwar age, which, it was then sug- 
gested, was “deeper than it has been perhaps since St. 
Augustine wrote De Civitate Dei” (Cobban 1953, 328). 
In the face of this pessimism, Berlin’s essays called at- 
tention to political agency and to the endurance of the 
possibility of genuine human excellence. In so doing 
they aim to restore an appreciation for nobility in a 
world characterized by despondency and obsessed with 
utility (L, 81). The year that witnessed his essay on 
Churchill Berlin (1950a) himself described as indiffer- 
ent to the fact that “large beings once walked the earth” 
(xxiii). His own writings on greatness sought to counter 
this indifference and the associated danger “that the 
commonplace or the counterfeit may be over-praised 
by those who, in their terror of missing a masterpiece 
for lack of sensibility or perception see a swan in every 
goose” (Berlin 1951a, xxix). To train the eyes of politi- 
cal inquirers to distinguish excellence from mediocrity 
is the aim of the Weizmann essay in particular. Berlin 
regarded Israel’s founding as “a living witness to the 
triumph of human idealism and will-power over the al- 
legedly inexorable laws of historical evolution.” So too 
did he regard its first president as a model of the “aes- 
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thetic splendor” that, in defying “the forces of nature 
and history,” is “what ultimately refutes utilitarianism” 
(POI, 161; L, 338; ch ZPWW, 67). 

Yet the question remains: Is this project still relevant 
today? Those yet concerned to ennoble the character 
of modern liberalism might reply that Berlin offers 
an alternative to contemporary liberalism’s dominant 
emphasis on institutions and procedures. Berlin’s lib- 
eral synthesis accommodates both negative liberty and 
moral nobility and suggests that to admire men who 
“‘did not belong to their century” is hardly “antilib- 
eral” (Arendt [1968] 1995, 101). Rather than call his 
liberal credentials into question, Berlin’s attachment 
to nobility instead suggests an alternative provenance 
for his liberalism—and herein lies its interest today. 
Berlin’s critique of liberalism’s Enlightenment her- 
itage prompted him to study the nineteenth-century 
Romantic and Counter-Enlightenment traditions—a 
study that led him to Mill, Constant, Herzen, and 
Tocqueville, each of whom also sought to harmonize 
a democratic commitment to pluralism with an aris- 
tocratic taste for greatness. Indeed with Tocqueville, 
Berlin agrees that it is necessary for all those interested 
in the future of democratic societies to “make contin- 
uous efforts to spread within these societies a taste for 
the infinite, a sentiment of greatness.” Yet also like Toc- 
queville he insists that those living today “ought not to 
strain to make ourselves like our fathers, but strive to 
attain the kind of greatness and happiness that is proper 
to us” (Tocqueville 2000, 519, 675).!! Tocqueville and 
Berlin were each sincere liberals, and rather than quar- 
rel with the egalitarianism that replaced an older un- 
derstanding of nobility, they admire not the nobility 
that challenges democracy but the nobility necessary 
for democracy. Berlin’s portrait of Weizmann is his own 
attempt to define the greatness proper to us. Churchill 
embodies a great but past age and was “perhaps the 
last of his kind,” but Weizmann, champion of “civilized 
values,” recognized the dignity of common life, which 
in the democratic age may be the noblest substitute for 
the lost promise of classical magnanimity (PI, 22; WE, 
18-21). 

The second aim of Berlin’s critique of scientific polit- 
ical inquiry, the recovery of the sense of reality, was also 
deeply indebted to his study of the Romantic Age and 
the Counter-Enlightenment, and particularly to their 
attempt to defend commonsense wisdom from the as- 
saults of Enlightenment rationality. Such an endeavor 
nicely captures Berlin’s characteristic approach to the 
study of the history of ideas. Berlin’s scholarship is 
deeply informed by his understanding of the central 
political problems of his own age, most importantly by 
the struggle of democracy against communism, which 
he took to be the apotheosis of both positive liberty and 
monistic rationalism. His attempt to bring the history 


11 Berlin’s debts to Mill, Constant, and Herzen have often been 
noted, but his sumilarities to Tocqueville also deserve recognition. 
Many of the ideas that he celebrates m Herzen and Mill and incorpo- 
rates in his own synthests—the nobility of free choice, the rejection of 
scientific determinism and embrace of self-determinism, the prefer- 
ence for greatness over mediocrity, and the mpartance of individual 
responsibility—he also found in Tocqueville (see Berlin 1965). 
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of ideas to bear on contemporary political problems 
points to a second reason why Berlin’s critique of scien- 
tific scholarly methods remains worthy of attention to- 
day. In both his day and ours, Berlin’s own chosen field, 
the history of ideas, has been criticized for elevating 
concern for method over concern for what is politically 
important, and historians of political thought have been 
compelled to defend their field against the charge that it 
is more historical than political (see, e.g., Skinner 2002a, 
5—7). But whatever other charges may be levied against 
Berlin’s approach to the history of ideas, this is not one 
of them. The boldness of the brushstrokes with which 
he painted his portraits of past thinkers leaves them 
conspicuously open to critique on the grounds of their 
accuracy and subtlety. Yet his approach was guided by 
the conviction that it is the responsibility of an intellec- 
tual historian to exercise careful judgment in choosing 
his material, and that it is necessary when examining 
the thoughts of the past to focus on those ideas which 
remain politically important. Thus more necessary than 
method for historians of ideas is what comes before 
method, namely, the judgment that enables us to deter- 
mine which ideas are worth our attention and efforts 
and which are—to use a favorite locution of Berlin’s— 
“justly forgotten.” Otherwise we run the danger of end- 
ing up like Tolstoy, equally appalled by scientific history 
and by what seems the consequence of its rejection: 
“the apparently arbitrary selection of material, and the 
no less arbitrary distribution of emphasis, to which all 
ie oa writing seemed to be doomed” (RT, 32). 

historian of ideas Berlin’s methods were gov- 
erned by a focus on recovering those ideas that remain 
of genuine political importance. This same focus is at 
the heart of his critique of the methods of scientific 
political inquiry. Like many political scientists today, 
Berlin was troubled by the propensity of contempo- 
rary academic specialization to mistake increases in 
methodological precision for increases in genuine un- 
derstanding. It is for this reason that he calls attention 
to the sense of reality. His hope is that the sense of 
reality will promote the recovery of the commonsense 
perspective of ordinary life and ordinary language and, 
thereby, narrow the gap that separates specialized in- 
quirers from political reality. By so doing he reminds 
us that the debate over the scientific approach to po- 
litical inquiry should not be limited to a debate over 
the relative merits of philosophical and empirical ap- 
proaches. Berlin would have agreed that political sci- 
ence in both its quantitative and its qualitative forms 
“may have more to offer by asking the right questions 
than by getting the right answers” (Jervis 2002, 189). 
To theorists and empiricists alike he reminds us that 
political inquiry, no less than political action, depends 
for its success not simply on the methods that enable 
us to solve problems, but also on the judgment that 
enables us to appreciate which questions remain most 
worth asking. 
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often practiced but little understood. A “case study,” I argue, is best defined as an intensive study 


(eter pracicsà atte understood and explain the utility, of the case study method, a method 


of a single unit with an aim to generalize across a larger set of units. Case studies rely on the 
same sort of covariational evidence utilized in non-case study research. Thus, the case study method is 
correctly understood as a particular way of defining cases, not a way of analyzing cases or a way of 
modeling causal relations. I show that this understanding of the subject illuminates some of the persistent 
ambiguities of case study work, ambiguities that are, to some extent, intrinsic to the enterprise. The travails 
of the case study within the discipline of political science are also rooted in an insufficient appreciation of 
the methodological tradeoffs that this method calls forth. This paper presents the familiar contrast between 
case study and non-case study work as a series of characteristic strengths and weaknesses—affinities— 
rather than as antagonistic approaches to the empirical world. In the end, the perceived hostility between 
case study and non-case study research is largely unjustified and, perhaps, deserves to be regarded as a 
misconception, Indeed, the strongest conclusion to arise from this methodological examination concerns 
the complementarity of single-unit and cross-unit research designs. 


he case study occupies a vexed position in the 
discipline of political science. On the one hand, 
methodologists generally view the case study 
method with extreme circumspection (Achen and 
Snidal 1989; King, Keohane, and Verba 1994; Lieberson 
[1991] 1992, 1994; Njolstad 1990). A work that focuses 
its attention on a single example of a broader phe- 
nomenon is apt to be described as a “mere” case study. 
At the same time, the discipline continues to pro- 
duce a vast number of case studies, many of which 
have entered the pantheon of classic works (Allen 1965; 
Allison 1971; Dahl 1960; Johnson 1983; Kaufman 1960; 
Lazarsfeld, Berelson, and Gaudet 1948; Lijphart 1968; 
Pressman and Wildavsky 1973). Judging by recent 
scholarly output, the case study method retains con- 
siderable appeal, even among scholars in research 
communities not traditionally associated with this 
style of research—e.g., among political economists and 
quantitatively inclined political scientists (Acemoglu, 
Johnson, and Robinson 2003; Bates et al. 1998; Rodrik 
2003). By the standard of praxis, therefore, it would 
appear that the method of the case study is solidly en- 
sconced and, perhaps, even thriving. 
Thus, a paradox: Although much of what we know 
about the empirical world is drawn from case studies 
and case studies continue to constitute a large propor- 
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tion of work generated by the discipline, the case study 
method is held in low regard or is simply ignored. Even 
among its defenders there is confusion over the virtues 
and vices of this ambiguous research design. Practi- 
tioners continue to ply their trade but have difficulty 
articulating what it is that they are doing, methodolog- 
ically speaking. The case study survives in a curious 
methodological limbo. 

How can we understand the profound disjuncture 
that exists between the case study’s acknowledged con- 
tributions to political science and its maligned status 
within the discipline? If case studies are methodologi- 
cally flawed, why do they persist? 

The paper is divided into two parts. The first part fo- 
cuses on matters of definition. I argue that for method- 
ological purposes a case study is best defined as an 
in-depth study of a single unit (a relatively bounded 
phenomenon) where the scholar’s aim is to elucidate 
features of a larger class of similar phenomena. It is 
demonstrated that case studies rely on the same sort 
of covariational evidence utilized in non-case study re- 
search. Thus, the case study method is correctly un- 
derstood as a particular way of defining cases, not a 
way of analyzing cases or a way of modeling causal 
relations. I show, finally, that this understanding of the 
subject illuminates some of the persistent ambiguities 
of case study work, ambiguities that are, to some extent, 
intrinsic to the enterprise. 

In the second part of the paper I proceed to examine 
the contrast between case study and non-case study 
work. The central argument here is that the differences 
between these two genres are best understood as char- 
acteristic strengths and weaknesses—affinities—rather 
than antagonistic approaches to the empirical world. 
‘Tradeoffs, rather than dichotomies, characterize the on- 
going case study/non-case study debate, 


WHAT IS A CASE STUDY? 


What is a case study, and how is it differentiated from 
other styles of research? Regretfully, the term “case 
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study” is a definitional morass. To refer to a work as a 
case study might mean (a) that its method is qualita- 
tive, small-N (Yin 1994); (b) that the research is ethno- 
graphic, clinical, participant-observation, or otherwise 
“in the field” (Yin 1994); (c) that the research is charac- 
terized by process-tracing (George and Bennett 2004); 
(d) that the research investigates the properties of a sin- 
gle case (Campbell and Stanley 1963, 7; Eckstein [1975] 
1992); or (e) that the research investigates a single phe- 
nomenon, instance, or example (the most common us- 
age). Evidently, researchers have many things in mind 
when they talk about case study research.! As a result of 
this profusion of meanings, proponents and opponents 
of the case study marshal a wide range of arguments 
but do not seem any closer to agreement than when 
this debate was first broached several decades ago. 

How, then, should the case study be understood? The 
first three options enumerated above (a—c) seem inap- 
propriate as general definitions of the topic since each 
implies a substantial shift in meaning relative to estab- 
lished usage. One cannot substitute case study for qual- 
itative, ethnographic, ot process-tracing without feeling 
that something has been lost in translation. These defi- 
nitions are best understood as describing certain kinds 
(subtypes) of case studies, rather than the general phe- 
nomenon itself. The fourth option (d) equates the case 
study with the study of a single case, the N= 1 research 
design. This is simply wrong, as argued at length below; 
case studies always employ more than one case. The 
fifth option (e), centering on phenomenon, instance, or 
example as the key term, is correct as far as it goes 
but also ambiguous. Imagine asking someone, “What 
is your instance?” or “What is your phenomenon?” A 
case study presupposes a relatively bounded phenome- 
non, an implication that none of these terms captures. 

As a substitute for these flawed definitions, I propose 
to define the case study as an intensive study of a sin- 
gle unit for the purpose of understanding a larger class 
of (simular) units. A unit connotes a spatially bounded 
phenomenon—e.g., a nation-state, revolution, political 
party, election, or person—observed at a single point in 
time or over some delimited period of time. (Although 
the temporal boundaries of a unit are not always ex- 
plicit, they are at least implicit.)? 

To clarify this definition we must establish the rela- 
tionship of the case study, so defined, to other terms in 
this crowded semantic field. Following is a set of nested 
definitions, which should be read carefully. A “pop- 
ulation” is comprised of a “sample” (studied cases), 
as well as unstudied cases. A sample is comprised of 
several “units,” and each unit is observed at discrete 
points in time, comprising “cases.” A case is comprised 
of several relevant dimensions (“variables”), each of 
which is built upon an “observation” or observations. 





1 In addrtion to sources cited above, see Brady and Collier 2004, 
Campbell (1975) 1988, Davidson and Costello 1969, Feagin, Orum, 
and Sjoberg 1991, George 1979, McKeown 1999, Ragin 1987, 1997, 
Ragin and Becker 1992, and the symposium, “The Case Study 
Method in Sociology,” in Current Sociology, Volume 40, Number 
1 (Spring 1992). 

2 Similar understandings of the term “unrt” can be found elsewhere 
(eg-, King, Keohane, and Verba 1994, 76-77). 
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For those familiar with the rectangular form of a dataset 
it may be helpful to conceptualize observations as cells, 
variables as columns, cases as Tows, and units as either 
groups of cases or individual cases (depending upon 
the proposition and the analysis). 

The most important point is that all these terms are 
definable only by reference to a particular proposition 
and a corresponding research design. A country may 
function as a case, a unit, a population, or a case study. 
It all depends upon what one is arguing. In a typi- 
cal cross-country time-series regression analysis (e.g., 
Przeworski et al. 2000), units are countries, cases are 
country-years, and observations are collected for each 
case on a range of variables). However, shifts in the 
unit of analysis of a proposition change the referen- 
tial meaning of all terms in the semantic field. If one 
moves down one level of analysis the new population 
lies within the old population, the new sample within 
the old sample, and so forth, such that an observation 
in the original proposition now becomes a case. Pop- 
ulation, unit, case, and observation are nested within 
each other. Since most social science research occurs 
at several levels of analysis these terms are generally in 
flux. Nonetheless, they have distinct meanings within 
the context of a single proposition, which defines the 
principal unit of analysis. 

I do not issue this somewhat novel definition of case 
study (an intensive study of a single unit for the purpose 
of understanding a larger class of units) with any hopes 
of displacing common usage. Indeed, there isno harmin 
continuing to refer to a case study in the various ways 
listed above (options a-e). What is important is that 
we have recourse to a narrower and clearer definition 
when methodological confusions arise so that we have a 
way to arbitrate such confusions. The definition chosen 
here is useful in this regard. Moreover, it captures the 
essential features of other extant definitions; it is reso- 
nant (Gerring 2001, chap. 3). Finally, as the succeeding 
portions of this paper show, it clarifies the distinctive 
features of a broad class of work in the discipline of 
political science and in neighboring fields of the social 
sciences, It is theoretically useful. 


The Case Study Method Considered 
as an Emplrical Endeavor 


The distinctiveness of the case study is most clearly un- 
derstood when placed within a broader set of method- 
ological options. To understand what a case study is, 
one must comprehend what it is not. 

All empirical evidence of causal relationships is co- 
variational in nature. A purported cause and effect must 
be found to covary. They must appear and disappear, 
wax and wane, or perform some other transformation 
in tandem or at some regular, more or less predictable, 
intervals. Even where this covariation is imagined, as 
in a counterfactual thought experiment, the evidence 
we imagine is of a covariational sort. Conversely, the 
absence of such covariation is taken as disconfirm- 
ing evidence. If the appearance and disappearance 
(waxing/waning et al.) of X and Y are not associated 
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TABLE 1. Research Designs: A Covarlatlonal Typology 


































Temporal! Variation 
No Yes 
None (1 unit) [Logically Impossible] (a) Case study | 
Spatial Varlat Within-untt (b) Case study ll (c) Case study Ill 
pan On Across-untt (d) Cross-sectional (e) Time-serles cross-sectional 


Across- and within-unit _(f) Hlerarchical 


(g) Hierarchical time-series; Comparative-historical 





in any way that can be rationally explained, and hence 
predicted (or postdicted), then the empirical evidence 
suggests that a causal relationship does not exist? 

This provides a useful way of typologizing various 
research designs. Covariation may be observed (a) in 
a single unit diachronically, (b) within a single unit 
synchronically, (c) within a single unit diachronically, 
(d) across units synchronically, (e) across units syn- 
chronically and diachronically, (f) across and within 
units synchronically, or (g) across and within units syn- 
chronically and diachronically, as depicted in Table 1. 

It will be seen that the case study occupies one of 
three possible cells. Type I case studies examine vari- 
ation in a single unit over time, thus preserving the 
primary unit of analysis. Other case studies break down 
this primary unit into subunits, which are then sub- 
jected to covariational analysis—either synchronically 
(type I) orsynchronically and diachronically (type MI). 
These are the three logically conceivable approaches 
to the intensive study of a single unit where that unit is 
viewed as an instance of some broader phenomenon. 
Consequently, when one refers to the case study method 
one is in fact referring to three possible methods, each 
with a different menu of covariational evidence. 

The bottom half of Table 1 lays out various across- 
unit research designs (where some important element 
of the empirical analysis involves comparisons across 
units). Here I have listed the methods most commonly 
identified with these research designs. Across-unit anal- 
ysis without any explicit temporal component (d) is 
usually classified as “cross-sectional” (even though a 
temporal component is usually simulated with indepen- 
dent variables that are assumed to precede the depen- 
dent variable under investigation). When a temporal 
component is included we often refer to the analysis as 
“time-series cross-sectional” (TSCS) or pooled time- 
series (e). When one examines variation across- and 
within units in the same research design one is said to 
be employing a “hierarchical” model (f). Finally, when 
all forms of covariation are enlisted in a single research 
design the resulting method is described as “hierar- 
chical time-series” (if quantitative) or “comparative- 
historical” (if qualitative) (g). Of all crogs-unit research 


3 Note that covariation (or correlation) refers to the mutual relation- 
ship between X and Y; variation, to the behavior of a single variable 
These words are often used interchangeably Hume’s word for this 
was constant and others have been employed as well. I 
should clarify that although the empirical component of a causal argu- 
ment is covariational in nature, successful causal arguments depend 
upon more than just covariation. Among other things, a convmang 
causal account must identify a causal mechanism (see below). 








designs the case study is probably closest to the latter, 
where levels of analysis move up and down more or less 
simultaneously and where a small number of units are 
subjected to intensive study. Indeed, the comparative- 
historical study may be looked upon as a series of 
case studies combined with explicit cross-unit analysis 
(Mahoney and Rueschemeyer 2003). 

Having placed these standard cross-unit research de- 
signs within a covariational typology one must also take 
note that each of these methods might also be em- 
ployed as a case study. A case study may employ cross- 
sectional, TSCS, hierarchical, hierarchical time-series, 
and perhaps even comparative-historical models. It all 
depends upon the proposition in question. Specifically, 
it is the purposes to which these analyses are put, and 
hence the definition of a unit, that determines whether 
or not they are appropriately referred to as case studies. 
This will become clearer as we proceed. 


The N Question 


I have argued that what distinguishes the case study 
method from all other methods is its reliance on covari- 
ation demonstrated by a single unit and its attempt, at 
the same time, to illuminate features of a broader set of 
units. It follows from this that the number of cases (N) 
employed by a case study may be either small or large 
and, consequently, may be evaluated in a qualitative or 
quantitative fashion.‘ 

To see why this must be so let us consider how a case 
study of a single event—say, the French Revolution— 
works. Intuitively, such a study provides an N of one 
(France). If one were to broaden the analysis to include 
a second revolution (e.g., the American Revolution), it 
would be common to describe the study as comprising 
two cases. Yet, as I have argued preliminarily, this is 
a gross distortion of what is really going on. It would 
be more correct to describe such a study as comprising 
two units, rather than two cases, for a case study of a 
single event generally examines that event over time. 
France is observed before, during, and after the event 
to see what changed and what remained the same after 
this cataclysmic event. These patterns of covariation 
offer the empirical clues one needs to reach conclusions 
about causation. They also create multiple cases out of 
that individual unit. N =2, at the very least (e.g., before 
and after a revolution), in a case study of type I. 


4 This section explains and elaborates on a theme first articulated by 
Campbell (1975) 1988, itself a revision of Campbell's earlier perspec- 
tive (Campbell and Stanley 1963). 
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If, instead, there is no temporal variation—if, for ex- 
ample, the French Revolution is examined at a single 
point in time—then the object of investigation will be 
covariational patterns within that unit, a case study of 
type II. Within-unit cases consist of all cases that lie ata 
lower level of analysis relative to the inference under in- 
vestigation. If the primary unit of analysis is the nation- 
state, then within-unit cases might be constructed from 
provinces, localities, groups, or individuals. The pos- 
sibilities for within-unit analysis are, in principle, in- 
finite. Indeed, within-unit N often swamps across-unit 
N, particularly where individuals comprise the relevant 
within-unit case. A single national survey will produce a 
larger sample than any conceivable cross-country anal- 
ysis. Thus, in many circumstances case studies of type I 
comprise a larger N than cross-sectional analyses or 
TSCS analyses. 

Evidently, if a case study combines both temporal and 
within-unit variation, as in case studies of type III, then 
its potential N increases accordingly. This is probably 
the most common genre of case study analysis. 

These covariational facts hold true regardless of 
whether the method is experimental or nonexperimen- 
tal. It is also true of counterfactual reasoning, which 
typically consists of four cases—the actual (as it hap- 
pened) before and after cases and the before and after 
cases as reconstructed through counterfactual reason- 
ing (i.e., with an imagined intervention). In short, the 
case study does not preclude high-N; it simply precludes 
across-unit N (by definition). 

What, then, of the classic N=1 research design, 
which haunts the imaginations of social scientists ev- 
erywhere? This hypothetical research design occupies 
the empty cell in Table 1. Its cell is empty because it 
represents a research design that is not logically fea- 
sible. A single unit observed at a single point in time 
without the addition of within-unit cases offers no evi- 
dence whatsoever of a causal proposition. In trying to 
intuit a causal relationship from this snapshot—a sin- 
gle case without within-unit covariation—we would be 
engaging in a truly random operation, since an infinite 
number of lines might be drawn through that one data 
point. 


Ambiguttles-Necessary and Unnecessary 


The effort in this section has been to clarify what it 
means to conduct a case study. I have argued that a case 
study is most usefully defined as the intensive study 
of a single unit wherever the aim is to shed light on 
a question pertaining to a broader class of units. Al- 
though this definitional exercise does not settle all the 
ambiguities besetting the case study research design, it 
does provide a way of understanding ambiguities that 
remain. Six issues deserve emphasis. 

The first ambiguity concerns the problem of distin- 
guishing different types of covariational evidence. We 
have pointed out that case studies may observe a sin- 
gle unit through time (type I), synchronic within-unit 
variance (type II), or synchronic and diachronic within- 
unit variance (type OI). Notice that types I and MI, but 
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not type I, involve a change in level of analysis, since 
cases are drawn from phenomena within the primary 
unit (as defined by the proposition of interest). Thus, 
some case studies—but not all—involve a change in the 
primary unit of analysis. To complicate matters further, 
case studies often combine observations of the primary 
unit over time (type I) with synchronic and diachronic 
observations of within-unit covariation (types I and 
M). Many case studies are thus hybrids of all three 
research designs. A final complication is introduced by 
the fact that it is often difficult to figure out which sort of 
covariational evidence is being mobilized at a particular 
juncture. The difficulty owes something to the complex- 
ities of within-unit analysis. Although the primary unit 
of analysis is usually clear, within-unit cases are often 
multiple and ambiguous. 

A second source of ambiguity concerns the blurry 
line between a unit that is intensively studied—the case 
study—and other adjacent units that may be brought 
into the analysis in a less structured manner. Recall that 
because a case study refers to a set of units broader than 
the one immediately under study, a writer must have 
some knowledge of these additional units (a) to choose 
a unit for special treatment and (b) identify plausible 
causal hypotheses, Case studies are not immaculately 
conceived; additional units always loom in the back- 
ground. 

To speak of a case study at all it is helpful to introduce 
a distinction between formal and informal units. The 
formal unit is the unit chosen for intensive analysis— 
the person, group, organization, county, region, coun- 
try, or other bounded phenomenon of which the writer 
has in-depth knowledge. Informal units consist of all 
other units that are brought into the analysis in a pe- 
ripheral way, typically in an introductory or concluding 
chapter. Often, these informal units are studied only 
through secondary literature; they are always more su- 
perficially surveyed than the formal unit under study. 
Sometimes, the status of informal units is left implicit. 
This may be warranted in circumstances where the rel- 
evant comparison or contrast between the formal unit 
and other units is obvious or generally accepted. In any 
case, the distinction between a formal and an informal 
unit is always a matter of degrees. The more equality of 
treatment granted to peripheral units, the more a study 
leans toward a cross-unit style of analysis. The greater 
the predominance of a single unit, the more it merits 
the appellation case study. 

A third ambiguity occurs whenever a single work 
combines single-unit and across-unit analysis in a for- 
mal manner. This would be true of comparative- 
historical work as well as any work in which an inten- 
sively studied unit is “nested” within a broader research 
design (Coppedge 2002; Lieberman 2003). Indeed, the 
only thing that distinguishes the single-unit study from 
a sample (which is of course also designed to elucidate 
the features of some larger phenomenon) is that the 
latter is generally understood as composed of more 
than one unit. Case studies, like samples, seek to rep- 
resent, in all ways relevant to the proposition at hand, 
a population of cases. A series of case studies might 
therefore be referred to as a sample; it is a matter of 
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emphasis and of degree. The more case studies one 
has, the less intensively each one is studied, and the 
more confident one is in their representativeness (of 
some broader population), the more likely one is to 
describe them as a sample rather than a series of case 
studies. 

A fourth ambiguity afflicting case studies is that such 
works generally partake of two empirical worlds. They 
are both studies tout court and case studies of something 
more general. As a study, the population is restricted to 
the unit under investigation. As a case study, the pop- 
ulation includes adjacent units—perhaps quite a large 
number of them. This tension is evident in Graham 
Allison’s (1971) renowned work, whose subtitle, Ex- 
plaining the Cuban Missile Crisis, invokes a narrow 
topic, whereas the title, Essence of Decision, suggests 
a much larger topic (government decision-making). 
To complicate matters further, different propositions 
within the same work commonly apply to different pop- 
ulations. Some may be restricted to the unit under study, 
whereas others have a wider ambit. This is clearly the 
case in Allison’s study and is noted explicitly in the 
introduction. 

To complicate matters further, the status of a work 
may change as it is digested and appropriated by a com- 
munity of scholars. “Meta-analyses” are systematic at- 
tempts to integrate the results of individual studies into 
a single quantitative analysis, pooling individual cases 
drawn from each study into a single dataset (with vari- 
ous weightings and restrictions). The ubiquitous “liter- 
ature review” often aims at the same objective, albeit 
in a less synoptic way. Both statistical meta-analyses 
and narrative literature reviews assimilate a series of 
studies, treating each of them as case studies in some 
larger project—whether or not this was the intention 
of the original authors. 

A final ambiguity concerns the sort of argument that 
a case study is intended to prove or demonstrate. One 
species of case study examines a loosely defined general 
topic—war, revolution, gender relations—in a particu- 
lar setting but offers no specific proposition that might 
be applied across a larger set of units. E. P. Thompson’s 
The Making of the English Working Class (1963) is usu- 
ally construed as a case study of class formation. This 
suggests a very general purview, perhaps applicable to 
all countries in the modern era. Yet Thompson does 
not proffer a theory of class formation, aside from the 
rather fuzzy notion of a working class participating in 
its own development. Thus, his work is probably cor- 
rectly understood as a study of how a more general 
phenomenon occurred in one country setting. Virtu- 
ally any intensive study of a relatively bounded topic 
qualifies as a case study in this minimal sense, so long as 
it can be linked with some larger topic via a key word 
(e.g., class formation). Indeed, the narrowest terrains 
sometimes claim the broadest extensions. Studies of a 
war are studies of war, studies of a farming commu- 
nity are studies of farming communities everywhere, 
studies of individuals are studies of humanity, and so 
forth. 

A very different style of argumentation informs 
Benjamin Reilly’s (2001) study of the role of electoral 


systems in ethnically divided societies. Reilly argues, on 
the basis of several case studies, that single-transferable 
vote (STV) electoral systems have a moderating effect 
on group conflict relative to first-past-the-post (FPP) 
electoral systems. Here is a good example of a case 
study that is more than simply suggestive (for other 
examples see Eaton 2003, Elman 1997, Lijphart 1968, 
and Stratmann and Baur 2002). For present purposes, 
what is significant is that both styles of argumentation— 
the suggestive and the falsifiable—are legitimately re- 
ferred to as case studies. Evidently, they have very dif- 
ferent methodological implications. But these implica- 
tions should not be confused with the case study format, 
which can be implemented in interpretivist as well as 
positivist modes. 

Having flagged these six ambiguities of the case 
study, the question is begged: Are they necessary? Are 
they intrinsic to the research design, or might they be 
avoided? 

In many instances, ambiguities can be removed sim- 
ply by more careful attention to the task of specifi- 
cation (Gerring 2001, 90-99). Writers should be clear 
about which propositions are intended to describe the 
unit under study and which are intended to apply to a 
broader set of units. Regrettably, many studies focused 
on some element of politics in the United States frame 
their analysis as a study of politics—by implication, pol- 
itics in general (everywhere and always). One is left to 
wonder whether the study pertains only to American 
politics, to all contemporary polities, or, in varying de- 
grees, to both. Indeed, the slippage between study and 
case study accounts for much of the confusion that we 
encounter when reading single-unit analyses. To the ex- 
tent that propositions—and their attendant cases, units, 
and populations—are stated clearly and explicitly, the 
author avoids confusion and the work attains a higher 
degree of falsifiability. This may involve some sacrifice 
in narrative flow, but it is rightly regarded as the entry 
price of social science. 

However, it hardly seems plausible that the six am- 
biguities noted above arise solely from the sloppy or 
unduly belletristic habits of case study practitioners. 
Indeed, a certain degree of ambiguity is inherent in the 
enterprise of the case study. This pertains, most of all, 
to the study/case study distinction. 

It would be difficult to write a study of a single unit 
that does not also function as a case study, and vice 
versa, for reasons already explored. Indeed, it may be 
difficult to neatly separate the study and case study 
components of a work (e.g., into different chapters or 
differently labeled propositions). The reason for this 
structural ambiguity is that the utility of the single-unit 
study rests partly on its double functions. One wishes 
to know both what is particular to that unit and what 
is general about it. It should be kept in mind that case 
studies often tackle subjects about which little is pre- 
viously known or about which existing knowledge is 
fundamentally flawed. The case study typically presents 
original research of some sort. Indeed, it is the opportu- 
nity to study a single unit in great depth that constitutes 
one of the primary virtues of the case study method 
(see below). If a writer were to restrict herself only to 
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TABLE 2. Single-Unit Versus Cross-Unlt Research Designs: Tradeoffs and 

















Affinitles 
Affintty 
Case Study Cross-Untt Study 

1 Type of Inference (a) Descriptive + 

(b) Causal + 
2 Scope of proposition (a) Depth + 

(b) Breadth + 

(c) Boundedness + 
3. Unit homogenetty (a) Case comparability (Internal) + 

(b) Representativeness (external) + 
4. Causal Insight (a) Causal mechanisms + 

(b) Causal effect + 
5. Causal relatonship (a) Invariant + 

(b) Probabilistic + 
6. Strategy of research (a) Exploratory (theory generation) + 

) Confirmatory (theory testing) + 

7. Useful variance (a) For only a single unit + 

(b) For many units + 
8. Ontology _ indeterminate 





elements of the unit that were generalizable (i.e., if she 
rigorously maintains the “case study” mode of anal- 
ysis), a reader might justifiably complain. Such rigor 
would clarify the population of the primary infer- 
ence, but it would also constitute a considerable waste 
of scholarly resources. Imagine a study of economic 
growth that focuses on Mauritius as a case study yet re- 
fuses to engage causal questions unless they are clearly 
applicable to other countries (since this is a case study 
of a more general phenomenon, growth). No mention 
of factors specific to the Mauritian case is allowable; 
all proper nouns are converted into common nouns 
(Przeworski and Teune 1970). Such a study seems un- 
duly narrow; its conclusions may mislead. 

Indeed, it is often difficult to tell which of the many 
features of a given unit are typical of a larger set of 
units (and hence fodder for generalizable inferences) 
and which are particular to the unit under study. The ap- 
propriate response to such ambiguity is for the writer to 
report all facts and hypotheses that might be relevant— 
in short, to overreport. Much of the detail provided by 
the typical case study may be regarded as “field notes” 
of possible utility for future researchers—perhaps with 
a rather different set of inferences in mind. Again, it 
seems justifiable for case studies to work on two levels 
simultaneously, the unit itself and some broader class 
of (perhaps difficult to specify) units. 

As a general observation we might say that methods, 
strictly defined, tend to lose their shape as one looks 
closer at their innards. A study merges into a case study, 
a single-unit study merges into a study of a sample, a 
longitudinal study merges into a latitudinal study, infor- 
mal cases merge into formal cases, and so forth. Meth- 
ods that seem quite dissimilar in design bleed into one 
another when put into practice. There are few “pure” 
methods. And this is probably a good thing. Chastity is 
not necessarily an attribute to be cherished in research 
design. 
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It has been demonstrated that the difference between 
a case study and a study (tout court) is rarely clear-cut. 
Indeed, the case study is probably best understood as 
an ideal-type rather than a method with hard-and-fast 
rules. Yet the fact that the case study is fuzzy around 
the edges does not mean that it is lacking in distinctive 
characteristics. When considered as an ideal type the 
case study research design, like all research designs, ex- 
hibits characteristic strengths and weaknesses relative 
to its across-unit cousin. These pertain to the type of 
inference under consideration (descriptive or causal), 
the scope of the proposition (its depth, breadth, and 
boundedness), the degree of unit homogeneity found 
among cases and between the sample and the popula- 
tion, the sort of causal insight desired (causal effect 
or causal mechanism), the strategy of research (ex- 
ploratory or confirmatory), and the kind of empirical 
evidence available. Tradeoffs along these seven dimen- 
sions are summarized in Table 2. Ontological presuppo- 
sitions are also important but of indeterminate import 
(as indicated in Table 2). 

It should be underlined that these tradeoffs rep- 
resent methodological affinities, not invariant laws. 
Exceptions can be found to each of the general tenden- 
cies identified here. Even so, the strengths and weak- 
nesses often noted in case study research, reproduced 
in many subfields and disciplines over the course of 
many decades, are not the product of a purely stochas- 
tic process. General patterns suggest general inter- 
pretations. 

I should also emphasize that each of these trade- 
offs carries a ceteris paribus caveat. Case studies are 
more useful for forming descriptive inferences, all other 
things being equal. Since ceteris is not always paribus 
the reader should notjump to any conclusions about the 
research design appropriate to a given setting without 
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considering the single-unit/cross-unit options available 
within that research context. 

Finally, readers should note that although many of 
my examples are drawn from the subfield of compara- 
tive politics, with nation-states as the principal unit of 
concern, these examples could be replicated with other 
units and in other research settings. The problem of the 
case study is not limited to a single subfield. 


Type of Inference: Descriptive Versus Causal 


Descriptive inference remains an important, if under- 
valued, trope within the social sciences (Gerring 2001, 
chap. 6; King, Keohane, and Verba 1994, chap. 2). Thus, 
it is not at all pejorative to observe that there is a 
methodological affinity between descriptive inference 
and case study work. When one is examining correla- 
tive relationships or proximate causal relationships the 
case study format seems less problematic and is often 
highly informative. Indeed, many of the most famous 
case studies in anthropology, political science, and so- 
ciology are primarily descriptive in orientation (e.g., 
Fenno 1978, Hartz 1955, Lynd and Lynd [1929] 1956, 
Malinowski [1922] 1984, and Whyte [1943] 1955). How 
can we understand this affinity? 

What? and How? questions are easier to answer with- 
out recourse to cross-unit analysis than Why? questions. 
The simplest genre of descriptive case study asserts that 
the unit under study (A) is like, or unlike, other similar 
units (B and C). A more complicated descriptive case 
study might assert a classificatory relationship among 
A, B, and C, such that A falls into a certain typological 
relationship with B and C. The latter, of course, is more 
complicated and is more likely to require some explicit 
cross-unit examination. However, a descriptive infer- 
ence does not make any assertions about causal rela- 
tionships (beyond the most proximal) occurring within 
A, B, and C. In this sense, descriptive inference is sim- 
pler, methodologically speaking. 

To be sure, descriptive case study propositions are 
implicitly comparative and these comparisons must 
have a cross-unit reference point. To say “green” is 
to imply “not blue.” However, it is usually fairly easy 
to make such comparisons without conducting a study 
of the presumed variation. One knows what blue is 
without going in search of blue cases. This illustrates 
something important about the structure of descrip- 
tive propositions in social science. They are held to- 
gether by language—by ordinary or technical terms 
and their definitions. When describing a phenomenon 
one is usually comparing it to an ideal-type definition. 
American political culture is “liberal” or “republican” 
insofar as it conforms to standard definitions of these 
two concepts (Smith 1997). To describe is to categorize, 
and to categorize is to rely on language to divide up 
the world into identifiable entities. Language, in this 
sense, provides the “laws” that allow for consistent in- 
terpretations of the phenomenal world (Sartori 1984). 
Chairs are different from tables in predictable ways; 
labeling entities as one or the other thus allows the use 
of nomothetic descriptive generalizations without ac- 
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tually studying the phenomenon in question each time 
one sallies forth with a new pronouncement about the 
world. 

It should be clear that the affinity between case study 
research and descriptive inference does not denigrate 
the possibility of causal analysis through case study re- 
search, of which one might cite many illustrious exam- 
ples. Indeed, the discussion that follows is primarily 
concerned with propositions of a causal nature. My 
point is simply that it is easier to conduct descrip- 
tive work than to investigate causal propositions while 
working in a case study mode. 


Scope of Proposition: Breadth and 
Boundedness Versus Depth 


The variable utility of the case study is also partly a 
product of the scope of the causal argument that a 
writer wishes to prove or demonstrate. Arguments that 
strive for great breadth and boundedness are in greater 
need of cross-unit cases; causal arguments restricted to 
a small set of units more plausibly subsist on the basis 
of a single-unit study. The extensive/intensive tradeoff 
is commonsensical. Insofar as one wishes to make an 
argument about the universe of nation-states, one is 
on surer ground if one has studied more than one of 
those nation-states. A case study of France probably 
offers better evidence for an argument about Europe 
than for an argument about the world. Propositional 
breadth and evidentiary breadth generally go hand in 
hand. The evidence should be commensurate with the 
scope of the proposition. 

This statement, like all others, has a ceteris paribus 
caveat. There is a variety of ways in which single- 
unit studies can credibly claim to provide evidence 
for causal propositions of broad reach—e.g., by choos- 
ing cases that are especially representative of the phe- 
nomenon under study or by choosing “crucial” cases 
(Eckstein [1975] 1992). Even so, a proposition with 
a narrow scope will be more conducive to case study 
analysis than a proposition with a broad purview. The 
breadth of an inference thus constitutes one factor, 
among many, in determining the utility of the case study 
mode of analysis. 

Similarly, the boundedness of an inference is often 
related to the degree to which it exploits cross-unit 
variance. Precisely because their focus is so tight, case 
studies often produce inferences with poorly defined 
boundaries. It is clear that an inference extends be- 
yond the unit under study, but it is often unclear how 
far the inference extends, Cross-unit research may also 
suffer from poorly bounded inferences; however, it is 
less likely to do so since the research design allows one 
to test the limits of an inference in an explicit fashion. 


5 Granted, cross-unit exammation may be useful, particularly when 
the terms in question are broad and/or ambiguous If, for exam- 
ple, one is examining American political culture as an example of a 
broader class of political cultures deemed “liberal,” ıt will be difficult 
to reach firm conclusions without a larger sample of naton-states 
Even so, it will be easier to describe the subject without cross-unit 
reference-points than to explain it. 
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The sharpness of the boundaries of the population— 
what is and is not “covered” by an inference—is easier 
to establish when units in the population also constitute 
cases under study. 

By the same token, one of the primary virtues of the 
case study method is the depth of analysis that it offers. 
One may think of depth as referring to the detail, rich- 
ness, completeness, wholeness, or degree of variance 
that is accounted for by an explanation. The case study 
researcher’s complaint about the thinness of cross-unit 
analysis is well taken; such studies often have little to 
say about individual cases. Otherwise stated, cross-unit 
studies are likely to explain only a small portion of the 
variance with respect to a given outcome or to approach 
that outcome at a very general level. A cross-unit study 
might be satisfied to explain the occurrence of a revo- 
lution in Country A, while a case study of Country A 
might also strive to explain specific features of that 
event—why it occurred when it did and in the way that 
it did. 

I shall return to the advantages of taking an in-depth, 
Gestalt-oriented look, at a single unit. For present pur- 
poses, the conclusion is simple. Research designs invari- 
ably face a choice between knowing more about less 
and knowing less about more. The case study method 
may be defended, as well as criticized, along these lines 


(Ragin 2000, 22). 


Unit Homogenelty: Case Comparability 
Versus Representativeness 


Single-unit studies provide cases that are likely to be 
comparable to one another. After all, they are all drawn 
from the same unit (by definition). Cases drawn from 
different units, in contrast, often force the researcher 
to make heroic assumptions about the comparability 
of concepts and causal relationships across the chosen 
cases. After all, they are different units. 

Yet the strength of the case study also suggests a cor- 
responding weakness. Single-unit research designs of- 
ten fall short in their representativeness—the degree to 
which causal relationships evidenced by that single unit 
may be assumed to be true for a larger set of (unstud- 
ied) units. Are the men chosen by Robert Lane (1962) 
typical of the American male, white, immigrant, work- 
ing class? Is Middletown representative of America 
(Lynd and Lynd [1929] 1956)? 

The tradeoff between comparability and representa- 
tiveness is a general feature of cross-unit sample size. 
Naturally, there are many ways to overcome the cor- 
responding problems of comparability and representa- 
tiveness in both case study and non-case study research. 
Even so, it should be pointed out that the addition of 
units to a research design can hardly increase the case 
comparability of a sample. Similarly, it is unlikely that 
the addition of units will decrease the representative- 
ness of a sample (though it is certainly possible). Thus, it 
seems appropriate to regard the tradeoff between com- 
parability and representativeness, like other tradeoffs, 
as intrinsic to the study/case study choice of research 
design. 
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Causal Insight: Causal Effect Versus 
Causal M anisms 


A fourth tradeoff concerns the sort of insight into cau- 
sation that one is able to achieve by examining empir- 
ical evidence of a particular X:Y relationship. Tradi- 
tionally, quantitative researchers have been concerned 
primarily with the estimation of a causal effect—the ef- 
fect on Y of a given change in X, taking all background 
circumstances into account (King, Keohane, and Verba 
1994, 81-82)—and, equally important, an estimate of 
the probability of that effect, captured statistically in 
the error term. Assuming that the causal relationship 
is probabilistic in nature (see discussion below), the 
researcher must examine several instances of this phe- 
nomenon to gauge the average causal effect of Xon Y 
and the random element of that variation. The calcu- 
lation of a causal effect presumes the investigation of 
cross-unit variation precisely because, for a probabilis- 
tic cause, one cannot assume that the behavior of one 
unit will be indicative of the behavior of other units. 
Units may behave differently. Thus, the example of a 
single unit, even if subjected to iterated testing, is not 
a good way to estimate causal effects and is certainly 
inadequate to the task of estimating probabilities. 

But causal arguments depend not only on measur- 
ing causal effects They also presuppose the identifi- 
cation of a causal mechanism (Gerring, forthcoming; 
Hedstrom and Swedberg 1998). X must be connected 
with Y in a plausible fashion; otherwise, it is unclear 
whether a pattern of covariation is truly causal in 
nature. The identification of causal mechanisms hap- 
pens when one puts together general knowledge of the 
world with empirical knowledge of how X and Y inter- 
relate. It is in the latter task that case studies enjoy a 
comparative advantage. 

Case studies, if well constructed, allow one to peer 
into the box of causality to the intermediate causes lying 
between some cause and its purported effect. Ideally, 
they allow one to “see” X and Y interact—Hume’s 
billiard ball crossing the table and hitting a second ball. 
Clayton Roberts (1996, 66) describes process-tracing 
as “the minute tracing of the explanatory narrative to 
the point where the events to be explained are mi- 
croscopic and the covering laws correspondingly more 
certain” (see also George and Bennett 2004). Often, 
the connections between a putative cause and its ef- 
fect are rendered visible once one has examined the 
motivations of the actors involved. Intentionality is an 
integral part of causal analysis, as interpretivists have 
been claiming for some time (Taylor 1970). Similarly, 
the investigation of a single unit may allow one to test 
the causal implications of a theory, thus providing cor- 
roborating evidence for a causal argument. This is of- 
ten referred to as pattern-matching. Here, the theory 
of primary interest “generates predictions or expecta- 
tions on dozens of other aspects of the culture, and [the 
writer] does not retain the theory unless most of these 
are also confirmed. In some sense, he has tested the the- 
ory with degrees of freedom coming from the multiple 
implications of any one theory” (Campbell [1975] 1988, 
380). 
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One can readily see why the investigation of 
causal mechanisms (including both process-tracing and 
pattern-matching) is commonly associated with the 
case study research design. The in-depth analysis of a 
single unit is useful in elucidating causal mechanisms 
because its characteristic style of evidence-gathering— 
over-time and within-unit variation—is likely to pro- 
vide clues into what connects a purported X to a par- 
ticular Y. Cross-unit variation, in contrast, is often mute 
with respect to causal mechanisms. The Xs and Ys may 
be at a considerable remove from one another; one 
does not know, or must simply intuit, what connects 
the dots. 

The caveat here is that cross-unit evidence is not al- 
ways or necessarily mute with respect to causal mech- 
anisms, and single-unit analysis is not always revela- 
tory on this score. A cross-unit analysis is opaque if, 
or insofar as, (a) there is great causal distance be- 
tween the X and the Y variables, (b) the interven- 
ing variables thought to lie between X and Y cannot 
be tested empirically in a cross-unit research design, 
and (c) the X:Y relationship cannot be intuited from 
common sense or deductive reasoning. There is a gen- 
eral perception—common at least among protagonists 
of the case study—that cross-unit studies entail large 
black boxes with no peepholes and that, therefore, they 
must be supplemented by the in-depth analysis of key 
units. This is not always the case. One can think of 
plenty of studies in which the relationship is quite clear, 
and where a case study would be superfluous. And one 
can point out any number of studies in which interven- 
ing variables are in fact investigated with a cross-unit 
research design.® Again, it is important to stress that we 
are examining typical, not definitional, characteristics 
of the case study. Ceteris paribus, case studies are more 
likely to shed light on causal mechanisms and less likely 
to shed light on true causal effects. 


Causal Relationship: Invarlant 
Versus Probabilistic 


Causal arguments may be either invariant (“determin- 
istic”) or probabilistic. Invariant causal relationships 
are asserted to be always true, given some set of back- 
ground circumstances. They take the form of necessary, 
sufficient, or necessary and sufficient arguments. Proba- 
bilistic arguments, in contrast, are true in a probabilistic 


6 For example, a vast literature addresses the putative causal con- 
nections between trade openness and the welfare state The usual 
empirical finding 1s that more open economies are associated with 
higher social welfare spending. The question then becomes why such 
a robust correlation exists. What are the plaumble interconnections 
between trade openness and social welfare spending? One possible 
causal path, suggested by David Cameron (1978), is that increased 
trade openness leads to greater domestic economic vulnerability to 
external shocks (due, for instance, to changing terms of trade). In 
subsequent work, writers have attempted to test this hypothesis by 
examining the correlation between terms of trade and social welfare 
spending. If Cameron's causal mechanism is operatrve, one should 
find a robust correlation between these two variables in a cross- 
national regression. As it happens, results are equivocal (Aleama, 
Glaeser, and Sacerdote 2001). The pomt is, writers can and do exploit 
cross-unit variation to test assumptions about causal mechanisms. 


fashion; a cause increases the likelihood of an outcome 
and/or the magnitude of a (scalar) outcome.’ 

Ceteris paribus, case study research designs have 
an easier time addressing invariant causes. Consider 
that a necessary or sufficient causal proposition can be 
disproved with a single case study (Dion 1998). Proving 
an invariant causal argument generally requires more 
cross-unit cases. However, it is not nearly as comph- 
cated as proving a probabilistic argument for the simple 
reason that one assumes invariant relationships; conse- 
quently, the single unit under study carries more weight. 
Where the causal relationship is assumed to be proba- 
bilistic, on the other hand, case study evidence is easier 
to dismiss; it is, after all, just one example of a general 
phenomenon assumed to have a stochastic component. 
Recall that an error term may be incorporated into case 
study work since the N of a case study is indeterminate, 
however, this error term remains a property of single- 
unit analysis. In contrast, the error term in a large-N 
cross-unit analysis represents the stochastic nature of 
an (assumed) probabilistic relationship. 


Strategy of Research: Exploratory 
Versus Conflrmatory 


Social science research involves a quest for new theo- 
ries as well as a testing of existing theories, a series of 
“conjectures and refutations” (Popper 1969). Regret- 
tably, social science methodology has focused almost 
exclusively on the latter. The former is quickly dis- 
missed as a matter of guesswork, inspiration, or luck—a 
leap of faith, in any case, and hence a poor subject for 
methodological reflection. Yet it will readily be granted 
that many works of social science, including most gen- 
erally acknowledged classics, are seminal works. Their 
classic status derives from a new idea, a new perspec- 
tive, that is subsequently subjected to more rigorous 
analysis. Indeed, it is difficult to devise a program of 
falsification the first time a new theory is proposed. 
Path-breaking research is, by definition, exploratory. 
Subsequent research on that topic is confirmationist 
insofar as its primary task is to verify or falsify a preex- 
isting hypothesis or a set of hypotheses. Thus, the world 
of social science may be usefully divided according 
to the predominant strategy of research undertaken, 
exploratory or confirmatory/disconfirmatory (Gerring 
2001, chap. 10). These constitute two moments of em- 
pirical research, a generative moment and a skeptical 
moment, each of which is essential to the progress of a 
discipline. 

Case studies enjoy a natural advantage in research 
of an exploratory nature. These same advantages, 


7 I avoid the term “determinmm” since it has multiple meanings, only 
one of which—invariance—is relevant here. I assume that to make 
an invariant causal argument does not commit one to a view that all 
causes are perfectly determined; some causes may be invariant and 
others probabilishc. Useful discussions of invariance/determinism 
can be found ın Adcock 2002, Dion 1998, 141, Goertz and Starr 
2003, and Waldner 2002. Goertz (2003, 76-94) includes a sample of 
150 necessary condition hypotheses deployed in various fields of the 
social sciences (Goertz 2003, 76-94). 
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however, often serve as impediments in work of a con- 
firmatory nature. Let us explore why this might be so.8 

Traditionally, scientific methodology has been identi- 
fied with the segregation of conjecture and refutation; 
one should not be allowed to contaminate the other. 
Yet in the real world of social science, inspiration arises 
from perspiration. “Lightbulb” moments build on a 
close engagement with the particular facts of a partic- 
ular case (unit). Ragin (1997) notes that case study re- 
search is all about “casing”—defining the topic, includ- 
ing the hypothesis(es) of primary interest, the outcome, 
and the set of cases that offer relevant information vis- 
a-vis the hypothesis. It is a highly circular process. A 
study of the French Revolution may be conceptualized 
as a study of revolution, of social revolution, of revolt, 
of political violence, and so forth. Each of these top- 
ics entails a different population and a different set of 
causal factors. A good deal of authorial intervention is 
necessary in the course of defining a case study topic, 
for there is a great deal of evidentiary leeway. Yet the 
very “subjectivity” of case study research allows for the 
generation of a great number of hypotheses, insights 
that might not be apparent to the cross-unit researcher 
who works with a thinner set of empirical data across 
a large number of units and with a more determinate 
(fixed) definition of cases, variables, and outcomes. It 
is the very fuzziness of case studies that grant them 
a strong advantage in research at exploratory stages, 
for the single-unit study allows one to test a multitude 
of hypotheses in a rough-and-ready way. Nor is this an 
entirely conjectural process. The covariational relation- 
ships discovered among different elements of a single 
unit have a prima facie causal connection: They are all 
at the scene of the crime. This is revelatory when one 
is at an early stage of analysis, for there is no identi- 
fiable suspect and the crime itself may be difficult to 
discern. The fact that A, B, and C are present at the 
expected times and places (relative to some outcome 
of interest) is sufficient to establish them as indepen- 
dent variables, Proximal evidence is all that is required. 
Hence, the common identification of case studies as 
“plausibility probes,” “pilot studies,” “heuristic stud- 
ies,” and “theory-building” exercises (Eckstein [1975] 
1992; Ragin 1992, 1997; Rueschemeyer and Stephens 
1997). 

A multiple-unit study, in contrast, generally allows 
for the testing of only a few hypotheses but does so 
with a somewhat greater degree of confidence, as is 
appropriate to work of a confirmatory nature. There is 
less room for authorial intervention because evidence 
gathered from a cross-unit research design can only be 
interpreted in a limited number of ways. Another way 
of stating the point is to say that whereas case studies 
lean toward Type 1 errors (falsely rejecting the null hy- 
pothesis), cross-unit studies lean toward Type 2 errors 
(failing to reject the false null hypothesis). Perhaps this 
explains why case studies are more likely to be theory- 
generating, whereas cross-unit studies toil in the prosaic 
but highly structured field of normal science. 


8 For discussion of this tradeoff ın the context of economic growth 
theory see Temple 1999, 120 
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I do not mean to suggest that case studies never 
serve a confirmatory role. As discussed, evidence drawn 
from a single unit may disconfirm a necessary or suffi- 
cient hypothesis. Case studies are also often useful in 
conjunction with a cross-unit study for the purpose of 
elucidating causal mechanisms, as discussed. However, 
general theories rarely offer the kind of detailed and 
determinate predictions on within-unit variation that 
would allow one to reject a hypothesis through pattern- 
matching (without additional cross-unit evidence). The 
point is, theory confirmation/disconfirmation is not the 
case study’s strong suit. The selection of “crucial” cases 
cannot overcome the fact that cross-unit N is minimal. 
We are unlikely to reject a hypothesis, or to consider it 
definitively proved, on the basis of the study of a single 
unit, particularly if the hypothesis has behind it a corpus 
of scholarly work. Eckstein himself acknowledges that 
his argument for case studies as a form of theory con- 
firmation is largely conjectural. At the time of writing, 
several decades ago, he could not point to any existing 
study where a crucial case study had performed the 
heroic role assigned to it (Eckstein [1975] 1992, 172). 
I suspect that this is still more or less true. Indeed, it 
is true even of experimental case studies in the natural 
sciences (Campbell and Stanley 1963, 3). A single case 
study is still a single-shot, a single piece of evidence 
lying at the same level of analysis as the proposition 
itself. 

The tradeoff between exploratory and confirmatory 
research helps us to reconcile the enthusiasm of case 
study researchers and the skepticism of case study crit- 
ics. They are both right, for the looseness of case study 
research is a boon to new conceptualizations just as it 
is a bane to falsification. The problem is that work of 
an exploratory nature, although it may receive praise 
from the discipline, is unappreciated, and greatly un- 
dertheorized, by methodologists. 


Useful Variance: Single- Versus Multiple-Unit 


The analysis of any causal relationship hinges on the 
counterfactual assumption—that without X (or with 
more or less of X), Y would be different. In investi- 
gating this assumption the preferred research designs 
are, in order of preference: (a) laboratory or field ex- 
periments, (b) “natural” experiments (where a single 
unit undergoes unmanipulated change through time 
that approximates a true experiment), (c) thought ex- 
periments (counterfactuals), or (d) statistical controls 
(a quasi-experimental method of neutralizing irrele- 
vant variables so as to isolate the true causal effects 
of one or a few factors of theoretical interest). The 
implication of this hierarchy of research designs is that 
a laboratory/field experiment, natural experiment, or 
thought experiment involving a single unit may be more 
useful than multiple units that attempt to mimic the 
virtues of the experimental method with purely “statis- 
tical” evidence. 

For example, in investigating the relationship be- 
tween campaign efforts and voter turnout one might 
be more convinced by a field experiment conducted in 
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a single community than by multiple cross-community 
studies or individual poll data that rely on a host of 
more or less unsatisfactory quasi-experimental con- 
trols (Gerber and Green 2000). Similarly, in investigat- 
ing the role of electoral systems in conditioning pub- 
lic policy outcomes one might be more convinced by 
a single natural experiment—a change in a country’s 
electoral system—than by a cross-country study em- 
ploying statistical controls to examine these complex 
causal relationships (Horowitz 1985, 602). Even where 
one is forced to rely solely on counterfactual thought- 
experiments to evaluate causal claims, one might still 
prefer the case study mode if useful variance is not 
available across units. The time-honored question of 
whether early democratization leads to a quiescent 
working class and lower levels of social welfare devel- 
opment (Lipset 1963) is difficult to investigate cross- 
nationally for the simple reason that only one coun- 
try granted suffrage to the (male) working class prior 
to industrialization. In this circumstance, a historical 
study focused on the United States (i.e., a case study) 
may provide the most compelling evidence of a general 
proposition. 

To clarify, the issue is not whether natural 
experiments—or real experiments for that matter—are 
desirable. They are always desirable. The issue is how 
many experiments are available and how (truly) exper- 
imental are they? If many experiments are possible, 
as they typically are in natural science settings, then 
a crogs-unit research design is probably justified. But 
this is uncommon in social science settings. Similarly, 
returning to our previous examples, if nation-states 
exhibit substantial variance on the independent vari- 
ables of interest—early democratization and electoral 
change—then one would probably choose a cross-unit 
research design (with a time-series dimension). This 
would not preclude a supplementary case study, but 
such a case study would undoubtedly carry less weight. 
In the event, however, one is thrown back on more 
primitive expedients. Much of the utility of the case 
study is “contextual” in this special sense. The sort 
of natural or experimental variation that would make 
cross-unit analysis fruitful is lacking; such units simply 
do not exist and cannot be generated. 

This helps to explain why case studies often focus 
on rare (“historical”) events. Let me define an event 
provisionally as an instance of substantial and rela- 
tively quick change in an independent or dependent 
variable of theoretical interest. Now imagine a uni- 
verse of empirical data in which such events occur in 
only 13 instances. This approximates the situation of 
scholars whose work focuses on the phenomenon of 
social revolution (e.g., Skocpol 1979, 287). In princi- 
ple, the field of empirical evidence is virtually bound- 
less, including all nation-states over the past three 
centuries. (Let us suppose that this constitutes the do- 
main of the inference.) Yet useful variation is severely 
limited. The world does not provide many units that 
have experienced revolutionary upheaval. Arguably, 
one learns more about this phenomenon through an 
intensive case study of France, or several case stud- 
ies focused on the 13 or so social-revolutionary coun- 


tries (the comparative-historical method), than by a 
cross-country TSCS research design. The same prob- 
lem affects the study of many social science questions 
where only one unit, or a small number of units, un- 
dergoes the event that is to be explained: early in- 
dustrialization (England and the Netherlands), fascism 
(Germany, Italy), the use of nuclear weapons (United 
States), world war (WWI, WWI), democratization in 
Africa (Botswana, Mauritius, South Africa), single- 
nontransferable vote (SNTV) electoral systems (prere- 
form Japan, Jordan, Taiwan, Vanuatu), and settler soci- 
eties (Australia, Canada, New Zealand, South Africa, 
United States). 


Ontological Considerations 


Thus far, I have looked upon the choice between case 
study and non-case study methods as a matter of logic 
(the logic of causal inquiry) and empirics (the inves- 
tigation of the empirical world). However, this choice 
rests also on ontological presuppositions. An ontology 
is a vision of the world as it really is, a more or less co- 
herent set of assumptions about how the world works, 
a research Weltanschauung analogous to a Kuhnian 
paradigm (Hall 2003; Kuhn [1962] 1970). As a result 
of its all-embracing character and ambiguous claims, 
an ontology cannot be proved or disproved, at least 
not in the usual (verificationist or falsificationist) sense. 
Although it seems odd to bring ontological issues into 
a discussion of social science methodology, it may be 
granted that social science research is not a purely em- 
pirical endeavor. What one finds is contingent upon 
what one looks for, and what one looks for is to some 
extent contingent upon what one expects to find. 

The significance of ontological presuppositions be- 
comes apparent whenever questions of breadth, case 
comparability, and representativeness are vetted. Con- 
sider the possibility that a sample of units will become 
more and more dissimilar (less comparable) as that 
sample is enlarged, but with no obvious break-points. 
Here it evidently is the choice of the researcher how to 
define the population of a given inference and, hence, 
what the relevant units of analysis will be. Where do 
like cases end and unlike cases begin? 

If adjacent units are thought to be entirely noncom- 
parable, the case study method is impossible. The per- 
fectly ideographic universe displays such uniqueness 
among units that absolutely nothing can be learned 
about one unit by studying another. The notion of 
a “case study” is nonsensical. At the other extreme, 
where all units of a given type are perfectly comparable, 
the case study is equally nonsensical. Why focus on a 
single unit when other units will do just as well? This is 
the nomothet’s way of looking at things. 

Case study researchers are situated between these 
two extremes. They are dubious about the viabil- 
ity of comparisons drawn across many apparently di- 
verse units. “Variable-oriented” research involves “ho- 
mogenizing assumptions” (Ragin 2000, 35). Yet they 
are equally suspicious of the claim, implicit in much 
historical and anthropological work, that each unit is 
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FIGURE 1. The Ontology of Case Study 
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unique. The case study thus occupies a tenuous onto- 
logical ground midway between ideographic and nomo- 
thetic extremes, as depicted in Figure 1 (see Collier and 
Collier [1991] 2002, 13-14). 

To be sure, there is no profit in dwelling on ontologi- 
cal differences between case study and non-case study 
researchers. Ontological debates are, by definition, ir- 
resolvable. Once one has defended one’s position as 
a matter of ontology, further discussion is superfluous 
except as it might bear upon matters of logic and co- 
herence. If social science is understood as an evidence- 
based form of inquiry then matters of ontology are 
simply not relevant or are only tangentially relevant. 
Nonetheless, insofar as our ontological presuppositions 
influence our construction of cases, we had best be cog- 
nizant of this fact. Indeed, the middle-range position 
of case study research on this crucial question may 
help to account for its ambiguous position in the so- 
cial sciences. It is neither fish nor fowl, ontologically 


speaking. 


CONCLUSION 


This paper has attempted to shed light on the apparent 
disjuncture between an often-maligned methodology 
and a heavily practiced method, both of which go by 
the name of case study. The torment of the case study 
begins with its definitional penumbra. Frequently, this 
key term is conflated with a set of disparate method- 
ological traits that are not definitionally entailed. The 
first task of the essay, therefore, was to craft a narrower 
and more useful concept for purposes of methodologi- 
cal discussion. The case study, I argued, is best defined 
as an intensive study of a single unit with an aim to 
generalize across a larger set of units. 

In other respects, the predicament of the case study is 
not merely definitional but rather inheres in the method 
itself. To study a single unit with intent to shed light 
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upon other units brings in its train six methodologi- 
cal ambiguities that are, to some extent, ineradicable. 
First, case studies may build upon a variety of covari- 
ational evidence; there is no single type of case study 
evidence, but rather three (see Table 1). Second, case 
studies assume a distinction between formally and in- 
formally studied units that is never entirely clear since 
the latter must be brought into the analysis in some 
fashion but would compromise the notion of a case 
study if fully integrated. Third, individual case studies 
are often grouped together in a single study, thus con- 
founding the distinction between single- and cross-unit 
analysis. Fourth, case studies usually perform a double 
function; they are studies (of the unit itself) as well 
as case studies (of a broader class of units). Finally, 
the inference(s) pursued by a case study may be either 
illustrative or falsifiable. These methodological issues 
bedevil the case study research design. And they shall 
continue to do so, for they are inherent in the enterprise. 

The travails of the case study within the discipline 
of political science are rooted, finally, in an insuffi- 
cient appreciation of the methodological tradeoffs that 
this method calls forth. I have argued that at least 
seven characteristic strengths and weaknesses must be 
considered. Ceteris paribus, case studies are generally 
more useful (1) when inferences are descriptive rather 
than causal, (2) when propositional depth is prized 
over breadth and boundedness, (3) when (internal) 
case comparability is given precedence over (external) 
case representativeness, (4) when insight into causal 
mechanisms is more important than insight into causal 
effects, (S) when the causal proposition at issue is invari- 
ant rather than probabilistic, (6) when the strategy of 
research is exploratory, rather than confirmatory, and 
(7) when useful variance is available for only a single 
unit or a small number of units. (Ontological consider- 
ations also come into play when one chooses between 
a single-unit and a cross-unit research design, though 
the methodological implications of these assumptions 
are equivocal.) 

Of these seven considerations, the last is perhaps the 
most important. There is little point in pursuing cross- 
unit analysis if the units in question do not exhibit vari- 
ation on the dimensions of theoretical interest and/or 
the researcher cannot manage to hold other, potentially 
confounding, factors constant. Of course, a preliminary 
canvassing of these units is necessary to perceive these 
facts about the sample. But beyond this, cross-unit anal- 
ysis may be of little consequence. By the same token, 
individual units may offer useful variance or they may 
not. In any case, the most important single question 
researchers should ask themselves as they contemplate 
various research designs is which one of these options 
most closely approximates the experimental ideal. 

Throughout this discussion I have avoided the discus- 
sion of methodological considerations that are practical 
in nature or rooted in specific research contexts. My 
concern has been with general methodological issues. 
Before concluding this essay, however, it is important 
to acknowledge that practical and contextual consid- 
erations are often paramount in the choice between a 
case study and a non-case study research format. 


American Political Science Review 


Vol. 98, No. 2 


American Political Science Review 0 


The collection of original data is typically more dif- 
ficult in cross-unit analysis than in case study analysis, 
involving greater expense, greater difficulties in iden- 
tifying and coding cases, learning foreign languages, 
traveling, and so forth.? Whatever can be done for a 
set of units can usually be done more easily for a sin- 
gle unit. Similarly, case studies commonly afford multi- 
ple observations of a single case, thus providing firmer 
evidence of the factual accuracy of a given proposi- 
tion than would be possible in the analogous cross-unit 
study. 

A een practical consideration concerns the state 
of research on a given topic. Here one is concerned with 
the “triangulation” of evidence rather than the ease 
of evidence-gathering. Social scientists are accustomed 
to the idea that research occurs within the context of 
an ongoing tradition. All work is dependent, for the 
identification of topic, argument, and evidence, on this 
research tradition. What we need to know, and hence 
ought to study, is to some extent contingent upon what 
is already known. It follows from this that the utility of 
case study research relative to non-case study research 
is to some extent the product of the state of research 
within a given field. A field dominated by case studies 
may have little need for another case study. A field 
where cross-unit studies are hegemonic may be des- 
perately in need of in-depth studies focused on single 
units. 

Indeed, much of the debate over the utility of the case 
study method has little to do with the method itself and 
more to do with the state of current research in that 
field. If both case study and cross-unit methods have 
much to recommend them (the implicit argument of 
this paper), then both ought to be pursued—perhaps 
not in equal measure but at least with equal diligence 
and respect. There is no virtue, and potentially great 
harm, in pursuing one approach to the exclusion of the 
other or in ghettoizing the practitioners of the minority 
approach. The triangulation essential to social scien- 
tific advance demands the employment of a variety of 
(viable) methods, including the case study. 

This paper is manifestly not a brief for the case study. 
Rather, it is a brief for the better understanding of the 
case study. We may or may not need more case stud- 
ies in political science. It is hoped, however, that the 
foregoing discussion will encourage better case studies 
and a greater appreciation of their utility within the 
discipline. 

It should also be apparent that the perceived hos- 
tility between case study and non-case study research 
is largely unjustified and, perhaps, deserves to be re- 
garded as a misconception. Case studies may be small- 
or large-N, qualitative or quantitative, experimental or 
observational, synchronic or diachronic. The case study 
research design comports with any social-scientific the- 
oretical framework including behavioralism, rational 
choice, institutionalism, and interpretivism. What dif- 
ferentiates the case study from the cross-unit study is 
its way of defining cases, not its analysis of those cases 


9 Granted, a good deal of cross-unit work involves the reanalyss of 
existing datasets; in this situation the barriers to entry are not so high. 


or its method of modeling causal relations. The case 
study research design constructs cases from a single unit 
while remaining attentive to inferences that span simi- 
lar units outside the formal scope of investigation. Non- 
case study research designs construct cases across units 
exemplifying the principal causal inference. This dif- 
ference in “casing” has important consequences, as we 
have shown. However, it does not render the case study 
epistemologically distinct from the cross-unit analysis. 
Indeed, the two modes of analysis are interdependent, 
and this is as it should be. One is at pains to imagine 
cross-unit research that does not draw upon case study 
work or case study work that disregards adjacent units. 
The strongest conclusion to arise from this methodolog- 
ical reflection concerns the complementarity of single- 
unit and cross-unit research designs. 
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voting. This approach is extremely flexible, applicable to any legislative setting, irrespective of 


W: develop a Bayesian procedure for estimation and inference for spatial models of roll call 


size, the extremism of the legislators’ voting histories, or the number of roll calls available for 
analysis. The model is easily extended to let other sources of information inform the analysis of roll call 
data, such as the number and nature of the underlying dimensions, the presence of party whipping, the 
determinants of legislator preferences, and the evolution of the legislative agenda; this is especially helpful 
since generally it is inappropriate to use estimates of extant methods (usually generated under assumptions 
of sincere voting) to test models embodying alternate assumptions (e.g., log-rolling, party discipline). A 
Bayesian approach also provides a coherent framework for estimation and inference with roll call data 
that eludes extant methods; moreover, via Bayesian simulation methods, it is straightforward to generate 
uncertainty assessments or hypothesis tests concerning any auxiliary quantity of interest or to formally 
compare models. In a series of examples we show how our method is easily extended to accommodate 
theoretically interesting models of legislative behavior. Our goal is to provide a statistical framework for 
combining the measurement of legislative preferences with tests of models of legislative behavior. 


odern studies of legislative behavior focus 
Me the relationship among the policy pref- 

erences of legislators, institutional arrange- 
ments, and legislative outcomes. In spatial models of 
legislatures, policies are represented geometrically, as 
points in a low-dimensional Euclidean space. Each leg- 
islator has a most preferred policy or ideal point in this 
space and his or her utility for a policy declines with 
the distance of the policy from his or her ideal point; 
see Davis, Hinich, and Ordeshook (1970) for an early 
survey. 

The primary use of roll call data—the recorded 
votes of deliberative bodies!—s the estimation of ideal 
points. The appeal and importance of ideal point esti- 
mation arises in two ways. First, ideal point estimates let 
us describe legislators and legislatures. The distribution 
ofideal points estimates reveals how cleavages between 
legislators reflect partisan affiliation or region or be- 
come more polarized over time (e.g., McCarty, Poole, 
and Rosenthal 2001). Roll call data serve similar pur- 
poses for interest groups, such as Americans for Demo- 
cratic Action, the National Taxpayers Union, and the 
Sierra Club, to produce “ratings” of legislators along 
different policy dimensions. Second, estimates from roll 
call analysis can be used to test theories of legislative 
behavior. For instance, roll call analysis has been used 
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in studies of the U.S. Congress, both contemporary and 
historical (e.g., Canes-Wrone, Brady, and Cogan 2002; 
Jenkins 1999; Schickler 2000), state legislatures (e.g., 
Wright and Schaffner 2002), courts (Martin and Quinn 
2001), comparative politics (Londregan 2000b), and in- 
ternational relations (Voeten 2000). In short, roll call 
analysis make conjectures about legislative behavior 
amenable to quantitative analysis, helping make the 
study of legislative politics an empirically grounded, 
cumulative body of scientific knowledge. 

Current methods of estimating ideal points in polit- 
ical science suffer from both statistical and theoretical 
deficiencies. First, any method of ideal point estimation 
embodies an explicit or implicit model of legislative be- 
havior. Generally, it is inappropriate to use ideal points 
estimated under one set of assumptions (such as sin- 
cere voting over a unidimensional policy space) to test 
a different behavioral model (such as log-rolling). Sec- 
ond, the computations required for estimating even the 
simplest roll call model are very difficult and extending 
these models to incorporate more realistic behavioral 
assumptions is nearly impossible with extant methods. 
Finally, the statistical basis of current methods for ideal 
point estimation is, to be polite, questionable. Roll call 
analysis involves very large numbers of parameters, 
since each legislator has an ideal point and each bill 
has a policy location that must be estimated. Popular 
methods of roll call analysis compute standard errors 
that are admittedly invalid (Poole and Rosenthal 1997, 
246) and one cannot appeal to standard statistical the- 
ory to ensure the consistency and other properties of 
estimators (we revisit this point below). 

In this paper we develop and illustrate Bayesian 
methods for ideal point estimation and the analysis of 
roll call data more generally. Bayesian inference pro- 
vides a coherent method for assessment of uncertainty 
and hypothesis testing in the presence of large num- 
bers of parameters, and recent advances in computing 
put Bayesian modeling (via Monte Carlo simulation) 
well within the reach of social scientists. Using our 
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approach, we show how it is possible to extend the 
standard voting model to accommodate more complex 
behavioral assumptions. Our goal is to provide a sta- 
tistical framework for combining the measurement of 
legislative preferences with tests of models of legisla- 
tive behavior. 


A STATISTICAL MODEL FOR ROLL CALL 
ANALYSIS 


In theoretical work on spatial voting models, utility 
functions are usually deterministic and the precise 
functional form, aside from an assumption of quasi- 
concavity, is not specified. For empirical work, it is 
convenient to choose a parametric specification for the 
utilities and to add a stochastic disturbance. Several 
different specifications have been used, but all are quite 
similar. We assume a quadratic utility function for leg- 
islators with normal errors. Poole and Rosenthal (1997, 
235-36) assume Gaussian utilities with extreme value 
errors, Heckman and Snyder (1997) assume quadratic 
utilities with uniform errors for one of the alternatives 
and nonstochastic utility for the other. See Table 2 for 
a comparison of the specifications. 

The data consist of n legislators voting on m different 
roll calls. Each roll call j =1,..., m presents legislators 
i=1,...,n with a choice between a “Yea” position C, 
and a “Nay” position w,, locations in Rf, where d 
denotes the dimension of the policy space. Let y =1 
if legislator i votes Yea on the jth roll call and y,, =0 
otherwise. Legislators are assumed to have quadratic 
utility functions over the policy space, U,(Z,)= 
=]ix, — (e I? +n, and U,(p,) =—|%, =pl? + vy, 
where x, € R? is the ideal point of legislator i, n,, and v,, 
are the errors or stochastic elements of utility, and ||- || is 
the Euclidean norm. Utility maximization implies that 
yy =1 if U,(&,)> U,Qp,) and yı =0 otherwise. The 
specification is completed by assigning a distribution 
to the errors. We assume that the errors n,, and Vy 
have a joint normal distribution with E(m,) = E(,,), 
var(n, — wj)=0? and the errors are independent 
across both legislators and roll calls. It follows that 


Py = 1) = P(U,(Z,) > U,(p;)) 

= Pvy = ny < I =pl? = Ie — IP), 

= Puy =j < 2(čj —,)™ 

+ Wi; ~ oS) 

= &(B)x, — x), (1) 
where B, =2(Z, — tp, )/o,, a, =(2, 0, — py w,)/o,, and 
®(-) denotes the standard normal distribution func- 
tion. This corresponds to a probit model with an un- 
observed regressor x, corresponding to the legislator’s 
ideal point (a logit model results if the errors have 
extreme value distributions). The coefficient vector B, 


is the direction of the jth proposal in the policy space 
relative to the Nay position. 
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Given the assumptions of independence across leg- 
islators and roll calls, the likelihood is 


AB MN) Iewe, = 0%) 


tm] jai 
x1- BRB, -x)= 2) 


where B is an mxd matrix with jth row Bi, a= 
(01,..., Gm)’, Kis ann x d matrix with ith row x,, and 
Y is the n x m matrix of observed votes with (i, j)th 
element y,,. 

The model, as described above, is the simplest pos- 
sible form and is a convenient starting point for more 
elaborate models. For instance, we show later how it 
is possible to add party effects to this specification. 
Clinton and Mierowitz (2001) modify this framework 
to study agenda dependence. It is also possible to in- 
corporate vote trading and cue-taking into the model 
by making the utility of one legislator dependent upon 
either the utility or the voting behavior of another. 

The spatial voting model is equivalent to the two- 
parameter item response model used in educational 
testing,” where §, is the item discrimination parameter 
and a, is the item difficulty parameter, but in the roll 
call context the latent trait or “ability” parameter x, 
is the ideal point of the ith legislator. There is a large 
literature in psychometrics on estimation of these mod- 
els (e.g., Baker 1992 and Bock and Aitken 1981), but 
the focus is usually on estimation of the B, (the item 
parameters), which are used for test equating. In roll 
call analysis, however, primary interest almost always 
centers on the x, (the ideal points), while in psycho- 
metrics the x, (ability parameters) are usually treated 
as random effects. 


Identification 


As it stands, model (2) is not identified. For example, 
suppose that we transform the matrix of ideal points 
X by premultiplying by an invertible d x d matrix R 
and apply the inverse transformation to the direction 
vectors B, X*= XR and B*=BR™. Then the likeli- 
hood L{B, a, X | Y) = L(B*, æ, X* | Y) for all possible 
voting patterns Y: no data can distinguish between the 
different parameter values because any translation or 
rotation of the ideal points and proposals leaves the 
distances between ideal points and alternatives un- 
changed. 

Identification is essential for standard methods of 
estimation, such as maximum likelihood, which are in- 
consistent when the model is unidentified. The role 
of identification in Bayesian estimation is more con- 
troversial. Bayesian procedures can be applied to 
unidentified models, though the data are only infor- 
mative about identified parameters (e.g., Neath and 
Samaniego 1997). However, in many cases it is difficult 
to formulate a reasonable prior for problems involving 


2 This equivalence has been noted by several authors, including 
Bailey and Rivers (1997), Londregan (2000a), and Poole and 
Rosenthal (1997, 247). 
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arbitrary rescalings. For example, we may have some 
prior information about tomorrow’s temperature, but 
itis very difficult to quantify this information unless we 
agree in advance whether temperature is measured on 
the Fahrenheit, the celsius, or some other scale. This is 
a simple example of normalization. The same problem 
occurs in policy spaces since both the origin and the 
metric are arbitrary. 

Rivers (2003) derives necessary and sufficient con- 
ditions for identification of multidimensional spatial 
models based upon a priori restrictions on the ideal 
point matrix X. In the case of a unidimensional policy 
space, the identifying conditions are straightforward: 
Two linearly independent restrictions on the ideal point 
matrix X are required. One possibility is to constrain 
the positions of two legislators at arbitrary positions, 
e.g, Kennedy at —1 and Helms at +1. Alternatively, 
we can constrain the ideal points to have mean zero 
and standard deviation one across legislators. This is 
sufficient for local, but not global, identification (since 
the left-right direction can be reversed by reflecting the 
ideal points around the origin and reversing the policy 
directions). 

In d-dimensional choice spaces, d(d + 1) linearly in- 
dependent a priori restrictions on the ideal points X are 
required for identification. Thus, in two dimensions, it is 
necessary to fix the positions of three legislators (three 
ideal points, each with two elements). In general, local 
identification can be achieved by fixing the positions 
of d +1 legislators. Estimation becomes progressively 
more difficult in higher dimensions. In addition to the 
necessary identifying restrictions, it is also beneficial to 
add other a priori information (see Jackman 2001 for 
an example). 


ESTIMATION AND INFERENCE 


The classical or frequentist approach treats ideal points 
as fixed but unknown parameters. An estimation tech- 
nique, such as maximum likelihood, is evaluated by 
considering its sampling distribution. We imagine the 
ideal points and other parameters to be fixed and draw 
repeated samples from the same data generating pro- 
cess. Each of these samples is a hypothetical roll call 
governed by the same ideal points and bill parame- 
ters. Because voting is probabilistic (see equation [1]), 
each sample yields different votes and hence different 
estimates of the ideal points and other parameters. The 
sampling distribution of an estimated ideal point is its 
distribution across a set of hypothetical roll calls. 








TABLE 1. Number of Parameters In Roll Call Analyses 


The Bayesian approach, in contrast, treats the un- 
known ideal points and other parameters as random 
variables and conditions upon the observed roll call 
data. We represent any a priori information by a prior 
distribution over the parameters. Bayes’ formula de- 
scribes how to combine the prior information with the 
observed data to obtain a posterior distribution which 
summarizes our information about the parameters hav- 
ing seen the roll call data. The Bayesian approach, as 
we will see, allows us to make probability statements, 
such as “Kennedy is more likely than O’Connor to be 
the median justice on the Supreme Court.” Of course, 
this kind of statement is meaningless from the fre- 
quentist perspective, which treats the ideal points as 
fixed. 


Bayesian methods are often thought of primarily as a 
way to use nonsample information in estimation. (See 
Western and Jackman 1994 for some examples from 
political science.) Our motivation for using Bayesian 
methods in roll call analysis, however, is rather differ- 
ent. Roll call data sets are usually very large, so in most 
cases the benefit to introducing additional a priori infor- 
mation is slight. Instead, the real benefit to the Bayesian 
approach is that it turns a very difficult classical estima- 
tion problem into a fairly routine application of Markov 
chain Monte Carlo (MCMC) simulation. Moreover, the 
Bayesian approach lets us make inferences about ideal 
points and substantive hypotheses that were intractable 
with classical techniques. And in addition, the Bayesian 
model and estimation procedures are easily extended 
to handle more complex formulations. 

To understand better the computational challenge 
that roll call analysis presents for classical estimation, 
consider the number of parameters that need to be es- 
timated in some typical applications. With data from 
n legislators voting on m roll calls, a d-dimensional 
spatial voting model gives rise to a statistical model 
with p=nd+m(d +1) parameters. Table 1 presents 
values of p for five different data sets. A moderately 
sized roll call data set (say the 105th U.S. Senate) with 
n=100, m=534 nonunanimous roll calls, and d=1 
yields p=1,168 unknown parameters, while a two- 
dimensional model has p = 1,802 parameters. A typical 
House of Representatives (e.g., the 93rd House) set has 
n= 442 and m=917, so a one-dimensional model has 
p=2,276 parameters, while a two-dimensional model 
has p=3,635 parameters. Pooling across years dra- 
matically increases the number of parameters: For in- 
stance, Poole and Rosenthal (1997) report that fitting a 
two-dimensional model to roughly 200 years of U.S. 











Dimensions (ad) 











Legislators Roll Calls 
Legislature (n) (m) _ 1 2 
U.S. Supreme Court, 1994—97 9 213 435 657 
105th U.S. Senate 100 534 1,168 1,802 
93rd U.S. House 442 917 2,276 3,635 
U.S. Senate, 1789—1985 1,714 37,281 76,276 115,271 
U.S. House, 1789-1985 9,759 32,953 75,485 118,017 
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House of Representatives roll call data gave rise to an 
optimization problem with p > 150,000 parameters. 

The proliferation of parameters causes several prob- 
lems. The usual optimality properties of conventional 
estimators, such as maximum likelihood, may not hold 
when, as in this case, the number of parameters is a 
function of the sample size (see Lancaster 2000 for a 
recent survey). In particular, the customary asymptotic 
standard error calculations, using the inverse of the in- 
formation matrix, are not valid. As a practical matter, 
the size of the information matrix is too large for direct 
inversion. Poole and Rosenthal (1997, 246) take the 
obvious shortcut of fixing the bill parameters at their 
estimated values before calculating standard errors for 
the ideal point estimates. They point out that this is 
invalid, but it reduces the computational burden by an 
order of magnitude. 

The Bayesian methods of estimation and inference 
proposed here are valid for finite samples and do not 
rely any large sample approximations. The number of 
parameters is fixed for any particular estimation prob- 
lem by the actual number of legislators and roll calls 
and Bayes’ theorem gives the posterior distribution of 
the parameters conditional upon the observed data. 
The only approximation involved is the simulation 
of the posterior distribution and this approximation 
can be made to any desired degree of accuracy by in- 
creasing the number of simulations (n.b., not the sample 
size, which is fixed for any given data set). 

Details of the Bayesian simulation procedure we 
adopt are provided in the Appendix, but a brief heuris- 
tic explanation may be useful. The fundamental diffi- 
culty in roll call analysis is that everything other than the 





votes is unobservable: The ideal points, bill parameters, 
and utilities are unknowns. But if it were possible to im- 
pute values to the bill parameters and utilities, then the 
ideal points could be estimated by regression. By the 
same logic, if we were able to impute values for the ideal 
points and utilities, the bill parameters could also be 
estimated by regression. The MCMC algorithm repeat- 
edly performs these imputations and regressions, start- 
ing from an arbitrary point and alternating between 
simulation of the ideal points, bill parameters, and util- 
ities. Under a wide set of conditions (e.g., Tierney 1996) 
MCMC algorithms are guaranteed to generate samples 
from the posterior density of the model parameters, re- 
gardless of where in the parameter space the algorithm 
is initialized. In principle, MCMC algorithms are not 
hampered by the large number of parameters encoun- 
tered in roll call analysis, although obviously compu- 
tational time increases with the number of legislators 
and/or roll calls (see the Appendix) and monitoring the 
performance of the algorithm is more costly. 

We use intentionally vague priors for most of the 
parameters. For each application below, we describe the 
actual prior used, but except where noted, the results 
appear to be insensitive to choice of prior. 


COMPARISON WITH OTHER METHODS OF 
IDEAL POINT ESTIMATION 


Having detailed our approach, Table 2 provides a 
summary of the differences between our approach, 
W-NOMINATE, and the factor-analytic approach of 
Heckman and Snyder. Our approach has more in com- 
mon with W-NOMINATE than the Heckman-Snyder 





TABLE 2. Comparison of Ideal Point Estimation Methods 
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factor analysis approach. The Heckman-Snyder factor- 
analyic approach is distinctive in that the relatively 
simple statistical model (factor analysis) does not fol- 
low neatly from a formal model of legislative voting as 
our quadratic-probit model or the normal-logit model 
underlying W-NOMINATE? but provides ideal point 
estimates relatively cheaply; indeed, factor analysis 
supplies starting values for both the NOMINATE al- 
gorithms and our Bayesian simulation approach. 


Example 1: 106th U.S. House of 
Representatives 


To illustrate the differences and similarities between 
existing approaches and our simulation-based 
Bayesian estimator, we first analyze roll calls from the 
106th U.S. House of Representatives via the several 
methods. We fit a one-dimensional model to these 
data using principal components (extremely similar to 
the Heckman-Snyder estimator),4 W-NOMINATE, 
as well as our Bayesian approach. We use the probit 
version of our model and impose the identifying con- 
straint that the legislators’ ideal points have mean zero 
and unit variance across legislators. After discarding 
lop-sided votes, W-NOMINATE uses 871 roll calls and 
does not fit an ideal point for Livingston (R-LA), who 
resigned from Congress in February 1999 after voting 
on 19 roll calls in the 106th House. Lop-sided votes and 
short voting records pose no problems in the Bayesian 
approach; we estimate ideal points for all legislators 
and include all but unanimous roll calls, yielding 
m=1,073 roll calls in all, comprising 444,326 individual 
voting decisions. With predicted probabilities of 5 as 
a classification threshold we correctly classify 89.9% 
of the individual voting decisions? and find that 1,007 
of the 1,073 (93.8%) roll calls discriminate with respect 
to the single latent dimension.® Of the 66 roll calls that 


3 The issue is that the Heckman—Snyder factor analytic statistical 
model results from a linear probability model for roll call voting, in 
turn driven by the assumption that the stochastic component of the 
legislators utility differential (the net utility a legislator has for voting 
Yea over voting Nay) follows a uniform distribution. In turn, the util- 
ity functions that rationalize this statistical model have unspecified 
stochastic components, since there does not exist a distribution such 
that the difference of two independent realizations from it yield a 
quantity with a uniform distribution. While no more ad hoc than 
the usual stochastic assumptions (normal or Type-1 extreme value 
errors), the H-S assumptions are somewhat idiosyncratic. This tech- 
nical point aside, there are other more practical reasons to prefer our 
approach over factor-analytic approaches, such as the availability of 
standard errors and the extensibibty of the model 

4 We implement the principal co ts estimator as follows: (1) 
compute an n x m matrix D by double-centering the roll call matrix, 
(2) compute P, an n x n correlation matrix (the correlation matrix 
of D’, using pairwise deletion of missing data), and (3) take the first 
d eigenvectors of P es the ideal point estimates for a d-dimensional 
model. 

5 This classification rate 1s a function of the unknown model param- 
eters and, so, is itself subject to uncertainty; here we report the clas- 
sification rate averaging over uncertainty in the model parameters 
See the discussion of auxiliary quantities of interest in section. 

6 That is, these 1,007 roll calls all had slope coefficients (j, the 
equivalent of item discrimination parameters) whose 90% posterior 
confidence mtervals did not cover zero. 


fail to discriminate with respect to the recovered dimen- 
sion, only two roll calls were decided by margins closer 
than 60%—40%. In short, a one-dimensional model ap- 
pears to be a very good characterization of these roll call 
data. 


Figure 1 plots the three sets of ideal point esti- 
mates against one another. This figure exemplifies a 
pattern we have seen in many other roll call data 
sets: When n and m are both reasonably large and a 
low-dimensional model fits the data well, there is ex- 
tremely little difference in the ideal point estimates 
produced by W-NOMINATE and our Bayesian esti- 
mator. In this specific example, n = 440, m= 1,073, and 
a one-dimensional model gives an extremely good fit to 
the data (as is typical of recent U.S. Congresses), and 
the ideal point estimates correlate at .996. Nonethe- 
less, by retaining more of the lop-sided votes than 
W-NOMINATE, our Bayesian estimator can discrimi- 
nate among extremist Democrat legislators (in the left 
tail), effectively “stretching” the distribution of the 
Democrats ideal points relative to W-NOMINATE’s 
estimates. The comparison of both W-NOMINATE 
and our Bayesian estimator with the principal com- 
ponents estimator reveals the linearity of the factor 
analytic model, with the two nonlinear models both 
generating more discrimination among extremist legis- 
lators. 


ESTIMATION AND INFERENCE FOR 
AUXILIARY QUANTITIES OF INTEREST 


An advantage of the Bayesian approach is that it is 
straightforward to estimate posterior distributions over 
any auxiliary quantity of interest that is a function of 
the model parameters. These quantities of interest can 
be any function of the model parameters, as we now 
demonstrate. 


Example 2: Pivotal Senators In the 106th 
U.S. Senate 


The notion of pivotal legislators is critical to many 
theories of legislative behavior. For instance, superma- 
jorities are often needed for extraordinary legislative 
action, such as the two-thirds majority required to over- 
ride a presidential veto or the 60 votes needed to pass 
cloture motions in the U.S. Senate. Order statistics of 
ideal points play a prominent role in theories of leg- 
islative politics: e.g., the “gridlock interval” is defined 
as the region between the filibuster pivot and the veto 
pivot, and in the case of a liberal president the gridlock 
interval is bounded on the left by the veto pivot (the 
33rd senator) and on the right by the filibuster pivot (the 
60th senator) (e.g., see Krehbiel 1998, Fig. 2.2). Formal 
theories of legislative politics make sharp and exact 
predictions on the basis of these pivotal locations: e.g., 
proposals that attempt to change status quo policies 
located in the gridlock interval will not succeed. To op- 
erationalize these theoretical predictions, the gridlock 
interval is usually computed using the estimated ideal 
points of the corresponding legislators (e.g., Howell 
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et al. 2000). Similarly, Schickler (2000) characterizes 
the parties’ ideological positions with the ideal points of 
the median legislator within each party. Given the im- 
portance of individual legislators such as the chamber 
median or the “filibuster pivot” (i.e., the 40th senator), 
it is straightforward to generate posterior estimates for 
both the identity and the spatial location for such leg- 
islators in our Bayesian simulation approach. 
Consider the task of uncovering the identity of the 
“pivotal” senators in the 106th Senate (n = 102, m = 59% 
nonunanimous roll calls; there are 102 senators because 
of the replacement of John Chafee (RI) by his son 
Lincoln Chafee and the replacement of Paul Coverdell 
(GA) by Zell Miller, although in assessing rank order 
we ignore the “replacement senators”). We fit a unidi- 
mensional model to these data, again with the identifi- 
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cation constraint that the ideal points have mean zero, 
variance one across legislators. To determine which 
senators are critical for invoking cloture or which are 
the median senators requires recovering the posterior 
distribution of the rank of each senator’s ideal point. 
We compute this by repeating the following scheme an 
arbitrarily large number of times: (1) sample the leg- 
islators’ ideal points x, from their joint posterior den- 
sity; (2) rank order the sampled ideal points; (3) note 
which legislator’s occupies a particular pivot or order 
statistic of interest. We then report the proportion of 
times the ith legislator’s ideal point is the pivot or order 
statistic of interest over these repeated samples from 
the posterior of the ideal points. Since we are working 
with an arbitrarily accurate approximation of the joint 
posterior density for the ideal points, inferences as to 
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the ranks and the identity of the legislators occupying 
particular ranks are also arbitrarily accurate.’ 

Figure 2 summarizes the posterior densities over the 
identities and locations of the senators at key pivot 
points. We omit the “replacement senators” Chafee and 
Miller from these calculation. There is almost no doubt 
as to identity of the chamber median: Republican Sen- 
ators Snowe and Collins are the only senators with pos- 
itive probability of being the 50th senator, with Collins 
overwhelmingly most likely to be the 50th (p=.98). 
Twenty-two senators have discernible probability of 
being the veto pivot (the 33rd senator), with 10 sen- 
ators having probabilities greater than 5% of being the 
veto pivot: Senators Baucus (p =.12), Biden (p= .11), 
Johnson (p = .08), Graham (p= .08), Bayh (p=.08), 
and Cleland (p=.08) account for roughly half of the 
uncertainty as to the identity of the veto pivot, but 
clearly no one senator is unambiguosly the veto pivot. 
Thirteen Republican senators have positive probability 
of being the filibuster pivot (the 60th Senator), five 
of whom have p> .05, with Stevens the most likely 
candidate for the filibuster pivot (p = .41), followed by 
Warner (p = .26) and Campbell (p = .09). 

A similar computation can be performed to recover 
the estimated spatial location of pivotal legislators. The 
right-hand panels in Figure 2 show the location of the 
median, veto pivot, and filibuster pivot (with confi- 
dence intervals), along with the estimated ideal points 
(posterior means) and 95% confidence intervals of ad- 
jacent senators. Again, it is apparent that there is little 
uncertainty as to the median (Senator Collins). But an 
interesting result arises for the veto pivot: Although 
we are unsure as to the identity of the veto pivot, we 
are quite sure as to the veto pivot’s location. A similar 
result is also apparent for the filibuster pivot. While we 
may be able to pin down the location of either pivot 
with some precision, we do not know which legislators 
will be the veto and filibuster pivots on any given vote. 
This is a seldom noticed feature of contemporary U.S. 
Congresses, but one with implications for lobbying and 
legislative strategy; i.e., relatively precise knowledge of 
where the pivots lie does not correspond to knowing the 
identity of pivotal legislators, whose votes are necessary 
to guarantee cloture or veto-proof majorities. 


Example 3: Party Switchers and the “Party 
Influence” Hypothesis 


A major advantage of our modeling approach is the 
ability to extend the model to encompass alterna- 
tive models of legislative behavior. For instance, thus 
far we assumed a Euclidean spatial voting model, 
in which, conditional on legislators’ unobserved ideal 
points (which are considered constant over the period 


7In pmneiple, one could mplement a similar procedure with 
W-NOMINATE estimates, but two complications arise: (1) all 
covanances among ideal points are implicitly set to zero since 
W-NOMINATE only reports pointwise standard errors on each ideal 
point and (2) an asymptotically valid normal approximation is as- 
sumed to characterize ideal point uncertainty (less pressing when 
working with large roll call data sets). Lewis and Poole (2003) use 
parameteric bootstrapping. 


spanned by the roll call data), voting is independent 
across legislators and roll calls. In the next set of exam- 
ples we consider alternatives to this basic setup, all of 
which are easily accommodated in our approach. 

A question of particular interest to congressional 
scholars is the influence of political parties on the 
voting records of legislators. Party switchers—that 
is, legislators who change parties between elections, 
while continuing to represent the same geographic 
constituency—provide something akin to a natural ex- 
periment: The switcher’s change in party affiliation 
helps identify a “party effect,” since many other deter- 
minants of roll call voting remain constant (e.g., char- 
acteristics of the legislators’ constituency). The typical 
investigation of party switchers (e.g., Nokken 2000) 
uses a “pre/post” or “differences-in-differences” de- 
sign, comparing changes in ideal point estimates for 
the party switcher relative to the changes among the 
switcher’s fellow legislators or a random selection of 
nonswitchers (McCarty, Poole, and Rosenthal 2001).° 
By definition, splitting the roll call data into “pre” and 
“post” switching periods gives us fewer data than in the 
entire legislative session, and, as a consequence, ideal 
point estimates based on the pre and post sets of roll 
calls will be less precise than those based on all roll calls. 
Any comparison of change in the ideal points ought to 
properly acknowledge the relative imprecision arising 
from the smaller sets of roll calls available for anal- 
ysis. A strength of our Bayesian simulation approach 
is that we routinely obtain uncertainty assessments for 
all model parameters, and all inferences will reflect the 
drop in precision occasioned by slicing the roll call ma- 
trix around the party switch. 

The flexibility of our approach lets us formally embed 
a model for change in ideal points in a statistical model 
for roll call data, greatly facilitating investigation of 
the party switching hypothesis. Let s € {1, ... , n} des- 
ignate the party switcher and x, and x,o be the ideal 
points of legislator i in the postswitch and preswitch 
periods, respectively. Then a weak form of the party 
switcher hypothesis is that 5, = x,1 — xs #0 (ie., the 
party switcher’s ideal point changes, presumably in 
a direction consistent with the change in parties). A 
strict form of the party switcher hypothesis involves 
the n — 1 additional restriction ô, = 41 — %0 =0, ifi #s 
(ie, the party switcher is the only legislator whose 
preswitch and postswitch ideal points differ). An in- 
termediate version of the switching hypothesis main- 
tains that legislative ideal points may “drift” but that 
the party switcher’s 6, is larger than the nonswitchers’ 
5,. In any event, we require estimates of each legisla- 
tors’ 5, either by running two separate analyses (split- 
ting the roll call data around the time of the party 
switch) or by a more direct (but equivalent) approach 





8 An obvious threat to this approach 1s self-selection into the “treat- 
ment” of party switchmg, say, if party switching is motivating by 
change in the personal preferences of the legislator, in this case we 
could not distinguish any party effect from an effect due to the per- 
sonal preferences of the legislator, but we do not pursue this issue 
here. Other analyses of party switchers have also noted this problem: 
See McCarty, Poole, and Rosenthal (2001, 686). 
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FIGURE 2. Uncertainty Over Identity and Location of Pivots, 106th U.S. Senate 
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in which we parameterize the postswitch ideal points 
asxo+6,,i=1,...,7. 

Since the pre- and post-switch roll calls do not refer 
to identical proposal and status quo positions, some 
normalization is required to compare the resulting 
ideal point estimates. Our solution is to focus on rel- 
ative changes, since without further assumptions, any 
“global” or “uniform” shift in the legislative agenda 
or preferences around the party switch is unidentified. 
That is, the average postswitch legislative proposal may 
be more liberal than the average preswitch proposal 
by distance w, or (observationally equivalent) all leg- 
islators may move w to the right (put simply, there is 
no guarantee that the pre and post ideal point esti- 
mates are comparable). Solutions to this scaling prob- 
lem abound: One could try to find bills with iden- 
tical “Yea” and “Nay” locations in both periods, or 
assert that particular legislators do not move across 
the party switch. The identifying restriction we adopt 
is to estimate subject to the constraint that the legis- 
lators’ ideal points have mean zero and unit variance 
in each period and that we interpret the 6, as relative 
changes. 

We illustrate our method with the 107th U.S. Senate. 
On May 24, 2001, Senator James M. Jeffords (VT) 
announced that he would leave the Republican party 
and become an independent. The switch was partic- 
ularly consequential, giving the Democrats control 
of the Senate: The postswitch partisan composition 
was 50 Democrats, 49 Republicans, and 1 Indepen- 
dent (Jeffords). One hundred forty-eight nonunani- 
mous votes were recorded in the 107th Senate prior 
to Jeffords’s switch, and 349 were recorded post- 
switch. 

Figure 3 summarizes the preswitch and postswitch 
ideal point estimates (top panels) and rank orderings 
(lower panels). We find that the ideal points underly- 
ing Jeffords’ voting behavior differ across the switch: 
The estimate of Jeffords’s postswitch ideal point is 
statistically distinguishable and more liberal than the 
preswitch estimate, indicating that a shift in Jeffords’s 
preferred policy position accompanied his switch in 
party affiliation. 

But of course, there are numerous other senators 
who do not switch parties but whose ideal points do 
move markedly. Points above (below) the 45° lines in 
the left-hand panels in Figure 3 indicate senators mov- 
ing in a conservative (liberal) direction. The 10 largest 
changers in terms of ideal points are presented in the 
top-right panel; the 10 largest changers in terms of rank 
order appear in the lower-right panel (horizontal bars 
indicate 95% confidence intervals). The largest changer 
in terms of spatial location is Wellstone (D, MN), witha 
jump to the left estimated to be about .75 on the recov- 
ered scale. Since Wellstone is a preference outlier, his 
ideal points (and hence change in ideal point) are esti- 
mated with considerable imprecision. Jeffords’s change 
is almost as large, but estimated with considerably more 
precision. However, even after taking into account un- 
certainty in the magnitude of the changes, there is no 
disputing that Wellstone’s change is larger: The prob- 
ability that Wellstone’s change is the largest change is 
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284, while the probability that Jefford’s change is the 
largest is only .068.? 

Intriguingly, other large changers in terms of spatial 
location are the two party leaders, Daschle (D, SD) 
and Lott (R, MS). With Jeffords leaving the Republi- 
cans, Daschle becomes majority leader while Lott be- 
comes minority leader. Both Daschle and Lott move 
in a conservative direction, and are the third and fifth 
largest changers, respectively. When we trun to a con- 
sideration of ranks (lower panels in Figure 3; Daschle is 
the largest switcher, moving 31 places to the right and 
becoming markedly more moderate relative to other 
senators. Daschle changes from about the 10th senator 
(95% bound: 2nd-21st) to the 41st (35th—45th) sena- 
tor (arraying senators from left to right, or liberal to 
conservative), jumping from being unambiguously on 
one side of the Democratic senate median to the other. 
Lott is the fourth largest changer in terms of ranks (20 
places), jumping from the 77th senator (68th—87th) to 
the 97th (95th-100th). Jeffords, the party switcher, is 
only the seventh largest switcher in terms of ranks and 
is the largest switcher with probability .01. Daschle is 
the largest switcher in terms of rank, with probability 
.786. We would reject any hypothesis that the party 
switcher’s change was the largest change we observe, 
in terms of either spatial location or ranks. 

These results are based on a unidimensional fit to 
a single congress, and, fortuitously, a party switch that 
brought about a change in party control of the Senate, 
and so we are cautious about reaching for any broad 
conclusion. Nonetheless, these results are consistent 
with policy moderation by majority leaders, perhaps in 
order to more effectively represent and articulate party 
policy positions, or even so as to secure the majority 
leadership in the first instance. 

This example models change in ideal points around 
a recent, vivid instance of party switching. But other 
models of change in ideal points can be easily fit into 
this framework. Party switchers perhaps provide the 
most direct evidence of party effects, but the method- 
ology can be used to examine changes elsewhere (e.g., 
across congressional sessions, congresses, or politically 
consequential “shocks” in American political history). 


Example 4: A Two-Cutpoint, “Party 
Influence” Model 


Our final example again examines the question of party 
pressure in legislative behavior. The standard model 
assumes that conditional on a legislator’s ideal point x, 
and the vote parameters B,, vote y, is independent of 
yy’, for alli Æi’, j A j": eg, shocks making legislator 
i more likely to vote Yea do not make legislator i’ any 


9 The probability that Senator X1s the biggest changer is computed 
as the proportion of umes a particular senator’s change is the largest 
change, over many draws from the posterior density of the preswitch 
and postswitch ideal points (i.e., we induce a posterior density on 
the identity of the largest switcher. This example again highlights the 
flexibility and ease of the Bayesian simulation approach to estimation 
and mference for roll call analysis See also the earlier example where 
we induce posterior densities over the identity of critical pivots. 
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more or less likely to vote Yea. Accordingly, the re- 
covered ideal points need to be interpreted with some 
caution. Party influence or “whipping” is one way that 
conditional independence can be breached: e.g., legisla- 
tors whose preferences might lead them to vote Yea are 
persuaded to vote Nay, and vice versa. Modeling roll 
call data without considering these possibilities leads to 
ideal point estimates that absorb a shock common to 
members of a given party (or, say, a whipped subset of 
a party’s members); to the extent that party influence 
is an unmodeled common shock, then the recovered x 
display greater partisan polarization than exists in the 
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and the 


“true” x,. Note that while party influence is a plau- 
sible mechanism for generating party-specific utility 
shocks, we are wary of inferring the presence of party 
pressure given evidence of party-specific utility shocks; 
we acknowledge that other mechanisms may generate 
party-specific utility shocks (e.g., lobbying by activists 
or interest groups that targets legislators from one party 
more than the other) and so we refer to “party-specific 
inducements” rather than “party-pressure.” 

It is straightforward to augment the standard vot- 
ing model to allow for party-specific inducements. For 
instance, suppose that in addition to the standard 


American Political Science Review 


quadratic spatial utilities, legislator i receives ô 5 a 
net incentive to vote Yea vis-à-vis Nay on vote j, but 
where the net incentive is specific to i’s party affilia- 
tion, B = D if i is a Democrat and PF, = R if i is a Re- 
publican. If there are only two parties, then a, and sf 
are unidentified. At best we can estimate a net differ- 
ence in party-specific inducements, ô; =5/ — 57 (see 
also Krehbiel 2003): i.e., we estimate y; = U, (“Yea”) 
—U,,(“Nay”)=%,B, — «, + ô, D, + £j, where D, is a 
binary indicator coded one if legislator z is a Democrat 
and zero otherwise, and for this example we assume 
that the e, are iid logistic (logit rather than probit).° 
Since the standard model nests as a special case of 
the two-cutpoint model, it is straightforward to assess 
whether the restrictions implicit in the standard model 
are valid, by testing the joint null hypothesis Hh: 5, =0, 
for all j. 

In addition, we also need to define a set of votes in 
which the net party-specific inducements are not rele- 
vant (or can be reasonably assumed to be zero), since 
if every roll call was assumed to be potentially subject 
to party influence, then there is no way to compare the 
recovered ideal points of Democrats and Republicans. 
By way of analogy, consider standardized test items 
that (say, due to cultural biases) are suspected to be 
easier for group A than for group B: a phenomenon 
known as differential item functioning (DIF); e.g., see 
Holland and Wainer (1993). Does better test perfor- 
mance by group A reflect higher ability than group 
B, or DIF? Unless we can identify a set of test items 
that are known to be DIF-free and use these items to 
pin down ability, then there is no way to distinguish 
apparent differences in ability from DIF. 

Several implementations of party influence models 
appear in the literature. Snyder and Groseclose (2000) 
use a two-stage procedure: First, using lop-sided votes 
(those decided by more than 65/35 margins), estimate x, 
via the standard model using the linear factor-analysis 
model due to Heckman and Snyder (1997); second, 
on non-lop-sided votes, estimate the linear regression 
of votes on x, and a dummy variable for Democratic 
legislators, with the coefficient on the dummy variable 
interpreted as a coefficient of party influence. Aside 
from (1) equating net party-specific inducements with 
party influence and (2) assuming no net party-specific 
inducements for lop-sided votes, this approach makes 
some strong additional assumptions: (3) the use of the 
Heckman—Snyder factor-analytic estimator in the first 
stage; (4) the use of a linear regression model with the 
binary dependent variables (Yeas and Nays) in the sec- 
ond stage (Synder and Groseclose use a Huber-White 
robust variance—covariance estimator to correct for 
the resulting heteroskedasticity); and (5) the fact that 
the x, result from a measurement procedure (the first 
stage) and generate an “errors-in-variables” problem, 


10 McCarty, Poole, and Rosenthal (2001) refer to this class of model 
as a “two-cutpoint” model, smce it implies separate points where 
legislators of each party are indifferent between voting “Yea” and 
voting “Nay.” 
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which Synder and Groseclose tackle via instrumental 
variables." 

In contrast, our approach provides a direct way to 
test for party influence; our modeling approach is eas- 
ily extended to let us embed parameters tapping party 
influence. That is, we fit one model to the data, with 
the statistical specification following directly from aug- 
menting the utility functions with inducements specific 
to each vote, by party affiliation. In this way there is 
no need to break the estimation into separate pieces 
(one to recover ideal point estimates free of party ef- 
fects, the other to recover estimates of effects, con- 
ditional on the ideal points recovered from the first 
stage): Uncertainty in the recovered ideal points esti- 
mates propagates into uncertainty in the estimates of 
the vote-specific parameters (B,, œ}, and &j), and vice 
versa. The two-stage approach has been criticized for 
generating baised estimates of the ideal points of mod- 
erate legislators in the first stage (e.g, McCarty, Poole, 
and Rosenthal 2001, 675-77). But with our approach 
we use all votes to learn about the legislators’ ideal 
points—even those votes thought to be subject to party 
pressure—and so we are less prone to this source of 
bias. 

Note that in our parameterization the 6, are net 
utility shocks specific to Democratic senators and, so, 
alternate sign depending on whether the net party- 

ific inducement was for Democrats to vote Yea 
(8, > 0) or Nay (5; <0). In addition, for perfect party 
line votes, the likelihood for the data attains a maxi- 
mum at 6, = + oo (i.e. votes with perfect separation by 
party are consistent with an infinite amount of party 
pressure!), causing classical estimation procedures to 
break down. Party pressure may be large or small, but 
certainty not infinite, and we express this belief via 
proper prior densities on the 6, that assign infinitesi- 
mal probability to infinite party pressure, and bound 
the posterior densities for ô, away from +00.” 

We estimate our expanded model with 534 nonunan- 
imous roll calls from the 105th U.S. Senate (55 Re- 
publicans, 45 Democrats). To identify the model we 
constrain Senators Kennedy and Helms to have ideal 
points of —1 and 1, respectively. The two-cutpoint 
model requires the additional identifying constraint 
5,=0 among roll calls decided with majorities of 
65% or more (“lop-sided” votes), consistent with the 
Snyder and Groseclose (2000) approach: 257 of the 534 
nonunanimous roll calls meet this criteria. Vague nor- 
mals prior distributions are used for the 5, parameters 
(mean zero, variance 16) for the remaining 277 “close” 
roll calls (decided by margins closer than 65-35). Of 
these close roll calls, 93 (33.6%) have net party-specific 
inducements that are significantly different from zero 
(again, in the sense that the 95% confidence inter- 
val on the corresponding 6, does not overlap zero). 


11 Another implementation of the two-cutpoit approach appears in 
McCarty, Poole, and Rosenthal (2001), usmg a nonparametric opti- 
mal classification algorithm (Poole 2000): they compare the increase 
10 Classification success in moving from one to two cutpoints. 

12 Note that Snyder and Groseclose (2000) effectively sidestep this 
issue by fitting the binary roll data by least-squares linear regression. 
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FIGURE 4. Net Party-Specttic Inducements, 105th U.S. Senate, by Roll Call Margin and Party 
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Figure 4 plots the means of the posterior density of 
each 5, parameter against the margin of victory (left 
panel) and party cohesion (right panel) per “close” roll 
call. Our analysis shows that net party-specific induce- 
ments are at their largest for roll calls decided along 
party lines and decline as the votes become more lop- 
sided. Nonetheless, the extent of party line voting we 
find here is smaller than those reported by Snyder 
and Groseclose (2000) and closer to the proportions 
found by Cox and Poole (2002), although the latter 
analysis considered voting in the House of Represen- 
tatives. 

The actual magnitudes of the 5, warrant elaboration 
as well. The smallest, statistically significant 5, we ob- 
tain are roughly +2.0. To assess the magnitude of this 
effect, consider a moderate Republican and Democrat 
who ex ante are indifferent between voting Yea and 
voting Nay. A net party-specific inducement corre- 
sponding to ô, = 2 makes the probability of the Demo- 
crat voting Yea about .73 and the corresponding prob- 
ability for the Republican .27. If the net party pressure 
is all one-sided, say all Democratic, then the probabil- 
ity of the Democrat voting Yea goes up to .88; con- 
versely, if the party pressure is all Republican, then the 
probability of the Republican voting Yea is .12. Note 
that these are the smallest statistically significant esti- 
mates of ô, we obtain, with estimates of +4 being more 
common. That is, although we find evidence consistent 
with party pressure on just one-third of close votes, the 
magnitude of that pressure seems large and politically 
consequential. 
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Note: Each polnt represents postenor means of the net party-spectfic inducements parameters (5,), plotted agalnst roll call margin 
and party cohesion (number of Democrats voting with majortty of Democrats plus number of Republicans voting with majority of 
Republicans, as a proportion of all votes cast). Solid points Indicate parameters signiicantly different from zero at conventional 95% 
levels Unsurprisingly, the largest and moet consistently significant net party-specrfic Inducement estimates occur when the Senate splits 








We can compare the ideal point estimates produced 
by our augmented model with those from the stan- 
dard model. A standard one-dimensional spatial voting 
model finds absolutely no overlap between the ideal 
points of Democrats and Republicans in the 105th U.S. 
Senate. This complete separation between the parties 
arises in part because any voting on close votes driven 
by party-specific inducements is attributed wholly to 
differences in the legislators’ ideal points. Once we ad- 
mit the possibility of party-specific inducements, the 
two partisan groupings actually overlap: Breaux, the 
most conservative Democrat, has roughly the same 
ideal point as the most liberal Republicans, Specter 
and Jeffords. Even after estimating net party-specific 
inducements specific to each roll call, we still find con- 
siderable partisan polarization but substantially less 
polarization than the levels recovered from a conven- 
tional statistical analysis using the standard model. 


CONCLUSION 


Roll call analysis and the statistical operationalization 
of the Euclidean spatial voting model is a critical com- 
ponent of the scientific study of legislative politics. Al- 
though existing methods for the statistical analysis of 
roll call data have been employed in many settings and 
produced important insights, the Bayesian simulation 
approach we present builds upon and improves extant 
methods in several ways. First, Bayesian methods per- 
mit auxiliary information to be brought to bear on roll 
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call analysis in a straightforward and theoretically con- 
sistent way; this auxiliary information may include (but 
is not restricted to) expert judgments about dimen- 
sional structure, the location of extremist legislators, 
legislator-specific covariates, or the evolution of the leg- 
islative agenda. Second, the methodology we present is 
sufficiently flexible so as to easily accommodate alter- 
native models of legislative behavior. For example, it 
is possible to permit ideal point estimates to change 
over time by modeling the process associated with that 
change (e.g., legislators switching affiliations between 
political parties). Finally, Bayesian simulation exploits 
tremendous increases in computing power available 
to social scientists over the last decade or so: Estima- 
tion and inference via simulation—iong known to be 
an attractive statistical methodology (e.g., Metropolis 
and Ulam 1949)—is now a reality. Consequently, our 
model works in any legislative setting, irrespective of 
the size of the legislature or its agenda. 

Thus Bayesian methods can make roll call analysis 
less a mechanical scaling exercise in which scholars 
simply feed roll call data to a “black box” algorithm 
and more a way to test theoretically interesting models 
of legislative behavior. In sum, the Bayesian simula- 
tion methodology we present lets scholars (1) incorpo- 
rate substantive information about the proposals being 
voted upon or (2) about the prefenences that structure 
the ideal points being estimated, (3) impose theoret- 
ically implied constraints on the standard model, and 
(4) easily estimate and test alternative and models of 
legislator voting. 


APPENDIX 


Markov Chaln Monte Carlo Algorithm 


The difference between the utilities of the alternatives on 
the jth roll call for the :th legislator is y; mU;(Z,) — 
U (p,)=B x — æ; + &,, where, for simplicity, we have set 
o=: If B, and a, are given, then x, is a vector of “regres- 
sion coefficients” that can be imputed from the regression of 
ys + %, on B, using the m votes of legislator i. If x, is given, 
then we use the votes of the n legislators on roll call j to 
impute 6, and œ. Then given x,, B,, and œ, the latent utility 
differences y* are simulated by drawing errors from a normal 
distribution subject to the constraints implied by the actual 
votes (if y; = 1, then yf, > 0, and if y, = 0, then yf < 0), and 
the process repeats. 

In our Bayesian approach, priors are required for all pa- 
rameters: B, and a,, /=1,...,mandx,,i=1,...,n. For the 
probit version of our model we have standard normal errors 
E€; and normal priors for the ideal points and the $, and a, 
parameters, leading to simple expressions for the conditional 
densities that drive the MCMC algorithm. For $, and a,, we 
denote the priors N(To, To); we generally choose vague pri- 
ors by setting To =0 and To = x - Iy,;, with K a large positive 
quantity (e.g, K = +). For the legislators’ ideal points, we 
use the normal prior x, ~ N(v,, V,), where usually v, =0 and 
V, =I, (an identity matrix of order d), but for legislators we 
are fixing to an set location (e.g., Kennedy and Helms, so as 
to normalize the scale of the latent traits), we use the prior 
N(x, v: Ia), where v is an arbitrarily small, positive quantity 
(i.¢., a “spike prior” at x). 
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The goal is to compute the joint posterior density for 
all model parameters and «,,f/=1,...,m and x,,/ = 
1, ..., n. A MCMC algorithm provides a computer-intensive 
exploration or “random tour” of this joint density, by succes- 
sively sampling from the conditional densities that together 
characterize the joint density. The model here 1s isomorphic 
with a two-parameter item—response theory model: Albert 
(1992) showed how an MCMC algorithm can be used to ex- 
plore the posteriðt density of this model; see also Patz and 
Junker (1999) and Johnson and Albert (1999). Augmenting 
the MCMC algorithm with the latent y* greatly simplifies 
the computation of the probit version of the model, letting 
us exploit standard results on the Bayesian analysis of linear 
regression models as we show below, we obtain y, by sam- 
pling from its predictive density given the current values of 
the other parameters and the roll call data. Letting ¢ index it- 
erations of the MCMC algorithm, iteration t of the algorithm 
comprises sampling from the following conditional densities. 


1. 80% |), X, Ba). At the start of iteration t, we have 
BY, a’) and x"). We sample yP from one of the two 
following densities, depending on whether we observed a 
Yea (y,, = 1) or a Nay (y, = 0): 


yt | (u =0, x), po), of) ES Niu, 1)I(y* <0) 


(truncated normal), 

y [Oy = Lat, BI, of) ~ Nu? IGG, = 0) 
(truncated normal), 

where Te ed — a" and I(-) is an indica- 


tor function. For abstentions and other missing roll calls 

we sample yn from the untruncated normal density 
NEY, 1), effectively generating multiple imputations 
for these missing data over iterations of the MCMC al- 
gorithm. 

2. g(B,, œ |X, y5). For j=1,...,m, sample g” and a 
from the multivariate normal density with mean vec- 
tor [X"X* +T; X" yO + T5 To] and variance- 
covariance matrix [X”X*+T)']-!, where X* is an 
n x (d + 1) matrix with typical row xt = (x, —1), y™ is 
an n x —1 vector of sampled latent utility differentials for 
the jth roll call, and recalling that N(To, To) 1s the prior for 
B, and CFR This amounts to running “Bayesian regression” 

fy” on 1“! and a negative intercept and then sampling 
from the posterior density for the coefficients B, and a,, 
for j=1,...,m. 

3. g(x, | y5., P, œ). Rearranging the latent linear regres- 
sion yields w,, = y}, + a, =x,B, + &,. Collapse these equa- 
tions over the j subscript, to yield the n regressions 
w, =Bx,+¢,, where B is the mxd matrix with the 
jth row given by B’. That is, we have n regressions, 
with the ideal points x, as parameters to be updated. 
Again exploiting conjugacy, the update is performed by 
sampling each x” from the d-dimensional normal den- 
sity with mean vector (B'B + V,')-1(B’w, + V,‘v,) and 
variance-covariance matrix (BB + V;")-!. After updat- 
ing all x,(i =1,...,m), we optionally renormalize the x, 
to have zero mean and unit variance, say, when fitting 
a unidimensional model without a Kennedy-Helms type 
restriction. 

Sampling from these distributions updates all the unknown 

quantities in the probit model. At the end of iteration t, 
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denote the current values of the parameters of interest as 
EO — (BO, a, XO). Iterating the MCMC algorithm pro- 
duces a sequence ED, ED, ... that comprises a Markov chain, 
with the joint posterior density for & as its limiting distribu- 
tion. That is, after a large number of iterations of the algo- 
rithm, successive samples of E are drawn from its posterior 
density. These samples are saved and summarized for infer- 
ence. Any function of these parameters can also be computed 
and saved, such as rank orderings of the legislators, pairwise 
comparisons of legislators, or the separating hyperplanes for 
particular roll calls. 

The MCMC algorithm is initialized as followa For the 
ideal points, we perform the eigen-decomposition described 
in footnote 4. These initial values are the estimates we would 
get from treating the ideal-point estimation problem as a 
principal-components factor analysis problem, ignoring the 
fact that the roll call data are not continuous variables (the 
binary character of the roll call data becomes less problematic 
as m— oo, and so for large roll call data sets from contem- 
porary U.S. Congresses this procedure yields excellent start 
values). We are grateful to Keith Poole for suggesting this 
procedure, which is also used to generate start values for 
NOMINATE. For the bill-specific parameters B, and œ, we 
obtain start values by running probits of the observed votes 
y, on the start values for the ideal points, j =1, .., m. 

With any MCMC approach, diagnosing convergence of the 
Markov chain is critical. Our experience is that the MCMC 
algorithm performs reasonably well for the roll call problem, 
moving away from start values to the neighborhood of a pos- 
terior mode quite quickly. For simple unidimensional fits, we 
usually let the sampler run for anywhere between 50,000 and 
500,000 iterations and then thin the output (storing the output 
of every 100th to every 1,000th iteration) so as to produce a 
reasonable number of approximately independent samples 
from the posterior for inference (say, between 250 and 1,000 
samples). 

Figure 5 shows trace plots of the MCMC algorithm for 
single parameters (the ideal points of Kennedy, Collins, and 
Helms, from a unidimensional model fit to 106th U.S. Sen- 
ate) in the top three panels and for the joint density of se- 
lected pairs of legislative ideal points in the lower panels. 
The identifying restriction is that the ideal points have mean 


zero and variance one across legislators In the upper pan- 
els, the MCMC algorithm is approximately a random walk 
without drift ın the parameter space, consistent with the sam- 
pler behaving well, randomly traversing the posterior den- 
sity; the gray line indicates the posterior mean based on the 
post “burn-in” samples, and the dotted lines indicate 95% 
confidence intervals. The snaking solid lines are smoothed 
or moving averages and a cumulative mean; note that after 
sufficient iterations, the cumulative mean of the MCMC al- 
gorithm becomes indistinguishable from the posterior mean, 
and the running mean slowly undulates about the posterior 
mean, indicating lack of drift (consistent with the sampler 
having converged on the posterior density). Half a million 
iterations were computed; so to as produce approximately 
independent samples from the posterior density, only every 
1,000th iteration is retained for making inferences and the 
thinned iterations have AR(1) parameters averaging about 
.06 (maximum AR[1]=.23, for Boxer). In the lower panels, 
each joint posterior mean is indicated by an open circle, 
and jot 95% confidence intervals are indicated by el- 
lipses, the latter computed assuming that the joint poste- 
nor densities can be approximated with bivariate normal 
densities. 


Computing 


For small roll call data sets, the free, general-purpose MCMC 
package WinBUGS (Spiegelhalter et al. 1997) can be used 
to implement our approach: only a few lines of WinBUGS 
commands are needed. For instance, the WinBUGS code 
for a simple unidimensional model fitted via logit and the 
Kennedy—Helms identification constraint is given in Table 3. 
Elaborations on this basic setup are available at Jackman’s 
Web site (http//jackman.stanford.edu/ideal). Given the com- 
putational burden of analyzing larger roll call data sets, we use 
aC program, authored by Jackman, implementing the MCMC 
algorithm for the probit model discussed above. In turn, this 
program can be called directly from the (free) R statistical 
program. We also use R for preparing roll call data sets for 
analysis, inspecting the output of the MCMC algorithm, and 
producing the graphs and tables in the body of the paper. All 
code is available upon request. 


TABLE 3. WinBUGS Code, Implementing Standard Euclidean 


Spatial Voting Model Via Logit 


model { 
for(i in 1:N){ 
for(j in 1:M){ 
# model for roll calls 


## loop over legislators 
## loop over roll calls 


logit (pifa,j]) <— x[i]*beta[j,1] — betalj,2] 


yl1,j] ~ dbern(pili,4]) 
} 
} 


## priors over model paraneters 


for(i in 3:100){ 
x[i] ~ dmorm(0, .01) 
} 
x[i] <— —1 
x[2] <— 1 
for(j in 1:M){ 


#f vague normal priors 


## Kennedy constrained to —1 
# Helms constrained to 1 


beta[j,1] ~ dnorm(0,. 04) # vague normal priors 
beta[j,2] ~ dnorm(0,. 04) # mean zero, variance 25 


} 





D eee, 
Note: This code presumes that the roll call matrix has been sorted such that the voting 
histories of Kennedy and Helms are In the first and second rows, respectivety. 
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[FIGURE 5. Iterative History of the MCMC Algorithm, 106th U.S. Senate 


KENNEDY Burn-in = 1¢+06 Iterations 





COLLINS Burnin = 16+08 Iterations 








COLLINS 


Note: For the top three panels, the Itght gray horizontal line is the postenor mean (based on the post-burmin iterations), the dark solid line 
undulating around the posterior mean Is a moving average, and the dotted honzontal lines indicate the width of a 95% confidence Interval 
around the posterior mean; for the bottom three graphs, the open circle ts the joint posterior mean and the grey ellipse approxmates a 
joint 85% confidence Interval (both based on the post-burn-In Hteratons). 





Computing time is clearly a relevant consideration for 
simulation-based methods. Our experience is that computing 
time increases in mmT, where n is the number of legislators 
and m is the number of bills, and so nm 1s the number of 
individual voting decisions being modeled (assuming no ab- 
stentions or missing data), and 7 is the number of MCMC it- 
erations desired. The computational cost of moving to higher 


dimensions is surprisingly small. The unidimensional 106th 
US. Senate example involved modeling n = 102 ideal points 
and m=596 pairs of bill (or item) parameters, with 58,156 
nonmissing individual voting decisions. Half a million itera- 
tions of the MCMC algorithm required just over three hours 
on a Dell PC with a 3.0 GHz Intel processor, or about 2,650 
iterations per minute. For a one-dimensional fit to the 106th 
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House (n = 440, m=1,073, with 444,326 nonmissing indi- 
vidual voting decisions), a 150,000-iteration run took 6.4 h, 
or about 391 iterations per minute, on the same hardware. 
For a one-dimensional fit to the U.S. Supreme Court (n=9, 
m = 213, with 1,907 nonmissing individual voting decisions), 
a 500,000-iteration run took 15 mın, or about 33,250 iterations 
per minute. These extremely long runs are usually not nec- 
essary, but we were being especially cautious about ensuring 
that the MCMC algorithm had converged on and thoroughly 
explored the posterior density. 
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eck, King, and Zeng (2000) offer both a sweeping critique of the quantitative security studies field 

and a bold new direction for future research. Despite important strengths in their work, we take 

issue with three aspects of their research: (1) the substance of the logit model they compare to 
their neural network, (2) the standards they use for assessing forecasts, and (3) the theoretical and model- 
building implications of the nonparametric approach represented by neural networks. We replicate and 
extend their analysis by estimating a more complete logit model and comparing it both to a neural network 
and to a linear discriminant analysis. Our work reveals that neural networks do not perform substantially 
better than either the logit or the linear discriminant estimators. Given this result, we argue that more 
traditional approaches should be relied upon due to their enhanced ability to test hypotheses. 


became an increasingly prominent and sophis- 

ticated area of inquiry within our discipline. 
Just in American Political Science Review, more than 
20 articles over the past five years have applied some 
form of econometric technique to the study of inter- 
national conflict. In particular, estimators based on the 
general linear model have been central to the devel- 
opment of extensive literatures on deterrence, the im- 
pact of democracy and trade on international conflict, 
and many other issues. In the March 2000 issue of this 
Review (Vol. 94, No. 1, 21-36) Beck, King, and Zeng 
(hereafter, BKZ) offered a sweeping critique of these 
research programs and a bold new direction for future 
research. They contend that standard parametric pro- 
cedures are not up to the task of estimating the causes 
of international conflict because these relationships are 
“highly non-linear, massively interactive, and heavily 
context dependent or contingent” (22).! Consequently, 
improvements in theory and data are for naught, with- 
out a substitute for the inadequate, inflexible models 
based on the general linear model. As an alternative, 
BKZ introduce a statistical estimator, commonly called 
a “neural network,” that can approximate complex re- 
lationships without prior assumptions. 

The evidence for BKZ’s claim is the alleged ability 
of neural networks to improve forecasts of the onset 
of militarized disputes. We are, however, less sanguine 
about the forecasting performance of neural networks 
compared to more traditional logit models. If the goal 
is to reject an entire research paradigm, it seems appro- 
priate to use the best model that paradigm has to offer. 
But we contend that BKZ omitted several variables and 


D uring the 1990s, quantitative security studies 


Scott de March: ıs Asmstant Professor, Department of Polit- 
cal Science, Box 90204, Duke University, Durham, NC 27708 
(demarchi@duke edu). 

Christopher Gelphi is Associate Professor, Department of Po- 
litical Science, Box 90204, Duke University, Durham, NC 27708 
(gelpi@duke edu) 

Jeffrey D. Grynaviski is Assistant Professor, Department of Polit- 
cal Science, Unrversity of Chicago, 5828 S. University Ave., Chicago, 
IL 60637 (grynav@uchicago.edu). 

Listing of authors 1s alphabetical. 

1 This represents an assumption about the data-generating process 
that may or may not be correct. 


transformations of variables that the existing literature 
has suggested influence the onset of conflict. These 
omissions badly hamstrung the logit model, which is 
reliant on theory to specify a model’s functional form. 
Until the performance of a neural network is compared 
to a logit model that fully embodies “state of the art” 
theory, we would resist rejecting the parametric ap- 
proach. 

Moreover, we contend that the forecasting perfor- 
mance of neural networks is not as good as it may 
initially appear. Based on calculations using BKZ’s 
Table 1 and their standards for assessing whether a 
model predicts a conflict, their neural network correctly 
forecasted 14 (of a possible 84) disputes in the out- 
of-sample data compared to zero correct predictions 
by a logit model. Unfortunately, this success comes at 
the expense of 14 (of 2,398) nonconflicts predicted in- 
correctly as wars by the neural network but that were 
predicted correctly by their logit model. Thus, BKZ’s 
logit model yielded the same number of correct predic- 
tions as their neural network. If one argues that we are 
only interested in successfully predicting war, a model 
“superior” to either of the above would be to predict 
war everywhere, all the time. This sort of confusion 
demonstrates that we need better standards for com- 
paring statistical models. Consequently, in addition to 
evaluating the promise of nonparametric approaches 
such as neural networks, our broader goal is to provide 
a coherent set of standards that will allow researchers to 
compare different models of conflict. Underlying this 
research are three principles that we believe should 
guide such comparisons. 

(1) Models should be parsimonious (King, Keohane, 
and Verba, 1994). Researchers should avoid the inclu- 
sion of irrelevant variables unmotivated by theory. All 
else equal, more complicated models are inferior to 
simpler models, largely because the inclusion of ad- 
ditional terms or complex functional relationships in- 
creases the combinatorics of the parameter space. This 
wider parameter space makes models harder to inter- 
pret and decreases the reliability of one’s results. In 
particular, a wide parameter space raises the threat of 
overfitting a model to the idiosynchrasies of a particular 
set of data. Thus when a statistical model supports a 
theory that has been generated without regard to the 
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data, we have a higher degree of confidence in this 
relationship than when we “discover” empirical, and 
possibly random, relationships that were not hypothe- 
sized a priori. 

(2) For a statistical procedure to be useful in the- 
ory testing, it must offer interpretable and testable hy- 
potheses. Without such tests, researchers cannot de- 
termine whether the magnitude and direction of the 
effects of predictors comport with theory. As noted 
above, parsimony greatly aids in this endeavor, and 
it is, for example, relatively easy to understand the 
meaning of logit coefficients. Neural networks, with 
their reliance upon vast parameter spaces, run the 
risk of violating this criterion or, at a minimum, mak- 
ing the interpretation of predictors extraordinarily 
difficult.” 

(3) The relative merits of competing theories about 
the data-generating process should be determined by 
an out-of-sample comparison of model fit. On this point 
we agree completely with the argument advanced by 
BKZ and believe that this issue deserves more atten- 
tion in the discipline at large. Although BKZ’s argu- 
ment that forecasts are necessary is extraordinarily 
valuable, we believe that their standard for classify- 
ing predictions is too narrow. Specifically, BKZ report 
wars correctly classified without mention of the number 
of peaceful dyads incorrectly classified; additionally, 
they rely upon an arbitrary threshold for predicting 
“war,” making comparisons between different models 
very difficult. A more appropriate standard, and the 
one we follow here, is presented in King and Zeng 
2001. 

In accordance with the three criteria outlined above, 
we offer a comparison of three statistical procedures for 
explaining the onset of war. These procedures are lin- 
ear discriminant analysis, logistic regression, and neural 
networks estimation. Each of these procedures embod- 
ies a different epistemological perspective and reflects 
different assumptions about the degree to which inde- 
pendent variables interact in generating international 
conflicts. After introducing these methods, we argue 
that the neural network’s expansion of the parameter 
space suffers from two failings, which are implicit but 
not fully explained in BKZ’s discussion. Specifically, 
the neural networks approach (a) leads to unnecessary 
uncertainty about causal relationships because of inef- 
ficient estimates and (b) precludes hypothesis testing 
that focuses on measuring or comparing the effects of 
particular variables. 

Given these problems, there must be evidence of vital 
interaction effects based on out-of-sample forecasts to 
justify the use of the technique. Our empirical work 
demonstrates that this condition is not satisfied in the 
data and models presented by BKZ. Instead, we find 
that neural networks do not perform substantially bet- 
ter than traditional methods, at least compared to a 


2 In logit models, the terms are linear in the logit equation, the signs of 
coefficients are reliable, and the magnitudes of a single coefficient can 
be examined by setting other vanables equal to modal values This 
should not be confused with the broader class of nonlinear models 
represented by neural networks 
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logit model specified in a manner consistent with the 
literature and evaluated with a more appropriate ar- 
tay of forecasting diagnostics? So although BKZ have 
offered a valuable service to the discipline by raising 
neural networks as a viable alternative, the rejection of 
the traditional logistic approach is not nearly as deci- 
sive. 


THREE MODELS OF MILITARIZED 
INTERSTATE DISPUTES 


One of the most valuable contributions of BKZ’s essay 
is that it has forced researchers to question whether 
orthodox statistical procedures are flexible enough to 
model a process as complex as the initiation of in- 
ternational conflicts. In the spirit of BKZ’s essay, we 
compare neural networks, logistic regression, and lin- 
ear discriminant analysis. Each of these is based on 
a different assumption about the complexity of the 
phenomenon under investigation. These three models 
can be naturally ordered from most complex (neural 
networks) to least (linear discriminants). The advan- 
tage of a neural network is its ability to estimate arbi- 
trary functions, including causal relationships charac- 
terized by high degrees of interactivity and nonlinear- 
ity (Bishop 1995). Accordingly, it allows researchers 
to model the massively nonlinear, interactive conflict 
generation process described by BKZ. The model least 
able to model complex processes is the linear dis- 
criminant. This approach estimates a separating hy- 
perplane that divides war from peace dyads, allowing 
for neither interactions nor nonlinearities. Logit mod- 
els lie in between these extremes. Like the linear dis- 
criminant, logit models presume linear effects for the 
independent variables. However, logit adds some com- 
plexity by allowing the researcher to make ex ante 
specifications of expected non-linearities and interac- 
tive effects. 


THEORY BUILDING IMPLICATIONS OF 
NEURAL NETWORKS 


While neural networks are more flexible than the gen- 
eral linear model, this flexibility comes at considerable 
cost. At its core the application of neural networks re- 
flects a choice to deemphasize ex ante theory devel- 
opment in favor of an unparsimonious and inductive 
mode] of the data-generating process, Rather than us- 
ing theory to generate hypotheses about the nature 
of the functional relationships and adding a handful 
of appropriate terms to traditional models to capture 
those effects, BKZ’s neural network increases the size 
of the parameter space almost 30-fold, allowing the au- 
thors to estimate an arbitrary function. Inflating the 
size of the parameter space causes inefficiency and of- 
ten results in “overfitting the data.” Even though the 
resulting inferences may be unbiased, King, Keohane, 


3 In fact, the logit often outperforms neural networks on crucial di- 
mensions, such as the ability to rank cases from most to least likely 


to experience a dispute. 
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and Verba (1994) identify the potentially serious con- 
sequence: “When we replicate a study in a new data 
setin which there is a high correlation between the key 
explanatory variable and an irrelevant control variable, 
we will be likely to find different results, which would 
suggest different causal inferences” (183).‘ 

The underlying problem is best illustrated by an ex- 
ample, Suppose that a hypothetical researcher, with 
neither knowledge of nor concern for theoretical 
claims, acquired a data set collected by someone else. 
Imagine that our researcher then estimated a logit 
model with a large set of nonlinear transformations 
(logs, squares, etc.) for each variable and inserted all 
possible two- and three-way interactive effects among 
the variables. This model would not be viewed as an 
important contribution to the literature but, rather, as 
an exercise in overfitting. But this approach is based 
on the same assumptions and a comparable number of 
parameters as used by BKZ for their neural network. In 
the absence of prior hypotheses about when particular 
variables are likely to influence outcomes and why, at 
best we can induce theory based on our findings, not 
test theory. 

These problems are compounded by the fact that 
BKZ’s results are difficult to interpret. For example, 
the use of a neural network makes it impossible to 
test hypotheses about the magnitude and direction of 
a predictor’s influence on the dependent variable. The 
problem is that in an interactive, nonlinear model, the 
relationship between the dependent variable and a par- 
ticular independent variable is heavily contingent on 
the values of the other independent variables in the 
model. This is not just a more complicated case of the 
difficulty in interpreting logit coefficients. At a mini- 
mum, the logit model’s greater parsimony ensures that 
coefficients do not change signs, even if the magnitude 
of the effects may be somewhat cumbersome to evalu- 
ate. In neural networks, even the direction of a causal 
relationship can fluctuate with relatively minor permu- 
tations of the other variables in the model. This implies 
that the marginal effect tables presented in BKZ’s ar- 
ticle may be misleading, because it is not possible to 
determine the sensitivity of the findings to changes in 
the other variables. 

It is important, however, to give credit to BKZ’s 
argument that out-of-sample testing in part mitigates 
the harms cited above. In our view, their emphasis on 
prediction, rather than repeated efforts to fit models to 
a fixed data set, is a genuine contribution and should be 
heeded broadly in political science. Despite our agree- 
ment on this point, it is worth remembering that out- 
of-sample work only decreases the probability that an 
overfitted model will be selected. In essence, predictive 
work means that one must find a model that tolera- 
bly fits the sample as well as the out-of-sample set. It 





4 To a certain extent, the use of out-of-sample tests insulates BKZ 
from thts problem, however, it should not be surprising to find that 
the neural network with almost as many parameters as conflict events 
would be outperformed by a much simpler model like logit or dis- 
crimunant analysis 


should be obvious that given time, ardent data min- 
ers will arrive at a model that satisfies these criteria. 
Although BKZ did not elect to follow this approach 
(i.e., they chose their model solely upon the in-sample 
performance before generating out-of-sample predic- 
tions, as did we), not all researchers will be puritans 
in this regard. Without the ability to examine a theory 
that researchers believe explains the data-generating 
process, it is very difficult, even with predictive work, 
to detect an overfitted model. 


BROADENING THE STANDARDS FOR 
ASSESSING FORECASTS 


In addition to our concerns about the role of theory in 
building models, we believe that BKZ failed to present 
consistent standards for evaluating their out-of-sample 
results. BKZ rightly remind us that forecasts on out-of- 
sample data help to distinguish between models that ac- 
curately reflect the underlying data-generating process 
and those arrived at through atheoretical curve-fitting. 
That is, forecasting indicates whether a model reflects 
the “true” causal process driving the phenomena of 
interest and guards us against “taking advantage of 
some idiosyncratic feature of the data” (BKZ, 21). Pre- 
dictive success, however, should not be judged against 
an arbitrary 0.5 probability threshold. Given that fore- 
casting is the goal, we believe that there are at least 
two sets of standards that one might use to determine 
the accuracy of a model’s forecasts: (1) the model’s di- 
chotomous predictions of “success” and “failure” and 
(2) the model’s predicted probabilities of “success” and 
“failure.” 


Evaluation of Dichotomous Forecasts 


This is the primary standard emphasized by BKZ. But, 
the thrust of their argument focuses on the ability of 
their neural network to occasionally forecast a dispute 
with a probability greater than 0.5. This measure of 
forecast accuracy is extremely suspect and should not 
be used without a decision-theoretic justification. The 
problem is that we typically face trade-offs between 
models that forecasts a large number of false nega- 
tives (failure to predict disputes) and alternatives with 
a greater number of false positives (failure to predict 
peace). The magnitude of this trade-off is dramatically 
revealed by BKZ’s own results, in which the authors 
fail to generate more overall correct predictions than 
their own logit model. 

In general, the use of any arbitrary cutoff point to 
discriminate between “peace” and “war” or “success” 


5 We cannot emphasize enough how brittle the choice of 0.5 18. A 
poor man’s approach to logit that would yield more predictions of 
war would be to estimate from the training data a constant that adds 
to the logit model’s Pr(war) such that one predicts conflict 4.1% of 
the time (Le , our expectation of war prior to forecastmg). Because 
the choice of this constant uses only the traiming set (L¢., NOT the 
test set), ıt is a pamless way to arrive at a greater number of successful 
predictions of conflict in one’s forecasts 
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and “failure” in classification tasks is risky, and may 
simply be inappropriate (Greene 1997, 892-93; King 
and Zeng 2001, 11-3; Swets 1988, 1285-93). Theoretical 
work on international conflicts provides us with an ad- 
ditional worry in choosing such a threshold. Statistical 
models provide the predicted probability of a conflict, 
but this probability may be low in all cases. As noted 
in Greene, “0.5, although the usual choice, may not 
be a very good value to use for the threshold. If the 
sample is unbalanced—that is, has many more 1’s than 
0’s, or vice versa—then by this prediction rule, it might 
never predict a 1 (or 0).... The obvious adjustment is 
to reduce [the threshold]” (892). 

Of course, one might view the generally low prob- 
abilities generated by logit models as a problem than 
can be fixed by neural networks. However, even wars 
that ultimately do occur may have been generated by 
circumstances where the ex ante probability of war was 
less than 0.5 (Fearon 1995; Gartzke 1999). For example, 
if we view war as “off-equilibrium” behavior (Gartzke 
1999), then the precise timing of the outbreak of mili- 
tary conflict may result from some combination of id- 
iosynchratic events. In this case any attempt to build 
systematic statistical models that generate high ex ante 
probabilities of military disputes will inevitably become 
an exercise in overfitting a particular dataset. The prob- 
lem of low ex ante probabilities does not mean that logit 
models never predict conflict. After all, logit models 
can meaningfully distinguish situations in which the 
probability of conflict is negligible from those in which 
the risk is substantial. Moreover, if we observe four 
cases with predicted probabilities of 0.2, then the ex 
ante probability of conflict in at least one of those cases 
is over 0.5.—even if the model cannot predict exactly 
when crises will erupt. 

King and Zeng (2001) acknowledge in subsequent 
work that it is better to investigate the trade-offs be- 
tween false positives and false negatives for a variety 
of predictive thresholds, and not penalize a model pre- 
disposed to predictions biased too high or too low. One 
way to look at different thresholds would be to generate 
a huge number of classification tables. A better solu- 
tion, however, is to use receiver—-operating characteris- 
tic (ROC) curves. ROC curves are diagnostics that are 
able to cope with the trade-offs between false positives 
and false negatives in model assessment (Swets 1988). 
These curves plot the proportion of conflicts correctly 
predicted on the x-axis and the proportion of noncon- 
flicts correctly predicted on the y-axis. The intuition 
behind the graph is that any threshold used as the cutoff 
between a conflict and a peace prediction will corre- 
spond to a single point on this curve. The area below a 
single point on the curve corresponds to the proportion 
of true negatives for that cutoff, whereas the area above 
the point indicates the proportion of false positives. 
Similarly, the area to the left of a point corresponds 
to the proportion of true positives, whereas the area to 
the right of the point represents the proportion of false 


6 See Morrow 1989 for an early attempt to address this problem with 
international conflict data. 
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negatives. For example, if the cutoff is zero, then all 
disputes (but no cases of peace) are predicted correctly. 
Finally, as the cutoff varies over the range between zero 
and one, the curve will be negatively sloped, as fewer 
conflicts and greater numbers of peaceful dyads are 
forecast correctly. 

The key point to glean from a pair of ROC curves 
used for model comparison is that the curve with more 
area underneath it corresponds to a greater propor- 
tion of successful predictions for both war and peace, 
regardless of what arbitrary threshold is settled upon 
for predicting war. In the absence of a specified opti- 
mal threshold, the area under an ROC curve provides 
a useful summary statistic that can arbitrate between 
competing models. 


Evaluating Forecast Probabllttles 


The second set of standards used to evaluate fore- 
casts relates to properties of a model’s predicted prob- 
abilities rather than its dichotomous forecasts. These 
attributes can be derived from the joint distribution 
p(f,x), of the forecast probability f and the event 
x (Murphy and Winkler 1992). This is known as the 
“calibration-refinement factorization” of this distribu- 
tion, where 


PCF. x) = ppf) 


factors the original joint distribution into separate com- 
ponents that reflect two desirable properties that a 
forecast should possess. First, the distribution of the 
forecast probabilities, p( f), indicates whether a model 
generates forecast probabilities that vary widely across 
the zero-to-one spectrum or if, instead, all predictions 
tend to cluster around a single value. This property is 
known as refinement and can be diagnosed using the 
variance of the predicted probabilities, a statistic that 
succinctly summarizes the dispersion around the mean 
prediction. Second, if f = p(x =1| f), then the pre- 
dicted probability is a perfect reflection of the actual 
probability of conflict in a given situation. This prop- 
erty, known as calibration, would be perfectly satisfied, 
for example, if conflicts actually occurred in 10% of 
the dyads for which the forecasted probability was 0.1, 
and a similar relationship held for all other forecasted 
probabilities. 

A useful diagnostic for calibration is the diagram 
used by BKZ in which forecasts were placed into bins 
of width 0.1 corresponding to whether the probabilities 
were in one of the following intervals: {(0, 0.1), (0.1, 
0.2),..., (0.9, 1)}. For each bin, the mean predicted 
probability (e.g., 0.05 for the first bin, 0.15 for the second 
bin, and so on) is plotted on the x-axis versus the ob- 
served proportion of cases with conflict in that category. 
If the plot in the calibration table falls along the 45° line, 
then we would say`that the forecast is well calibrated 
because for each of the bins f © p(x =1| f). 

When some bins are sparsely populated, calibra- 
tion becomes difficult or impossible to assess with the 
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diagram. In such cases a second useful diagnostic is the 
calibration index (CI). This has the form 


GF NG -xP / ON, 
] J 


where j indexes bins j = 1,...,10, N, denotes the num- 
ber of observations in bin j, f, the mean prediction in 
bin j, and x, the relative frequency of conflicts for ob- 
servations in bin j (Yates 1990). Simply put, CI is the 
average departure of a calibration plot from the 45° 
line weighted by the number of observations in each 
bin. Smaller values of CI indicate a better-calibrated 
probability model. 


DATA 


We base our analyses on the data set used by BKZ 
and analyze the initiation of militarized disputes within 
“politically relevant dyads” between 1947 and 1989. 
The data included 23,529 dyad years; 976 of these years 
include a militarized dispute. As noted above, BKZ 
did not utilize a logit model that took into account 
the major findings of the quantitative securities stud- 
ies literature. To rectify this problem, we added to our 
analysis variables and nonlinear transformations that 
we believe this literature has established as central to 
any model of dispute initiation. These choices represent 
a parsimonious addition of terms that are justified by 
previous research, and the success or failure of these 
terms in a logit model or discriminant analysis is open 
to inspection. We also included these new variables 
in our neural network and discriminant analyses, with 
the exception that we allowed the neural network to 
estimate the supposed nonlinearities and interactions 
endogenously. Thus, all three estimators are on a level 
playing field. Changes to the BKZ forecasting model 
are as follows. 


Asymmetry of Military Capabilities. BKZ include 
asymmetry in their logit model, but most work (e.g., 
Oneal and Russett 1999) has hypothesized that the re- 
lationship between military capabilities is curvilinear. 
Importantly, these same studies have found substantial 
support for this hypothesis. Thus we also include the 
square of the asymmetry value in order to account for 
this relationship. 


MajJor-Power Status. It is well established that major- 
power states are much more likely to engage in mil- 
itary conflict. (Bremer 1992; Maoz and Russett 1993; 
Oneal and Russett 1999). We add this variable to all 
three models, based on data from the Correlates of War 
(COW) data set. 


Distance. Although BKZ include a dummy variable 
identifying contiguous states, the actual distance be- 
tween states has also been shown to have a substantial 
impact on military conflict (Bremer 1992; Maoz and 
Russett 1993; Oneal and Russett 1999). Therefore, we 
generated a variable to measure the distance between 


each pair of states in a dyad using the EUGene program 
(Bennett and Stam 2000). 


Democracy. BKZ include democracy in their models 
but do not specify the impact as interactive (Bueno de 
Mesquita and Lalman 1992; Maoz and Russett 1993, 
Rousseau et al. 1996). This interactive specification is 
standard in the literature and thus we include the inter- 
action of the two democracy scores in the dyad (rescal- 
ing the democracy variables appropriately). Addition- 
ally, a number of scholars suggest that the impact of 
democracy on conflict may be curvilinear (Goemans 
2000; Mansfield and Snyder 1995; Snyder 1991). We 
model this theorized curvilinear effect by including the 
square of the interaction of regime-types. 


FORECASTING MILITARIZED DISPUTES: 
A SECOND LOOK 


Before discussing the results of the competing models, 
note that all models were estimated using a pre-1985 
training set, as in BKZ. We do, however, make two 
departures from the original work. First, we extracted 
two different subsamples from the main data set to use 
for out-of-sample forecasts. One of these samples was 
the test set reported in BKZ, consisting of all dyads 
in the years after 1985. We also withheld a second test 
set by drawing a 5% uniform random sample of the 
dyad-years from 1947 to 1985. The latter test set was 
used to test for robustness, and serves as a useful tool 
to determine whether the particular cutoff of 1986 for 
BKZ’s test set might be fortuitous, aiding or hurting 
different models.” Second, we do not present results 
from the training set; to do so may artificially inflate the 
models’ apparent performances and deflect attention 
from their out-of-sample performance. 

We report the results from a logit model estimated 
in the standard manner; a linear discriminant model 
estimated using a method similar to that proposed by 
Fisher, assuming that each class has an equivalent co- 
variance matrix; and two different neural networks es- 
timated with the same basic functional form as that 
estimated by BKZ.° We report results for two differ- 
ent neural networks because our two test data sets 
identified different neural nets as optimal. The first 
neural network maximized the number of correct 


7 We also generated additional test sets using different thresholds 
(eg., 1984 instead of 1986) and found that all models performed 
substantially worse than in the 1986-89 test set. 

8 We implement the Levenberg-Marquardt (LM) algorithm with 
Bayesian regularization to penalize overfitting and a variable learn- 
ing rate for computational efficiency in MATLAB’s Neural Network 
Toolbox. Because LM is a hill-climbing algorithm that can be sensi- 
tive to the initial parameter values, we also estimated multiple models 
with random starting values for each number of hidden units (Bishop 
1995). Following BKZ, to determine the number of hidden units we 
varied their number and compared their forecasting performance on 
a set of observations withheld from the training set. We find that 
15 hidden units seemed to generally offer the best fit and report 
the results for two neural networks that possessed different sets of 
initial values that performed “best” on different forecasting criteria 
reported below 
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TABLE 1. Area Under ROC Curves 


Forecast Set 








Untform Draws 
(Pre-1985) 


Model 
Neural net 
Classification 0 8013 
ROC area 0.8555 
Logt 0.8372 
Unear discriminant 0.8148 


Post-1985 





0.8701 
0.8685 
0.9152 
0.8722 








dichotomous predictions using 0.5 as the prediction 
threshold in the training set. This model was optimal 
for the post-1985 test set (though not for other thresh- 
olds; see footnote 6). As we noted above, we do not 
advocate this criterion, but we report the model so as 
to present the best possible performance according to 
BKZ?’s standards. The second neural net maximized the 
area under the ROC curve in the training set, and so 
it attempts to maximize forecasting accuracy across a 
variety of predictive thresholds. This was optimal for 
the uniform draw test set. 


Dichotomous Predictlons 


The abilities of the four models to rank-order cases 
from most to least likely were evaluated using ROC 
curyes. Table 1 reports the area under the ROC curves 
and Figure 1 plots the ROC curve for each model. Per- 
haps the strongest support for BKZ, with respect to 
the ROC curves, can be found in the forecasts on the 
pre-1985 uniform draw test set. As Table 1 indicates, 
the neural network that maximized the area under the 
ROC curve in the training set marginally outperformed 
all of the other models in this forecast set. Specifically, 
the area under its ROC curve was 0.8555, whereas the 





area under the ROC curve for the logit estimator was 
0.8372. 

Given the rarity of militarized disputes, one should 
not dismiss even a modest increase in forecasting ac- 
curacy, but one cannot avoid the impression that the 
difference between these models is quite small. In fact, 
even the ROC-maximizing neural network is not the 
best estimator for all possible predictive thresholds. As 
indicated in Figure 1, the plots for the logit and ROC- 
maximizing neural network tend to weave around one 
another, indicating that the forecasts of these estima- 
tors are nearly indistinguishable across the range of 
thresholds. Indeed, if any of the estimators stands out 
from the others on the pre-1985 test set, it is the overall 
weakness of the neural network that focuses exclusively 
on the 0.5 classification threshold. 

Our results are even less supportive of BKZ when 
we examine the post-1985 test data set. In this case, 
the logit and discriminant models both outperformed 
the neural networks. Table 1 reports that the logistic 
regression had the greatest area under its ROC curve, 
at 0.9152, while the linear discriminant had the second 
greatest, at 0.8722. Figure 1 demonstrates that for 
practically any given tolerance of false positives, the 
logit model either outperformed or was indistinguish- 
able from its rivals. The overwhelming impression left 
by this set of results is that there is little difference 
among the neural network, logit, and discriminant anal- 
yses in terms of their ability to distinguish between 
dispute and nondispute cases, once one controls for a 
model’s inherent bias toward predictive probabilities 
that are either too high or too low. 


Predicted Probabllitles 


The calibration index reported in Table 2 and the cal- 
ibration diagrams in Figure 2 indicate the extent to 





FIGURE 1. 
Uniform Draw and Post-1985 Test Sets 


ROC Curves for Untform Draw 
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TABLE 2. Callbratlon and Refinement of Predicted Probabllities 
Forecast Set 
Calibration Index Refinement 
Random Draws Random Draws 

Model (Pre-1985) _ Post-1985 _(Pre-1985) Post-1985 
Neural net 

Classification _ 0.0102 0.0094 0.0095 0.0090 

ROC area 0.0094 0.0089 0.0087 0.0084 
Logit 0.0073 0.0076 0.0061 0.0068 
Unear discriminant 0.0223 0.0129 0.0182 0.0173 





which the forecast probabilities are consistent with the 
actual probability of a war. The calibration index re- 
ported in Table 2 suggests that the logistic regression 
offered the best-calibrated set of predictions for both 
of the test sets, followed closely by the two neural 
networks. The linear discriminant provided the most 
poorly calibrated results, which is not surprising given 
the restrictive assumptions made by this model. The 
plots in Figure 2 for the uniform draws from the pre- 
1985 period confirm these findings. Both the logit model 
and the neural network that maximized the area under 
the ROC curve in the training set had calibration plots 
that fell roughly along an imaginary 45° line from the 
origin. Thus, for both of these models, the forecast 
probability of a dispute was approximately equal to the 
relative frequency of conflicts at a given predicted prob- 
ability. The plot for the neural network that minimized 
the number of classification errors in the training set gy- 
rates widely, but this model still performed well in terms 
of the calibration index, because there were relatively 
few observations in the range of forecast probabilities 
where the fluctuations were greatest. 

Plots for the post-1985 test set suggest a level of cal- 
ibration similar to that found from the uniform draws. 


As indicated in Table 2, logit continues to be the best- 
calibrated model, followed closely by the neural net- 
work that maximized the area under the ROC curve. 
Figure 2 indicates that predictions for the logit model 
fall approximately along the imaginary 45° line. The 
logit’s calibration curve does drop sharply when it 
reaches above 0.5, but this drop is due to a single in- 
correct prediction with a probability greater than 0.6. 
With the exception of that error, the logit model re- 
mains well calibrated, as confirmed by the results from 
the calibration index in Table 2. The neural network 
that minimized the number of classification errors at 
the 0.50 threshold appears to be better calibrated in the 
post-1985 test set than in the previous test. Nonetheless, 
it continues to rank third among the four estimators. 
The level of refinement for these models is also 
diagnosed in Table 2, which reports the variance in 
the predicted probabilities for the three models. As 
BKZ suggest, the forecasts of the neural networks are 
spread across a wider range of values than are the logit 
model’s forecasts. This higher level of refinement in- 
dicates that it might be easier for decision makers to 
recognize the changes in the predicted probabilities 
of neural network models as substantively meaningful. 
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However, the neural networks are themselves badly 
outperformed on this diagnostic by the linear discrimi- 
nant, which has approximately twice the forecast vari- 
ance in both test sets. 


SUMMARY AND CONCLUSIONS 


BKZ offered a provocative claim: Estimators based on 
the general linear model fundamentally misspecify the 
dispute generation process. If correct, this claim calls 
decades of research on international conflict into ques- 
tion. Theoretical debates on deterrence, the democratic 
peace, economic interdependence, and many other is- 
sues have all relied on the general linear model for test- 
ing their competing arguments. In this paper, we sought 
to determine whether this generation of scholarship 
should be rejected because of its reliance on inappropri- 
ate methodologies. We conclude that the answer is no, 
that there is little evidence that the general linear model 
is fundamentally biased or incomplete. Our empirical 
work shows that although neural networks may make 
more extreme predictions about the probability of in- 
ternational conflict than logit, they do not offer better 
forecasts of whether or not states engage in militarized 
disputes. The ROC curves, in particular, demonstrate 
that neural networks are not better at discriminating 
between dispute and nondispute cases. In fact, in some 
out-of-sample forecasts, logit proved to be the better 
estimator. Even the linear discriminant—which makes 
assumptions about the data-generation process that are 
directly opposite those of the neural network—does 
nearly as well as the modestly nonlinear logit and mas- 
sively interactive and nonlinear neural network, pro- 
viding some indication that the process that generates 
wars is less complex than argued by BKZ. 

Why is it that we found a logit model to be a perfectly 
adequate predictor of international conflicts, whereas 
BKZ did not? The main source of this discrepancy was 
our inclusion of additional variables and transforma- 
tions of variables that are standard in quantitative mod- 
els of international conflict. BKZ’s omission of these 
relevant variables disadvantaged the logit model more 
than the neural network because the latter estimator, 
in a beautiful way, is able to endogenously estimate 
proxies for the omitted variables. However, once we 
added the relevant variables to the analyses, a neural 
network was no longer needed to endogenously esti- 
mate these nonlinearities. And as one might expect, 
the logit’s greater efficiency made it more robust in 
out-of-sample forecasts than the neural network. 

In the prediction of international disputes, it is clear 
that neural networks hold few unambiguous advan- 
tages over fully specified and theoretically grounded 
logit models. And it is worth remembering that neu- 
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ral networks carry significant costs, as one forfeits the 
opportunity to measure particular parameters and test 
their statistical significance. If we are in the business 
of theory testing, then we should select our statistical 
estimators to reflect the level of complexity specified 
by our theories. We are not aware of any theories in 
the study of international conflict that specify a func- 
tional form so complex as to require, or even suggest, a 
neural network. In the absence of such theories, mod- 
els should be constrained to a functional form that 
does not differ substantially from more traditional esti- 
mators. 
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n this article, we show that de Marchi, Gelpi, and Grynaviski’s substantive analyses are fully consistent 
with our prior theoretical conjecture about international conflict. We note that they also agree with our 
main methodological point that out-of-sample forecasting performance should be a primary standard 
used to evaluate international conflict studies. However, we demonstrate that all other methodological 
conclusions drawn by de Marchi, Gelpi, and Gryanaviski are false. For example, by using the same 
evaluative criterion for both models, it is easy to see that their claim that properly specified logit models 
outperform neural network models is incorrect. Finally, we show that flexible neural network models are 
able to identify important empirical relationships between democracy and conflict that the logit model 
excludes a priori; this should not be surprising since the logit model is merely a limiting special case of 


the neural network model. 


e thank Scott de Marchi, Christopher Gelpi, 

and Jeffrey Grynaviski (2004; hereafter (GG) 

for their careful attention to our work (Beck, 
King, and Zeng 2000; hereafter BKZ) and for raising 
some important methodological issues that we agree 
deserve readers’ attention. We are pleased that dGG’s 
analyses are consistent with the theoretical conjecture 
about international conflict put forward in BKZ—“The 
causes of conflict, theorized to be important but often 
found to be small or ephemeral, are indeed tiny for 
the vast majority of dyads, but they are large, stable, 
and replicable whenever the ex ante probability of con- 
flict is large” (21)-—and that dGG agree with our main 
methodological point, that out-of-sample forecasting 
performance should always be one of the standards 
used to judge studies of international conflict and, in- 
deed, most other areas of political science. 

However, dGG frequently err when they draw 
methodological conclusions. Their central claim in- 
volves the superiority of logit over neural network 
models for international conflict data, as judged by 
forecasting performance and other properties such 
as ease of use and interpretation (“neural networks 
hold few unambiguous advantages...and carry sig- 
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nificant costs” relative to logit [dGG, p. 378]). We show 
here that this claim, which would be regarded as stun- 
ning in any of the diverse fields in which both meth- 
ods are more commonly used, is false. We also show 
that dGG’s methodological errors and the restrictive 
model they favor cause them to miss and mischaracter- 
ize crucial patterns in the causes of international con- 
flict. 

We begin in the next section by summarizing the 
growing support for our conjecture about international 
conflict. The subsequent section discusses the theoret- 
ical reasons why neural networks dominate logistic re- 
gression, correcting a number of methodological er- 
rors. The next section demonstrates empirically, with 
the same data as used in BKZ and dGG, that neural 
networks substantially outperform dGG’s logit model. 
We show that neural networks improve on the forecasts 
from logit as much as logit improves on a model with 
no theoretical variables. We also show how dGG’s logit 
analysis assumed, rather than estimated, the answer to 
the central question about the literature’s most impor- 
tant finding, the effect of democracy on war. Because 
this and other substantive assumptions underlying their 
logit model are wrong, their substantive conclusion 
about the democratic peace is also wrong. The neu- 
ral network models we used in BKZ not only avoid 
these difficulties, but they, or one of the other methods 
available that do not make highly restrictive assump- 
tions about the exact functional form, are just what is 
called for to study the observable implications of our 
conjecture. 


SUPPORT FOR BKZ’S CONJECTURE 
ABOUT INTERNATIONAL CONFLICT 


The explanation of what drives international conflict 
put forward in BKZ (22) was built on a simple conjec- 
ture, that “the effects of most explanatory variables are 
undetectably small for the vast majority of dyads, but 
they are large, stable, and replicable when the ex ante 
probability of conflict is large.” That is, our point is that 
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theories of international conflict that have a one-size- 
fits-all approach to regions and time periods should be 
replaced with theories that reflect the highly contingent 
and context-dependent nature of the phenomenon. We 
showed that this simple idea accounts for many specific 
observable implications consistent with the evidence. 
For example, if our conjecture is correct, then statisti- 
cal analyses using methods that specify nearly identical 
effects for all observations, such as the logit models that 
had previously dominated the literature, should reveal 
apparently small to nonexistent effects or effects that 
vary across specifications. Such is indeed the case in 
most of the literature. Second, when the effects in the 
subset of high ex ante probability of war dyads are suffi- 
ciently large, these effects would be robust across speci- 
fications, even using logit. This is indeed the case for rel- 
atively atheoretical but powerful variables such as time 
since the last war and contiguity. Third, our conjecture 
implies that small changes in the sets of dyads included 
in an analysis would lead to disproportionate effects on 
the results, which would appear to account for much of 
the instability of results across articles in the literature. 
Fourth, when combined with the strong priors that ex- 
ist among scholars in international conflict studies, our 
conjecture would lead us to expect to see results that 
vary considerably from researcher to researcher, which 
is precisely what we see. Fifth, data subsetting practices 
in the literature, such as limiting analyses to politically 
relevant dyads, would be expected to generate selec- 
tion effects that strengthen results compared to the full 
data set, but the results would still not be as large as 
plausibility checks or qualitative researchers suggest. 
All these implications are consistent with evidence in 
the literature. 

Our conjecture is also strongly supported by the neu- 
ral network analyses in BKZ, which estimated and re- 
vealed the nonlinearities and massive interactions di- 
rectly, and by the new analyses in Lagazio and Russett 
(2002). dGG’s model also unambiguously reveals their 
estimated relationships to be much flatter or nonexis- 
tent among the low ex ante probability of war dyads, 
just as our approach predicts. Of course, logistic re- 
gression is a much more limited procedure in terms of 
the types of empirical results that are possible for it to 
produce. In fact, the effect of an explanatory variable 
in the usual logit model, measured in terms of a small 
change ın an explanatory variable on the probability of 
war (i.e., a derivative), is restricted to be between only 
zero and one-quarter of the respective coefficient, at a 
maximum, and much narrower for low-probability rare 
events; these probabilities will also vary inflexibly, and 
very little as the data change, from dyad to dyad. Hence, 
some of the results consistent with our conjecture were 
effectively assumed by dGG as part of their logit model 
specification, rather than estimated; these parts imply 
dGG’s implicit theoretical agreement with our conjec- 
ture, hard coded in their choice of specification, rather 
than direct empirical support. The nonlinear portions 
of dGG’s specification means that the derivative of the 
probability of war with respect to democracy can vary 
a good deal more than their other variables, and so 
provide a somewhat better test of our conjecture, al- 
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though still limited in terms of the types of results it can 
model. 


WHY NEURAL NETWORKS DOMINATE 
LOGIT: THEORY 


In this section, we provide the theoretical reasons why 
neural networks dominate logit models for the analysis 
of international conflict data’ and correct a variety of 
methodological errors in dGG. We show that neural 
networks do not make “the interpretation of predic- 
tors extraordinarily difficult” as dGG claim, they do 
not produce “inefficient estimates,” and they generate 
no extra “uncertainty about causal relationships”; and 
their claims that neural networks “preclude hypothesis 
testing” and that “it is not possible to determine the 
sensitivity of findings to changes in other variables” 
and “impossible to test hypotheses about the magni- 
tude and direction of a predictor’s influence on the 
dependent variable” are also false. Neural networks in 
fact are quite standard statistical models requiring no 
new “epistemology” or theory of inference. We now 
clarify these and several other points. 


Eplstemology and Interpretabllity 


dGG claim that neural networks “embody a different 
epistemological perspective” from logistic regression. 
This claim is false. Neural network models may have a 
strange name, but they embody no mystery and, as com- 
monly used, are no harder to interpret than logit mod- 
els. As BKZ demonstrate, neural networks are ordinary 
statistical models, just like logit or linear regression. 
They work completely within the standard Bayesian or 
likelihood theories of inference, just like logit. 

To see these points, let Y, be 1 if dyad i is at war and 0 
if it is at peace, and denote X, a set of explanatory vari- 
ables. Then, as we describe in BKZ, a linear regression 
(or “linear probability”) model can be written 


Pr(¥; = 1) = linear(X,), 


where linear (X,) = X,f, and a logit model can be 
written 


Pr(¥, = 1) = logit(linear(%,)), 


where logit(a) = 1/(1 +e). The extra logit function 
makes logistic regression parameters harder to in- 
terpret, and indeed the vast majority of published 
articles in political science do not interpret the param- 
eters of logit models directly. Most political scientists 
instead choose to compute predicted values, first differ- 
ences, and other quantities of interest (King, Tomz, and 
Wittenberg 2000). Neural network models are simple 
generalizations, which can be written 


Pr(¥, = 1) = logit(linear(logit(linear(X,)))). (1) 


1 We could also show why neural networks also dominate discrimi- 
nant analysts, but because the latter is not commonly used ın political 
science, dGG focus on logit, and there exist many good reasons to 
prefer logit to discriminant analysis, we consider only dGG's logit 
here. 
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Just as with logit models, political scientists who use 
neural network models do not interpret their coeffi- 
cients directly and instead report predicted values, first 
differences, or other quantities of interest (Andreou 
and Zombanakis 2001; Bearce 2000; Beck, King, and 
Zeng 2000; Borisyuk et al. 2001; King and Zeng 2002; 
Lagazio and Russet 2002; Zeng 1999, 2000). We see no 
reason to think that a predicted probability of war is 
any harder to interpret, no matter how it is calculated. 
Neural networks are certainly less familiar to political 
scientists, but in our view the enormously important 
public policy issues at stake mean that the improved 
forecasting performance that comes from using these 
techniques that we demonstrate in BKZ and the reanal- 
ysis below should outweigh any inconvenience some 
researchers may have in learning new methods. 

dGG are incorrect that one cannot compute uncer- 
tainty measures such as standard errors, confidence in- 
tervals, or hypothesis tests about quantities of interest 
in neural network models. The marginal effect plots 
in BKZ report error bars from a neural network anal- 
ysis, which portray confidence intervals, and one can 
use standard Bayesian or frequentist theory to com- 
pute these for any quantity in the same manner as 
one computes it for logit. Of course, in any statistical 
model, hypothesis tests should not focus on arbitrarily 
parameterized coefficients but, rather, should be about 
quantities of real substantive interest to researchers? 


Model Flexibility 


Although it has a parametric form that is almost as 
straightforward as logit, neural network models do 
have advantages over logit. They, but not logit mod- 
els, have “arbitrary approximation capabilities” (White 
1992). This means that at least one member of the 
neural network family of models (or a neural network 
model with a sufficient number of hidden neurons) can 
approximate any functional form suggested by the data, 
even if not specified by one’s theory ex ante. This is a 
tremendous advantage, as it enables one to estimate 
relationships not known from prior theory. What logit 
models do in contrast is to make assumptions, some 
based on theory and some based on convenience, all 
of which require one to ignore empirical evidence. In 
contrast, neural network modeling is a more power- 
ful information extraction tool. In fact, neural network 
models include logit models as limiting special cases, 
and so a proper use of neural network models should 
always outperform logit, at least in expectation (Hastie, 
Tibshirani, and Friedman 2001). Indeed, if the logit 
specification is correct, a competent neural network 
modeler should find that something arbitrarily close to 
the logit is the preferred specification. Moreover, both 
logit models and neural network models can be used for 


2 For our models reported in the text, we compute probabilities, as 
the building blocks for our quantities of mterest, by mtegrating over 
the posterior distribution of the parameters. This incorporates all 
information on uncertainty in our final estimates, rendermg signfi- 
cance tests irrelevant and, for rare events data like wars, reducing 
mean square error (King and Zeng 2001). 


testing any relevant hypothesis; the only difference is 
that under logit the validity of these tests is conditional 
on a variety of more stringent assumptions. 

The logit specification includes a whole range of re- 
strictive assumptions that no prior theory or data in in- 
ternational conflict supports and no method other than 
those with flexible functional forms like neural net- 
works is capable of testing. For example, the only func- 
tional forms to have been derived from formal theories 
of international conflict are massively violated by the 
restrictions of logit models, especially that logit proba- 
bilities are usually monotonic functions of the explana- 
tory variables (Signorino 1999). Moreover, Signorino 
and Yilmaz (2003) prove that if even the simplest form 
of strategic interaction exists among the dyads, then the 
restrictions inherent in logit models make its estimates 
“biased and inconsistent.” In contrast, neural network 
models can approximate and thus test models derived 
from strategic theory to any degree of precision. Thus, 
instead of using a statistical model with arbitrary re- 
strictions, incapable of finding, testing, or confirming 
patterns indicated by theory, our general approach is to 
follow when feasible the simple maxim from King and 
Zeng (2002): “When we know something, we assume 
it; when we don’t know, we estimate it.” 

The flexibility of neural networks contrasts vividly 
with the rigidity of a standard logit specification, which 
uses the same effect parameter when predicting the 
probability of war between Burkina Faso (one of the 
poorest countries in the world, located in western 
Africa) and St. Lucia (a small Caribbean island tourist 
destination) as it does for the probability of war be- 
tween the United States and North Korea. (That is, the 
estimated 6 coefficient relating the change in the ex- 
planatory variables to the conflict outcome is the same 
for both dyads.) This strikes us as incorrect, if not ab- 
surd, but it is precisely the kind of assumption made 
by the vast majority of the scholarly literature prior to 
BKZ. It is also the assumption made by dGG’s logit re- 
gressions. Given this point and the observable implica- 
tions of our conjecture to the contrary offered in BKZ, 
we would expect to see effects that are highly variable 
across the dyads, with massive interaction effects and 
nonlinearities that could only partially be picked up by 
techniques like logistic regression. 

dGG worry that neural network models “increase 
the size of the parameter space almost 30-fold.” This 
is misleading. Neural networks typically have more pa- 
rameters than logit models, but Bayesian regulariza- 
tion reduces this nominal number of parameters to an 
“effective number of parameters” that is usually much 
smaller (Bishop 1995, 377, 410). Of course, the number 
of parameters per se is not the right criterion by which 
to judge a model, for the complexity of a model should 
match that of the data. A model too simple to extract 
information from the data would be simply wrong and 
useless. Overfitting—modeling the idiosyncratic fea- 
tures of data rather than the systematic features that 
will persist—cannot be ascertained by naive parameter 
counts but rather requires understanding the demands 
being put on the data by the complete estimation pro- 
cedure, Even the user’s manual for the software dGG 
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use warns against naive parameter counting like this 
(Demuth and Beale 2002, 5-54). By the appropriate 
use of Bayesian prior densities—and most importantly 
the use of test sets during the training stage rather than 
merely in-sample t-tests—neural network models en- 
able researchers to estimate relationships rather than 
assume them, and at the same time they can help avoid 
overfitting and the resultant suboptimal forecasts. A 
logit model used in the standard way, without test sets or 
cross-validation, has little to prevent it from overfitting, 
even though it is a much less flexible form. 

Neural networks also avoid “underfitting,” a key 
problem with logit analysis, which makes assumptions 
about fundamental substantive relationships that we 
know little about. In fields where prior knowledge is 
extensive, specific, and highly informative, assumptions 
like these may be appropriate. Describing the interna- 
tional conflict literature in this way would be a stretch. 

dGG’s specification decisions exacerbate this inher- 
ent problem with logit analysis, because they appear to 
implement a notion of “theory” that does not depend 
heavily on prior evidence. To BKZ’s specification, (GG 
“added variables and nonlinear transformations” that 
they claim “the literature has established as central to 
any model of dispute initiation.” They cite 10 previous 
studies in support of this new specification. Whereas 
distance between the countries and whether one is a 
major power are good additions that we are happy to 
add to our model, what dGG do not say is that none 
of these prior articles include dGG’s specification of 
nonlinear transformations or anything close to it. In 
fact, no other published or unpublished work we could 
find, whether or not it was cited in dGG, used this spec- 
ification. Even the most recent publications (Reuveny 
and Li 2003; Russett, Oneal, and Berbaum 2003) and 
the most recent working paper by one of the authors 
(Gelpi and Grieco 2000) chose more traditional speci- 
fications, very much unlike the one in dGG. 

Consider democracy, which is the variable that has 
received the most attention in the conflict literature 
among those considered by dGG. The raw measures 
for most specifications begin with the Polity scores for 
each country in a dyad, coded —10 (autocracy) to 10 
(full democracy). dGG then add 11 to each (changing 
the range to 1-21) and include the product of these two 
scores and the square of this product. They exclude 
the main effects of each, the difference between the 
countries, the minimum or maximum of the two, and 
the sum of the two, which is equivalent to assuming that 
the effects of these variables more common in the lit- 
erature are exactly zero (see Oneal and Russett 1997). 
These are exceptionally strong theoretical assumptions 
about the causes of international conflict hard coded 
into their logit model. If the assumptions are incorrect, 
then dGG’s logit model will yield incorrect results, as 
the model is not flexible enough to adjust. dGG jus- 
tify this specification by appeal to “theory” but not one 
prior publication in the literature (including those cited 
by dGG) has used, examined, or even discussed this 
specification. 

To be more specific, under the dGG scoring, even 
ignoring the quadratic term, the difference between 
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a dyad with two democracies and a mixed dyad (one 
democracy and one autocracy) is 27 times larger than 
the distance between a mixed dyad and two autocracies. 
For the squared interaction the same difference is 441 
times larger. Because nothing in the logit functional 
form could correct for such a difference, and indeed no 
theory or empirical analysis in the literature supports 
anything like it, this cannot be considered a plausible 
specification. Of course, the problem is not much better 
in the two standard scoring schemes in use in the vari- 
ous logit analyses of conflict in the literature. Both the 
“minimum democracy score” and the “binary democ- 
racy variable” approaches treat the totally autocratic 
and mixed dyads as if they were identical. While one 
might find some complicated nonlinear transform of 
the two democracy measures that works in particular 
applications under logit or other nearly linear models, 
the only reasonable general approach would use a flex- 
ible functional form, such as a neural network. 

Although dGG’s specification does not rise to a rea- 
sonable notion of prior theory, we have no objection 
to it as one possible (reduced form) theory of interna- 
tional conflict. Indeed, inspiration for some of the non- 
linear terms suggested by dGG could have been coded 
directly from the neural network-generated marginal 
effect plots in BKZ. Hence, to stack the deck as much 
as possible in dGG’s favor, we adopt their specification 
for the logit model used for empirical comparisons in 
the rest of this article. (In our neural network model, 
we follow dGG and include only the basic explana- 
tory variables without their hard-coded interactions 
and nonlinear terms.) 


The Role of Parsimony In Polltical Sclence 


Although neural networks do not necessarily have a 
larger “effective number of parameters” than logit 
models, they do have a more complex mathematical 
form. This worries dGG, who declare, as a criterion for 
comparing approaches, that “models should be parsi- 
monious (King, Keohane, and Verba 1994).” We think 
that dGG misread King, Keohane, and Verba (1994, 
20), who wrote, 


The principle of choosing theories that imply a simple 
world is a rule that clearly applies in situations where there 
is a high degree of certainty that the world is indeed sumple. 
Scholars in physics seem to find parsimony appropriate, but 
those ın biology often think of it as absurd.... We believe it 
is only occasionally appropriate. . We should never insist 
on parsimony as a general principle of designing theories, 
but it is useful in those situations where we have some 
knowledge of the simplicity of the world we are studying. 
Our point is that we do not advise researchers to seek 
parsimony as an essential good, since there seems little 
reason to adopt it unless we already know a lot about a 
subject. 


Thus, one cannot claim that the simplicity of the logit 
model is an advantage unless that simplicity is consis- 
tent with the data, and no theory or empirical evidence 
indicates that it is (A generation ago methodologists 
argued about logit versus a simple linear probability 
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model; the argument for the latter, which is no longer 
taken seriously in the literature, is akin to dGG’s, that is, 
that one can easily read off effects in the linear proba- 
bility model and that interpretation of logit coefficients 
is difficult. That debate is now clearly archaic.) 

Instead of using parsimony, King, Keohane, and 
Verba (1994) argue that we should choose theories 
that “maximize leverage,” by searching for as many 
observable implications of a theory as possible and 
testing them, regardless of how complicated the the- 
ory itself is. As it turns out, the central conjecture of 
BKZ is highly parsimonious and maximizes leverage 
by producing many observable implications consistent 
with it. In general, neural network models tend to be 
less parsimonious than logit models, but at least for 
international conflict data they maximize leverage by 
producing many more observable implications consis- 
tent with the data. Moreover, the standard practice of 
evaluating models by out-of-sample forecasting tests 
ensures that parsimonious models will be chosen only 
when the empirical evidence indicates that they should 
be chosen. 


Out-of-Sample Evaluations 


dGG agree with BKZ that using out-of-sample fore- 
casting evaluations to maximize leverage and thus eval- 
uate models is critical. However, dGG miss two criti- 
cal consequences of this decision. The more important 
consequence is that out-of-sample evaluations can be 
used to guard automatically against overfitting. The use 
of in-sample statistics, such as those used by dGG, en- 
courage researchers to “peek” at the data in choosing 
a model, generate post hoc explanations for apparent 
patterns, and thus fit idiosyncrasies in the data that do 
not generalize. This overfitting becomes apparent when 
researchers use such a model to predict out-of-sample, 
a situation where they are vulnerable to being proven 
wrong. Thus, if neural networks were overfitting the in- 
sample data, this would simply cause them to perform 
worse in the out-of-sample forecasting tests that we 
and dGG use. If neural networks provide better out- 
of-sample forecasts, it cannot be because they overfit 
the in-sample data. Thus, while our Bayesian regular- 
ization scheme guards against overfitting, the rigorous 
evaluation of models via out-of-sample forecasting per- 
formance further ensures that models that are overfit 
will not be chosen.? 

The commitment to out-of-sample validation also 
shows that evaluating neural networks is as easy as for 
logit models. And whereas some may prefer to live in 
a simple world where one only asks if t-ratios exceed 
two, the commitment to evaluation by out-of-sample 
forecasting shows that such convenience is bought at a 
huge cost. To give one simple example of how hypoth- 
esis testing can be naive, the squared democracy vari- 
able in the dGG logit model has a substantial t-ratio of 
about —5. Thus by the conventional in-sample criterion, 


3 Out-of-sample forecasting and cross-validation in general are rou- 
tinely used techniques m modern statistical modelmg. For an early 
reference on this topic, see Stone 1974. 


we would think that this variable is an important pre- 
dictor of peace. However, when we compare the out- 
of-sample forecasting performance of the dGG logit 
with and that without this variable, we find that fore- 
casting performance is slightly improved by excluding 
the variable! Hence, the huge t-statistic merely reflects 
overfitting, a fact we would not know without out-of- 
sample tests. The key point is that significance tests are 
conditional on the veracity of the model, whereas out- 
of-sample tests are conditional only on the assumption 
that the out-of-sample data are generated by the same 
process as the in-sample data. 

Although dGG tout as one of the advantages of logit 
its ability to test statistical significance (p. 372), their 
claim is incorrect and this “advantage” is both nonexis- 
tent and misleading. It is nonexistent because hypoth- 
esis tests on any quantity of interest can be carried 
out in neural networks as straightforwardly as in logit, 
because neural networks, as we note earlier, work 
completely within the standard Bayesian or likelihood 
theories of inference.‘ It is misleading in that the stan- 
dard in-sample “statistical significance” paradigm using 
either model can easily lead to the choice of infe- 
rior models, as we have seen from the example of the 
squared democracy variable in dGG’s analysis. Thus, 
we see little reason in this context to rely on in-sample 
significance tests and instead encourage model compar- 
ison by out-of-sample forecasting performance. And 
as BKZ and our reanalysis below illustrate, evaluat- 
ing out-of-sample forecasting performance of a model, 
whether it is specified as a logit or a neural network, 
is equally easy. We now turn to a comparison of the 
out-of-sample forecasting properties of dGG’s chosen 
logit model with our preferred neural network model. 


EMPIRICAL EVIDENCE 


In the four parts in this section, we discuss appropriate 
data analysis procedures using neural networks, how to 
evaluate forecasting performance, empirical compar- 
isons of forecasting performance between dGG’s logit 
model and our neural network model, and how dGG’s 
logit model misrepresents substantive information in 
dGG’s data. 


Appropriate Data Analysis Procedures 


The particular neural network models used in dGG ap- 
pear inappropriate to the task. dGG explain that they 
chose these models by maximizing “the area under the 
ROC curve in the training set” (p. 376), which in normal 


4 Both logit and neural network coefficients can be subject to hy- 
pothesis tests, but domg so for logit is easier. However, the ease of 
this test comes with a set of assumptions about the nonexistence of 
mteractions and the veracity of a particular functional form that have 
no basis in the theory or evidence given in the international conflict 
literature. Moreover, coefficients from neither model are quantities 
of mterest and so should not be the subject of hypothesis tests. Testing 
genuine quantities of interest, such as the probability of conflict or 
first differences, is as easy ın both methods, assuming of course that 
estimation uncertainty has not already been integrated out, which 
would make hypothesis tests irrelevant. 
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circumstances is almost guaranteed to result in overfit- 
ting. dGG also chose not to use the software we used 
in BKZ or one of the other sufficiently sophisticated 
neural network programs available; their alternative 
choice was not well suited to avoiding overfitting, es- 
pecially given their model selection criterion. 

dGG did recognize the importance of including 
Bayesian regularization (i.e., prior densities). How- 
ever, the regularization scheme in their software was 
not designed for predicting a dichotomous dependent 
variable like war. As the manual for their computer 
program (Neural Network Toolbox for Matlab [see 
Demuth and Beale 2002, 5-60]) warns, “Bayesian reg- 
ularization implemented in the toolbox does not per- 
form as well on pattern recognition problems [i.e., 
classification problems, in particular binary dependent 
variables] as it does on function approximation [i.e., 
continuous dependent variable] problems.” This is be- 
cause, among other things, their program is running 
least squares with a dichotomous dependent variable, 
as if they were running a regression, rather than us- 
ing the correct likelihood for the problem—a practice 
political scientists have known was flawed and for the 
most part abandoned since the 1970s. This program, 
by using a single regularization parameter, also makes 
the incorrect assumption that all their parameters have 
identical variances in their prior distributions. Bishop 
(1995, 340-42) proves that this simplistic assumption 
is inconsistent with a model that has appropriate scal- 
ing properties. Thus, their choice of software with a 
flawed error function and regularization procedure, un- 
corrected by true out-of-sample tests, apparently led to 
the neural network they chose overfitting their data and 
forecasting suboptimally. 

Another problem is that dGG do not follow the stan- 
dard practice of normalizing their data prior to run- 
ning their neural network. Normalization is a key part 
of preprocessing (to which Bishop 1995, chap. 8, de- 
votes an entire chapter) and has long been recognized 
as an integral part of neural network modeling. It in- 
volves transforming variables to mean zero and unit 
standard deviation for estimation; after estimation the 
variables are transformed back to the scale of interest. 
Normalization thus also scales the parameter values, 
which is important for any nonlinear optimization pro- 
cedure, and is especially important here as, without it, 
the Bayesian priors would be scaled incorrectly. (In this 
data set, normalization may be critical, as the scales of 
the vanables in their original units differ by a factor of 
more than 12,000!) As Hastie, Tibshirani, and Friedman 
(2001, 358) explain, normalization “can have a large ef- 
fect on the quality of the final solution.” Normalization 
is recommended by the user’s guide to dGG’s chosen 
software, which even provides commands for doing it 
automatically (Demuth and Beale 2002, 5—61). 

In our analysis, below, we use the same software rec- 
ommended in BKZ, which does not have the shortcom- 
ings of the Neural Network Toolbox for Matlab (and, 


5 That 1s, researchers who use test sets properly have much greater 
freedom in choosing software A list of available neural network 
software can be found at http//gking harvard.edu/nn. 
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unlike Matlab and the Toolbox, also has the advan- 
tage of being free and open source). We use the same 
variables as in dGG’s neural network models, which 
include the basic linear terms of the input variables (ap- 
propriately normalized), but, following dGG, exclude 
the extra nonlinear transformations and interactions in 
dGG’s logit model. 

As in dGG’s logit analysis, we evaluate the perfor- 
mance of the neural network using two evaluation sets, 
one consisting of the last four years of data (1986-89) 
and the other a random subset of the pre-1986 data. 
As a test of forecasting the future, the post-1985 data 
set seems better, but performance in this context de- 
pends not only on picking up the underlying structure 
in the data, but also on the assumption common to all 
methods in this context—that the process generating 
the in-sample data is the same as that generating the 
out-of-sample data. The problem with the post-1985 
test, then, is that it is just one test and what it shows will 
differ from other out-of-sample tests. As a test of the 
comparative performance of the two methods when the 
out-of-sample data are known to be generated by the 
same process as the in-sample data, however, the ran- 
dom subset is near-optimal. Thus, although we present 
all our results for both test sets, it is the random subset 
results on which we should focus. 

We chose the neural network architecture by using 
the in-sample data only (i.e., pre-1986 data minus the 
random subset) and avoid overfitting via the appropri- 
ate regularization procedures and by the use of test 
sets created temporarily from within the training data, 
all following the procedures described in BKZ. And 
following King and Zeng (2002), we use committee 
methods to reduce variance and improve out-of-sample 
performance. Finally, as in BKZ, we examine the out- 
of-sample evaluation sets once, only after we arrived at 
our final model. It is these final results that appear in 
this paper. 


Evaluative Criterla 


BKZ and dGG agree on the necessity of model evalu- 
ation via comparing out-of-sample forecasts but differ 
on the criteria used to summarize forecasting perfor- 
mance. BKZ’s evaluative criterion was the number of 
zeros and (mostly) ones correctly predicted in out-of- 
sample test sets. Under this criterion, BKZ’s neural 
network model outperformed BKZ’s logit model. By 
BK2Z’s criterion, dGG’s logit model reduces forecast- 
ing performance relative to BKZ’s neural network and 
logit models, even though dGG’s logit uses additional 
explanatory variables. 

However, for new analyses, dGG argue that ROC 
curves are preferable to the evaluative criterion used 
in BKZ, and we are happy to go along. (ROC curves 
were first introduced in political science research by two 
of us after BKZ was published [King and Zeng 2002]; 
if we had been aware of them when we wrote BKZ, 
we obviously would have used them.) dGG’s method 
of summarizing the information in ROC curves, by re- 
porting the area under them, is not the only choice 
available, but it may be reasonable in this application 
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and so we adopt it here too as the evaluative criterion. 
Obviously, a model chosen by optimizing according 
to one criterion will not necessarily do well on an- 
other, and evaluating it with respect to a criterion it 
was not designed to advance would make little sense. 
(By analogy, a least-squares estimator minimizes the 
sum of squared residuals and would not generally do 
well if evaluated by a minimum absolute deviations 
criterion.) Thus, whereas we chose a neural network 
model in BKZ by optimizing the number of zeros and 
(especially) ones correctly predicted, we now choose a 
model that optimizes according to ROC areas. For this 
paper, we do not estimate neural network models to 
satisfy other criteria. 





6 Thus, the mam differences in our neural network procedure from 
BKZ 1s unng dGG's “ROC area” optimization criterion and explana- 
tory variables. The only other difference is committee methods, which 
are increasingly used in neural network analyses and other areas due 
to their variance-reduction properties, which make the analysis more 
robust. Our conclusions would not change without this refinement 
because all members of the committee also outperform logit, but 
we recommend committee methods because it is now widely under- 
stood that they are normally superior. More specific details about 
our analysis are available in the replication data set accompanying 
this article. Our results for calibration tests, which are also used in 
dGG and BKZ, lead to similar conclusions as for ROC analyses, so 
for simplicity we only present the ROC analyses here. 


FIGURE 1. Area under ROC Curves for Two Out-of-Sample Evaluation Sets 
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Forecasting Performance Comparison 


We now compare three models using dGG’s preferred 
criterion. First is dGG’s logit model, which, with their 
gracious help, we were able to replicate exactly. Second 
is our neural network model. And, finally, is an “athe- 
oretical baseline” model that has only a constant term 
and a correction for duration dependence (using dGG’s 
splines based on the “years since the last war” variable) 
and merely says “peace persists.” A miminal baseline 
model like this is of course a standard procedure used 
to facilitate comparison in a variety of fields.’ 

For each of the two evaluation sets, we calculated 
ROC curves (they appear in the Appendix) and then 
use dGG’s preferred summary of them, the area under 
the ROC curve, for each of the three models. Figure 1 
reports these results and conclusively demonstrates 
that the neural network model substantially outper- 
forms dGG’s logit model. The out-of-sample forecast- 
ing performance of the model that dGG characterize as 


7 Our use of a baseline model 1s essentially the same as an economic 
forecaster who chooses a persistence model, that is, with a baseline 
forecast equal to the value of the vanable during the last observed pe- 
riod. In contrast, a baseline equal to the global mean alone, or in our 
case assuming equal probabihties of war for all dyads, would ignore 
enormous time series dependence ın the data and would therefore 


generate meaningless comparisons 
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taking “into account the major findings of the quantita- 
tive security studies literature” (p. 375) is the leftmost 
bar in the figure. The height of this bar constitutes, in 
dGG’s view, a summary of the sum total of the liter- 
ature’s prior knowledge about international conflict; 
the fact that dGG went to the extent of writing a 
paper to defend an entire “generation of scholarship” 
would seem to imply that they judge this performance 
to be substantial and substantively important. Some 
others will probably disagree (and we are happy to 
remain agnostic on this issue). But however one judges 
this performance, neural networks give us more than 
twice the forecasting accuracy and knowledge about 
the world, relative to the baseline, as all this prior 
knowledge about international conflict combined, 
relative to the same baseline. And this is without 
using any of the nonlinear terms used in dGG’s logit 
model or adding a single extra datum or variable (and 
has no estimation uncertainty as the parameters are 
integrated out in the computation). The doubling 
of forecasting performance relative to baseline is 
approximately the same for the post-1985 test (on the 
left) as for the random subset of dyads (on the right). 

Figure 1 was drawn using dGG’s choice of asummary 
statistic, the ROC area, but to understand the numbers 
we need to translate them into units that are substan- 
tively meaningful. As is, the numbers are not compa- 
rable to numbers computed from other data sets and 
are not immediately related to any substantive feature 
of the results: We know that bigger is better but do 
not know whether the differences in ROC areas in the 
figure should be characterized as “large” or “small,” 
and perusal of the full ROCs in the Appendix does not 
help either. Thus, restating the above more precisely, 
one meaningful unit is the distance from the atheo- 
retical baseline to dGG’s logit model, which according 
to dGG represents everything we know in the field, 
as summarized by dGG’s logit. Using this as the unit 
of measurement, our neural network model takes us to 
more than two units. Thus, the marginal contribution of 
our methods—the distance between the performance 
of dGG’s logit and our neural network—is slightly more 
than one unit. To assess whether a marginal improve- 
ment of one unit is “large” or “small” is thus equivalent 
to assessing whether the distance from the baseline to 
dGG’s logit is large or small, as it is also about one unit. 
Because the one unit from the baseline to dGG’s logit is 
dGG’s summary of all the empirical knowledge existing 
in the field of conflict studies, dGG clearly imply that 
one unit is substantial. If this is correct, then our neural 
network’s improvement over logit is just as substan- 
tial. Others may view the contribution of quantitative 
conflict studies as smaller than dGG do, and so should 
view the improvement of neural networks over logit 
to be concommitantly smaller as well, but even in this 
case the improvement is as large as in all prior research 
on the subject. We also show in the next section that 
this one incremental unit beyond dGG’s logit turns out 
to be enormously important from a substantive per- 
spective, 

Another way to think about this is to consider the 
commonly reported proportionate reduction in error 
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(PRE) statistic; the error in this case is one minus the 
ROC area (for comparison, the PRE in linear regres- 
sion is R°). For the post-1985 data, the PRE is 11.7% for 
logit and 24.0% for the neural network; for the random 
subset, the proportionate reduction in error is 16.4% 
for logit and 33.6% for the neural network. 

Finally, as a robustness check on our results, we ran 
4,400 additional neural network models based on differ- 
ent numbers of committee members, methods of com- 
mittee decision making, procedures for calculuating 
the probability, network architectures, random seeds, 
and prior specifications—none of which were tuned 
according to the test procedures described in the text. 
Although skipping proper model selection rules is not 
advisable for purposes of inference or model compari- 
son, these runs give a sense of how hard it would be to 
find a neural network model that forecasts worse than 
dGG’s logit. It would be hard: Of the 4,400 runs, the 
logit was outperformed on the random test set (and in- 
sample data) by all 4,400 network models and on the 
post-1985 test set by 4,151 of the neural networks. 


Substantive Implications 


We now demonstrate that the substantive differences 
between dGG’s logit specification and our neural net- 
work are of the utmost importance to the field. Indeed, 
dGG’s logit specification included assurnptions that vir- 
tually guaranteed that they would draw the wrong con- 
clusions about the most important hypothesis in the 
literature, the effects of democracy on war, no mat- 
ter what the data said. Many scholars have found that 
countries are less likely to fight each other if both are 
democratic; however, we do not know whether this is 
because they are democratic or because they are alike. 
In fact, there exists evidence in the literature that a pair 
of countries is less likely to fight if they are both autoc- 
racies under some circumstances (Peceny, Beer, and 
Sanchex-Terry 2002). A plausible alternative theory 
we now provide some evidence for is that “likes don’t 
fight,” in that dyads with different levels of democracy 
are those likely to go to war (Werner 2000). Because 
these issues remain highly controversial in the litera- 
ture and remain the subject of a considerable research 
program, we think that dGG’s specification—which 
makes some of these results impossible to obtain— 
unwisely substitutes unsupported theory for empirical 
analysis. 

Figure 2 plots the probability of war as a function 
of the [—10, 10] Polity score for the two countries in a 
dyad (marked “Dem a” and “Dem b”), holding con- 
stant other explanatory variables at values indicating 
high ex ante probabilities of conflict (computed from 
the median of the United States and China in their con- 
flict years). We do this with a three-dimensional surface 





8 We also find, incidentally, that committee methods at least partially 
protect one from bias, in addition to reducing vanance. The larger 
the committes, the less dependent results are to particular modeling 
choices and model selection procedures and the better are the fore- 
casts on average. This result is unlikely to have arisen by chance, as it 
ts consistent with findings from other fields in unrelated applications 
(e.g , Stock and Watson 2003) 
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Note Surface and contour plots for the probability of war grven differing levels of democracy (coded for each country In a dyad as —10 
for autocratic to 10 for democratc) and a high ex ante probablltty of war. The logit model on the right is based on a replication of dGQ's 
analysis. The neural network model on the left reveals features mischaracterized by the logit model. 





plot (which makes it easy to visualize the relationship) 
and a contour plot (which helps provide numerical pre- 
cision). The height of the surface is the probability of 
war. The contour plot should be thought of as the view 
of the surface plot from above, and the numbers reflect 
the probability for equal probability contours. For vi- 
sual clarity we have not included confidence intervals, 
which are of no value for model selection with binary 
dependent variables anyway. dGG’s logit model ap- 
pears on the right: The dip in the surface at the left, 
where Dem a and Dem b are both near 10 (indicating 
full democracy), apparently supports the democratic 
peace hypothesis. 

So far so good, until one realizes that other hypothe- 
ses were impossible to uncover with the logit model 
dGG specified, no matter how loudly the patterns in 
the data screamed.‘ That is, the only results that could 
be found by dGG’s specification were ones in which the 
probability of war was specified to be a logistic function 
of the product and the squared product of the degree of 
democracy of the two countries in each dyad (and other 
variables), a specification that mathematically restricts 
without theoretical reason the probability of conflict 
at (—10, —10) to be similar to that at (—10, 10) and 
(10, —10). One way to demonstrate that the data were 
indeed screaming to get around this specification is by 
running our neural network model, as it encompasses 
dGG’s logit model as a limiting special case. Hence, 
if the pattern on the right in Figure 2 were correct, the 


FIGURE 2. Surface and Contour Plots for the Probability of War 











plots on the left in the figure, calculated from the neural 
network model, would display the same pattern. The 
differences, however, are massive. In the plot on the 
left, the low probability-of-war dyads are not uniquely 
clustered in the bottom-left corner where both coun- 
tries are democratic. Instead, two clear regions of low 
probability of war appear, one reflecting the democratic 
peace and the other reflecting an autocratic peace. The 
ridge in the middle, indicating areas of high war prob- 
abilities, fit the “likes don’t fight” hypothesis, as well as 
reflecting the well-known result that partial democra- 
cies (even if alike) tend to be more violent. In addition, 
the bottom-right corner, which was missed by logit, is 
not a minor feature of the data: The gradient from 
the top of the ridge to the right corner represents a 
remarkable 30 percentage point drop in probability— 
from 0.65 at the top to 0.35 at the right. This has obvious 
and critically important normative implications for the 
central hypothesis in the field. 

Thus, for substantively oriented scholars of interna- 
tional conflict, neural network models are far more ca- 
pable than logit models of representing the rich array of 
theoretical ideas in the literature and enabling scholars 
to draw valid empirical conclusions from existing data. 


CONCLUDING REMARKS 


dGG make three main points. We all clearly agree 
with their first point, that almost all researchers in this 
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field (and most other fields) should use out-of-sample 
forecasts, and tools like ROC and calibration curves, 
to evaluate statistical models. This will help end the 
practice of specifying models that are nearly invulner- 
able to being proven wrong and then drawing strong 
substantive conclusions without empirical grounding. 

However, dGG’s two other points are incorrect. A 
properly specified neural network model clearly out- 
performs dGG’s logit model; in fact, a correctly con- 
stituted neural network model improves on the dGG 
out-of-sample forecast performance to the same degree 
that dGG improves on an atheoretical model that sim- 
ply says that peace persists. dGG are also incorrect that 
neural network modeling involves some kind of novel 
epistemology or is of no interest to the theoretically 
minded student of international conflict. As we have 
seen, neural networks are mterpreted using the same 
tools, in the same way as for logit, and both have the 
same statistical underpinnings. Neural network models, 
but not logit, are capable of fitting models where na- 
tions are acting strategically. Indeed, the logit model is 
simply a limiting special case of a neural network, and if 
the world is such that the simple logit form adequately 
represents the data, a correct neural network analysis 
will reveal this fact. 

Not only does the neural network outperform logit 
for the data under consideration, but also we saw in 
Figure 2 that the neural network results are of tremen- 
dous interest for settling important controversies in the 
study of conflict. dGG’s logit model is simply incapable 
of returning results consistent with an entire range of 
hypotheses in the literature, no matter what the data 
indicate. Clearly such a model cannot be used to de- 
cide fairly among competing hypotheses. Surely, data 
sets will exist where logit performs as well as neural 
networks but where neural networks (or other mod- 
ern methods that enable one to estimate rather than 
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assume the functional form) forecast better; we should 
neither fear to use them nor worry that our standard 
analytical methods must be discarded before we use 
them. Indeed, a whole range of methods has now been 
developed that does not require the functional form 
assumptions inherent in logit and the other techniques 
commonly used in political science. For prediction, 
these include neural networks, models of intermediate 
flexiblity like generalized additive models (Beck and 
Jackman 1998), those that have unique optima such 
as support vector machines (Vapnik 1995, 1998), and 
other types of models and methods such as boost- 
ing, regression and classification trees, kernel methods, 
and mixture models (Hastie, Tibshirani, and Friedman 
2001). For estimating causal effects, in areas where pre- 
and posttreatment controls are clearly distinguishable, 
the techniques tend to be matching and related ap- 
proaches (King and Zeng 2003). Which of these tech- 
niques is appropriate will depend on the application, 
but in almost all situations these techniques will dom- 
inate those with restrictive functional forms like logit, 
except in the presence of theory far more informative 
than in most areas of political science. 

If our basic conjecture—that the world of interna- 
tional conflict combines a small number of dyads where 
a variable has a large effect with a large number of 
dyads where it has essentially no effect—is correct, 
then we must turn to appropriately flexible methods 
like neural networks that allow for these massive inter- 
actions that cannot be specified a priori. For the data 
and subject matter at hand, the massive interactions are 
there and logit is inadequate to deal with them. 


APPENDIX: ROC CURVES 


Figure 3 presents the three model ROC curves for each eval- 
uation set. Clearly the ROC curve for the neural network 





| FIGURE 3. ROC Curves for the Post-1985 (Left) and Random (Right) Out-of-Sample Test Sets 
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model lies closer to the top right than the corresponding logit 
curve almost everywhere, and the gap between the curves is 
about as large as that between the baseline ROC and the logit 
curve. 

Figure 1 provides a cleaner summary of the relevant char- 
acteristics of these graphs using dGG’s preferred summary 
statistic, and the text explains how to understand the impor- 
tance of the ROC differences seen here. 
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$25 00 Paperback 0-521-53862-9 


On Capitol Hill 


The Struggle to Reform Congress and Its Consequences, 
1948-2000 


Julian Zeler 
$30 00 Hardback 0-521-80161-3 
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provoca 
f study of the 
ongins of America’s most fascinating but 
least-understood insntution. The book 1s an 
intellectual triumph and a delight to read” 
—Roes K. Baker, Rutgers University 


Mandarins of the Future 


Modernization Theory in Cold War America 
Nils Gilman 
“Mandanns of the 
Future will be 
definite, 
pethaps for 
decades to come 
Gilman provides f 


—Michael Adas, Rutgers Unrvernty 
New Studies in American Intellectual and Cultural 
History Howard Brick, Senes Editor 

$48.00 bardcover 


From Warfare to Welfare 


Defense Intellectuals and Urban Problems in 
Cold War America 


Jeantfer S. Light 


“An outtanding discussion and evaluation 
of the capabrhties, efforts, took, and 
contributons of technologist, scientists, 
managers, planners, analysts from the 
military, aerospace, and other federal 
government cies.” 

—Harold Finger, former NASA Associate 
Administrator for Organization and 
Management and former HUD Assistant 
Secretary for Research and Technology 
$42.00 bardcorer 
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“Non-Germans” 
under the Third Reich 


Uneasy Balance 
Civil-Military Relations in Peacetime America 
since 1783 

Thomas S. Langston 

“Langston examınes through historical case 
studies of U.S war-to-peace transitions 
how leaders face the problem of preserving 
a mihtary that 1s innovative and prepared to 
































































fight the next 

major war and 

finding missions 

for the military 

in the meantime 

iar y legal and executrve provisions and practices 

ankin applicable to ‘Fremdvolkische’ I consider ıt 

leadership and the only scholarly work of this nature that 

the general addresses all of the people concerned and 

public.” the treatment they received from the Nan 

Michael C. authonties.”—Walter O. Weyrauch, 

Desch, Univer- University of Florida 

sty of Kentucky Published in assodation with the United States 
$149.95 bardcover 

False Alarm Europe without Borders 


Why the Greatest Threat to Soaal Secunty and 
Medicare Is the Campaign to “Save” Them 
Joseph White 

“This book 1s compelling in almost every 
way. White takes on all challengers, and he 
does so with clarity and sumplicity that 
downplays the depth of his understanding.” 
A Century Foundation Book 

$22.95 paperback 


Federalism and 
Territorial Cleavages 

edited by Ugo M. Amoretti and Nancy Bermeo 
“I found it to be an often fascinating book 
dealing with an extremely important, 
timely, and under-explored topic” 
—Ruchard Gunther, Ohio State University 
$55.00 bardcorer 


Remapping Territory, Citizenship, and Identity 
in a Transnational Age 


edited by Mabel Berezin and Martin Schaln 


“Europe without Borders plays an important 
role in bringing together muludisaphnary 
perspectives that, taken together, provide an 
insightful and enlightening overview on 
this emerging phenomenon called ‘Europe’”’ 
—Davıd Jacobson, Arizona State Univernty 









































Governance 
on the Ground 


Federalism and Democracy 
in Latin America 
edited by Edward L. Gibson 


“A strong contribuuon to a debate that is 
likely to become central in comparative 
politcs and for students of democracy and 
polucal insntutions.” 

—Gerardo Munck, School of International 
Studies, Univernty of Southern California 
$18.95 paperback 
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Ground describes people at a local level 
working through municipal insntutons to 
take more responmbility for their own lives 
and environment. 

Vibodrow Wilson Center Press 

$24.95 paperback 
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Window on Congress: A Congressional i of Barber B. Conable Jr. 
awn James S. Fleming 
Foreword by Richard F. Fenno Jr. 


In hls foreword the renowned congressional scholar, Richard F. Fenno Jr. writes, “Barber Conable w 
an especially admirable United States Representative; and JIm Fleming has written an especially ac 
rable congressional biography. This book is, therefore, a special gift.” 


Barber B. Conable Jr. served as a Republican congressman from westem New York from 1965 to 1985. He |: 
recognized as one of the most respected members of the House of Representatives In recent years. This blo 
raphy explores his twenty-year congressional career, focusing on his remarkable educational abilities as a gH 
teacher-egisiator. Using excerpts from Conable’s private journal, hls newsletters and news columns, and fror 
| __4 personal Interviews, James S. Fleming has crafted a book that enables readers to appreciate why Conable w 
m held In high regard by his constituents, his colleagues, the press, and congressional scholars. 





20 b/w lilus, 275pp, ISBN: 1 58046 128 X, Available May 2004, $29.95 hardcover 


University of Rochester Press, 688 Mt. Hope Avenue, Rochester, NY 14620 
Tel: 585-275-0419 Fax: 585-271-8778 www.urpress.com 
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The Urban Voter 
Karen M. Kaufman 


“This Is an elegant book, deftly argued and amply 
documented. It Is a must-read for all those serlously 
concerned with urban politics, race, and the future of 
our major citles.” 

— Lawrence D. Bobo, Harvard Untversity 


Political Psychology In 
International Relations 
Rose McDermott 


A comprehensive account of varfous aspects of the 
fleld of polttical psychology for scholars and students 
of International relations. 


30TH ANNIVERSARY EDITION 


The Politics of Rights 
Lawyers, Public Policy, and Political Change 


Stuart A. Scheingold 


“This book has already exerted an enormous Influ- 
ence on two generations of scholars. It has had an 
enormous Influence on political scientists, soclolo- 
gists, and anthropologists, as well as historians and 
legal scholars. With this new edition, thls Influence 
Is likely to continue for still more generations. The 
Politics of Rights has, | belleve, become an American 
classic.” — Malcolm Feeley, The Untversity of 
Callfornla, Berkeley 


Models, Numbers, and Cases 
Methods for Studying International Relations 


Detlef F. Sprinz and 
Yael Wolinsky-Nahmlas, Editors 


“,.. provides a state-of-the-art overview of research 
methods In International relations for students and 
scholars allke. The authors collectively demonstrate 
that there Is a unlty In the logic of research and a 
complementarity In the use of methods In Interna- 
tlonal relations. Sprinz and Wolinsky-Nahmlas have 
organized outstanding essays from a group of distin- 
gulshed scholars Into an unusually coherent volume.” 
— David A. Lake, Unlverstty of California, San Diego 


REVISED EDITION 

Public Opinion and 
American Foreign Policy 
Ole R. Holsti 

“,.. a standard setter....” —APSR 


“Holsti examines most of the major Issues pertain- 
ing to public opinion and forelgn pollcy In one highly 
readable volume. Thls Is what scholarly writing should 
be: well researched, free of academlc Jargon and con- 
clse. For anyone Interested In the subject, this book Is 
a must-read.” —Canadlan Journal of Political Science 


Before Norms 
Institutions and Civic Culture 


Robert W. Jackman and Ross A. Miller 


A provocative argument In a timely debate among 
social sclentlsts. Jackman and Miller argue the superi- 
ortty of the Institutfonallst approach to understanding 
political behavior. 


The Scourge of War 
New Extensions on an Old Problem 


Paul F. Diehl, Editor 


Reassesses the groundbreaking work of political scl- 
entlst J. David Singer to produce a clearer, more con- 
temporary understanding of how wars happen, and 
how they might be avolded. 


Racism, Liberalism, 
and Economics 


David Colander, Robert E. Prasch, 
and Falgunl A. Sheth, Editors 


“There Is an Immense economic literature on raclal 
discrimination, employing a variety of models and 
decomposition methods. This volume makes a unique 
contribution by focusing on the philosophical as- 
sumptions at the root of this analysis and by present- 
ing many slides of the very vigorous debate surround- 
Ing these controverslal Issues.” 

—Thomas Maloney, Untverstty of Utah 


The University of Michigan Press 
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BEMOCRATIC CRITICISM 


Humanism and Democratic 
Criticism 

Edward W. Sald 

Foreword by Aksel Biigram! 

In his final book, Sald makes an Impassioned argument 
for revitalizing the humanities He contends that words 
are vital agents of historical and polttical change and 
that reading teaches people to continually question, 
upset, and reform. Intellectuals must reclalm an active 
rote In public Ife, but at the same time the academic 
trend toward needless jargon and obscurantsm must 
be combated, as must the dismissive, axciusionary 
forms of humantsm. 


192 pages © $19 95 doth 
COLUMBIA THEMES Bi PHILOSOPHY 


The Politics of the Governed 
Reflections on Popular Politics in Most of the World 


Partha Chatterjee 

How global tachniques of power have changed the 
nature of democracy in most of the world and how the 
majortty of the word's population Is responding 


“[A] remarkable collection of essays Cmi! society 
will not look the same after this book has been read 
and debated.”—Ayjun Appadural, New School 
UnWerstty 


200 pages © 19 phatbs © $29 50 cloth 
LEQMWAAD HASTENS SCHOFF LECTURES 


Democracy In Action 
Communtty Organizing and Urban Change 
Kristina Smock 


“A splendid critical analysis of the value and power of 
community organizing and a practtcal gulde to the 
efficacy of varlous models of communrty organing. 
Invaluable for a broad audience concemed with 
grassroots civic renewal.”—Robert Asher, author of 
Let the People Decide 

296 pages © $26 50 paper © $62 50 cioth 


Islamist Networks 
The Afghan-Pakistan Connection 
Mariam Abou Zahab and Olivier Roy 


Thus indispensable book Investigates the almost twenty- 
five-year gestation of the interlinked radical Islamist 
networks of Pakistan, Afghanistan, and Central Asia, out 
of which Al Qaida emerged. 


92 pagas e $49.50 cioth 
THE CA SFS N COMPARA POLITICS AD ATESTU IONAL STUDES 


| NEW FROM THE CHINESE UNIVERSITY PRESS 


' A Cloud Across the Pachic 
; Essays on the Clash between Chinese and Western 





New from Columbia 


China's Democratic Future 
How It Will Happen and Where It Will Lead 
Bruce Gilley 


An eminent China expert considers how the Chinese 
Communist Party will be removed from power and 
democratic transition will take piace. 


“The most thoughtful analysis yet of China's current 
problems and future possibilities. Deeply informed, 
carefully argued, and forcefully written.” —Andrew J. 
Nathan, co-editor, The Tananman Papers 
320 pages © $29 60 cloth 


The India-China Relationship 
What the United States Needs to Know 


Edited by Francine R. Frankel and Harry 
Harding 

Scholars from political science, history, economics, 
Intemational retations, and security studles offer fresh 
Insight Into the relationship between the two most pop- 
ulous nations on Earth. 

352 pages ¢ $22.50 paper © $62.50 cloth 


A PUBLICATION OF AStA SOCIETY AMD THE WOODROW WELSQN CENTER FOR 
MTERA TOAL STUDIES 


Economic Governance In the Age 
of Globalization 
Wililam K. Tabb 


Integrating economics and political science, Tabb traces 
the emergence of the global state from the closing days 
of World War I and examines Its future prospects. 


“The best comprehensive analysis and crttique of 
globalization and especially of economic governance 
m a vast Irterature.”—Richard Falk 

528 pages © $29.50 paper ° $89 50 cloth 


Political Theories Today 


Thomas A. Metzger 


“Important and seminal... Bniliant, daring ideas that 
challenge some of the conventional wisdom in the 


| fleld of polttical philosophy.” —Chang Hao 


; 752 pages ¢ $55 00 cioth 


. Guangdong 


Preparing for the WTO Challenge 
Edited by Joseph Y. S. Cheng 


j “A wide range of authomtatve expertise... Academic 


analysis at rts best.” —Robert Ash, SOAS 


‘ 378 pages © Tables © $39.00 cloth 


COLUMBIA UNIVERSITY PRESS 
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Diminished Democracy 
From Membership to 











| 
TO MAMAGEMENT IM AMERICAN CIVIC LIFE in avic mvolvement, which has not 
THEDA SKOCPOL 


] sharp-eyed focus on the state's role m avic hfe 5 a useful corrective to the 
romamnc locabam and the arrier kinds of cukural analyes 90 prevalent m American polin- 
cal thought today."— brea a 


Tech Siop s & ional thee eec she constantly niin her scatan mando 
the task of makmg the United States a farer and more democratic place. Dorsrashed 
Demoga 1s 2 brillant contribution to that effort, a collection of insights, that will over- 
turn much conventonal wsdon All who care about democracy and.sf prospects owe 
themselves tme with thts book” —E. J. Dionne, Jr., d colummt and author of 

Why Americans Hate“ Pa 
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STATE TERRORISM AND THE UNITED STATES 


From Counterinsurgency to the War on Terrorism 
Frederick H. Gareau 





“An important, courageous analysis .. Gareau’s research 
and documentation ts path-breaking. “ 
Chalmers Johnson author of Biowbacic The 
Costs and Consequences of Amencan Empire 


™Using as his evidentiary base the truth commission 

reports that have followed the ouster of terror regimes, 

Gareau makes his case for vital Washington support for 

these regimes compellingly and soberly. ... a valuable 
addition to the Itterature on terronsm.” 

Edward Herman 

Wharton School of the University of Pennsylvania 


Clarity Press, Inc. € 
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The $800 Million Pill 

The Truth behind the Cost of New Drugs 
Merrill Goozner 

“A masterful work of explanatory and Inves- 
tigative Journalism. Goozner has versed 
himself In the InterlockIng worlds of medl- 
cine, business, politics, and basic sclence 
to explain how pharmaceutical break- 
throughs truly occur. He also explains why 
drug costs are now so needlessly high... 

A compelling and Important book.” 


—james FaLLows of Tae ATLAwnic Afoerracy, 
author of Breaking the Mews 


$24.95 cloth 


The Children of NAFTA 
Labor Wars on the U.S./Mexico Border 
David Bacon 

“Bravo to David Bacon for hts tough-minded, 
unsparing portralt of working Ufe at global- 


lzatlon’s ground zero.” —Ray Suanez, 
senlor correspondent, The NewsHour with 
Jim Lehrer, and author of The Old Neighborhood 


$27.50 cloth 





Taking Back the Streets 
Women, Youth, and Direct Democracy 
Temma Kaplan 
“This Is a strong, livety, deeply felt, and 
wonderfully Informed account of the power 
of women struggling to reverse tyranny In 
many of the most distressed countries In 
the world.” —Viviaa Gonic, author of 
The End of the Novel of Love 
$55.00 cloth, $21.95 paper 





Strong Democracy 

Particlpatory Politics for a New Age 

Benjamin R. Barber 

With a New Preface 

Twentleth-Anniversary Edition 

“Hrs distinctive contribution Is not to develop 

a legislative package but to sketch a way of 

looking at polltics from which a stronger 

democratic practice might emerge..Hls 

approach Is origina! and provocative.” 
New Youx Ties 

$16.95 paper 
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Cruel Justice 

Three Strikes and the Politics of 

Crime In America’s Golden State 

Joe Domanick 

“A brave and utterty compelling response to 

the tynch mob mentallty that has done so 

much damage to the moral fabric of 

California Ufe. Domanick might be our Zola.” 
—IMuc Dens, author of Dead Cties 

$24.95 cloth 


Globalization 

Culture and Education In 

the New Millennium 

Marcelo M. Suérez-Orozco and 

Desirée Baollan Qin-Hilliard, Editors 
“This book comprehensively explores the 
challenges that globalization poses to edu- 
cators... The contributors, experienced 
educators themselves, have thought freshly 
and deeply on the cultural Implications of 
the globallzing process.”—AsTHum Scarsimcm 
Published h assoaation with the Ross Institute 
$50.00 cloth, $19.95 paper 


No There There 

Race, Class, and Political Community 

In Oakland 

Chris Rhomberg 

“This sophisticated account of a remark- 

able city’s coalitions and conflicts over half 

a century Is an outstanding contribution to 

urban hlstory and political anatysis.” 
Cers Tiy, author of Durable inequality 

$39.95 cloth Q 
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CRUEL 
JUSTICE 


in bux 


There’s Something 


Happening Here 
The New Left, the Klan, and 
FB! Counterintelligence 
David Cunningham 
“Cunningham’s landmark study of the FBI’s 
response to Sbdles protest couldn't be more 
timely. We galn fresh and disturbing Insight 
Into the culture and dynamics of the agency 
at a time when once again It has been 
empowered to monitor political dissidence. 
We need thls history so as to avoid 
repeating It” 

—RocHarp Faas, author Making History 
$27.50 cloth 


Bringing the War Home 
The Weather Underground, the Red 
Army Faction, and Revolutionary 
Violence In the Sixtles and Seventles 
Jeremy Varon 
“Yaron achleves something very difficult, 
and very Important: he conveys the Intense 
heat of moral purpose and polltical convic- 
tlon that drove some radicals to violence, 
while at the same time he malntalns a cool 
analytic distance that enables him to pro- 
vide a brillant and compelling Interpreta- 
tion of the logic and limits of dissent In 
democratic socletles.” 

—jon Wan, author of Gimme Some Truth 
$55.00 cloth, $21.95 paper 





Modernizing 

China’s Military 

Progress, Problems, and Prospects 
David Shambaugh 

Now m Parexeacx—“Superb..An authoritative 
and comprehensive account.”—Foesses Armas 
$19.95 paper 


UNITED STATES INSTITUTE OF PEACE 
GRANT PROGRAM ¢ FALL 2004 COMPETITIONS 


The United States Institute of Peace is accepting applications for its Spring Unsolicited and Solicited 
Grant competitions. The Grant Program offers financial support for research, education, training, and the 
dissemination of information in the fields of international peace and conflict resolution. + The Unsolicited 
Grant competition is open to any project that falls within the Institute’s general mandate of international 
conflict resolution. # The Solicited Grant competition is only open to projects that fall within the themes 
and topics identified in advance by the Institute. The topics for the Fall 2004 competition are: 


+ SOLICITATION A: The Economics of War and Peace 
+ SOLICITATION B: Southeast Asia 


The Institute encourages applications from nonprofit organizations and official public institutions. Individuals may 
also apply. For further information and application materials, please call, write, or e-mail: 


Grant Program ¢ United States Institute of Peace 

1200 17th Street NW, Suite 200 

Washington, DC 20036-3011 

(202) 429-3842 + Fax (202) 429-6063 + TTY (202) 457-1719 + e-mail: grants@usip.org 


Application materials may also be downloaded from our website: www.usip.org / grants 


The closing date for receipt of Fall 2004 Unsolicited and Solicited grant applications is 
October 1, 2004. Award announcements will be made on March 31, 2005. 











WHO SHOULD Pay FOR MEDICARE? 

Danlel Shaviro 

“Dan Shaviro is that rare scholar who can blend legal and economic 
thinking in designing social reform In this masterpiece of logic, he takes 
us outnde the narrow debate over what we get and asks the fundamenul 
queson of who should pey for it. Only with approaches like Shaviro's 
are we ever going to face up to the types of Medicare—end broader 
health care—teforms that sooner or later we must undertake ” 
—Eugene Sceuerle, Urban Institute 

CLorm $25.00 


THE POWER OF PRODUCTIVITY 

Wealth, Poverty, and the Threat to Global Stability 
William W. Lewis 

“Over the last decade, the McKinsey Global Institute has produced a 
series of penetrating and provocative reports on the sources of producnvi- 
ty and wealth differentials between countries. Bill Lewis was the prime 
daver of thar work, and he has now brought the conclusions together. 
The Pewer of Productivity is full of innghtr, which emerge from patient, 
sector-by-sector analyms. . It opens a new and difficult agenda for eco- 
nomic policymakers around the globe, and [Lewis's] book deserves a 
wide readership.”—Sir Howard Davies, Director, London School of 
Economics 

Core $28.00 


THE POLITICAL MAPPING OF CYBERSPACE 
Jeremy W. Crampton 

As inherently spacal beings, humans are challenged by the absence in 
cyberspace of all that is familiar in terms of our ability to define, organ- 
me, govern, and map socal places. In The Poltica! Mapprng of Cyberspace, 
Jeremy Crampton shows thet cyberspace is not the virtual reality we 
think it to be, but instead a rich geography of polincal pracuces and 
power relanons. 


Paper $25.00 
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DANIEL A. 
SUZANNA 


RESCUED BY EUROPE? 

Social and Labour Market Reforms In Italy from 

Maastricht to Berlusconi 

Maurizio Ferrera and Elisabetta Gualmin! 

“Italy's successful transinon to EMU in the 1990s required change and 
renewal on many fronts. . . . Two of Italy's foremost public policy spectal- 
ists, Ferrera and Gualmini are well placed to tell the story of how Italian 
political elites, long onented towards buying off opposition and vested 
interests by expanding a bloated public debe, were finally confronted 
with reality by EMU membership critena "—Marun Rhodes, European 
University Institute, Florence 

DESTRI BUTED POR AmSTERDAM UNIVERSITY Prose 

Paper $30.00 


Now in paper 
DESPERATELY SEEKING CERTAINTY 

The Misguided Quest for Constitutional Foundations 
Daniel A. Farber and Suzanna Sherry 

“Terre, effective... . Furnishes a useful wake-up call to those who feel 
theory can construe the Constitunon better than a judge with a sense of 
history, precedence, and humanity.”"—Philip Weinberg, Mew York Law 
Journal 


Parar $17.00 


BUILDING THE COLD WAR 

Hilton International Hotels and Modem Architecture 
Annabel Jane Wharton 

“In a chought-provoking and creative work of cold war scholarship. .. 
[the author] has produced a study that explores the interrelationship 
between the design and construction of Hilton hotels and the expansion 
of US cultural and polrucal influence... . Wharton presents a wealth of 
fascinating marenal . . . which may cause us to rethink what we mean 
by the corndors of power "—Jonathan Rosenberg, Jeternational History 
Renew 

Papar $27.50 
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The Dubious Link Globalizing LA. 
Civic Engagement and Trade, Infrastructure, and 
Democratization Regional Development 
cr THING N ARIEL C. ARMONY STEVEN P. ERIE 
SOR Ee aS ones “Thuis book is a crical advancement of our under- “Stnctly by itself, and on its own terms, metropoli- 
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Telepopualisn 





standing of civil society’s role ın the state-society 
relations of modern democracy. With its clear 
structure and in-depth case studies, it 15 an mvalu- 
able contmbution to political science, sociology, 
legal studies, and Latın American studies.” 
—Mark Ungar, Brooklyn College, 
The Cıty Unrversity of New York 
$55 00 cloth 


Overruled? 

Legislative Overrides, Pluralism, 
and Contemporary Court-Congress 
Relations 


JEB BARNES 
“Barnes provides a much-needed corrective to 

common assumptions about congressional and 
judicial behavior. Employmg an exemplary com- 
bination of quantitative and qualitative analyais, 
he offers vivid illustrations of individual policy 
disputes and explains the logic of divergent 
cases. This book makes an important contribu- 
ton to the field, making a complicated subject 
understandable.” 

—R. Shep Melnick, 

Boston College 
$50 00 cloth 


The Rule of Law in Nascent 
Democracies 
Judicial Politics in Argentina 


REBECCA BILL CHAVEZ 

“In this analytically penetrating and timely book, 
Rebecca Bill Chavez argues that the rule of law is 
fundamentally a question of dispersed power. This 
powerful idea is supported by nich evidence 
gleaned from mnovative field research at the 
national and sub-national levels. This book should 
be required reading for students of development, 
law, sociology, and politics.” 


Popular Political Support in 
Urban China 


JIE CHEN 

On the basis of three carefully drawn surveys of 
Benmg residents between 1995 and 1999, the 
author finds that diffuse support for the current 


polincal system—hased on attitudes toward institu- 


tons and values—remams strong, at least among 
city-dwellers, though it us gradually declining. 

A co-publication with the Woodrow Wilson Center 
Press, Washington, D C 

$21 95 paper $55 00 cloth 


tan Los Angeles ıs a city-state, a world force, the 
de facto capital of the Ama/Pacific community. In 
this compelling, exhaustively researched study, 
Steven P. Ene offers a preeminent example of how 
global and local energies are converging to create 
a new kind of global city and a new kind of global 
economy.” 

—Dz Kevin Starr, 

State Labranan of California 
$21 95 paper = $55 00 cloth 


Politics from Anarchy to Democracy 
Rational Choice in Political Science 


Edited by IRWIN L. MORRIS, 
JOE A OPPENHEIMER, 
and KAROL EDWARD SOLTAN 
“Politics from Anarchy to Democracy offers both a 
prumer in political philosophy and an introduction 
to rational choxe theory, one of the dominant 
paradigms in politcal scence. Unique ın tts scope, 
this volume is both accessible and provocative, 
and I expect it will have a sgnificant impect on 
both theory and practice.” 
—Michael C. Munger, 
Duke University 
$21 95 paper $55 00 cloth 


Telepopulism 
Media and Politics ın Israel 


YORAM PERI 

This ts the first account of Benjamm Netanyahu’s 
as prime minister. It presents the dramatic cultural 
and polmcal changes that occurred m Israel in the 
1990's with the creation of media-centered democ- 
racy. The author shows how Netanyahu used these 
to construct his poliocal project—Telepopulism. 
$2495 paper $80 00 cloth 


The Welfare Experiments 
Politics and Policy Evaluation 


ROBIN H. ROGERS-DILLON 
“The Welfare Expertments makes a major 
contribution to the literature not just on welfare 
policy, but also on the politics of the policy 
process and public administranon. This is a ser 
ous, sobering look at one of the most mportant 


—Steven Teles, Brandes University 














Free online sample available on our website! 


Politics, Philosophy 
& Economics 


Editors: Gerald F Gaus and Jonathan Riley Murphy 
Institute of Political Economy, Tulane University, New 
Orleans, USA 


Politics, Philosophy & Economics aims to bing moral, 
economic and political theory to bear on the anatyszs, 
Justification and cntcism of political and economic 
insttutons and public policies. The journal seeks to 
provide a distinctive forum for discussions and debates among polrtcal scientsts, 
philosophers, and economists on such matters as consttutional design, property 
nghts, distnbutve justice, the welfare state, egalrtananism, the morals of the 
market, democratic socialism, population ethics, and the evolution of norms. 





For more mformation about the journal, the editors and submissions, contact 
editor@ppe-joumal.org or visit PPE's website at www.ppe-journal.org 
Recent and Forthcoming Papers: 


* The persuasrveness of democratic majontes, Robert E. Goodin and 
David Estiund 


¢ What we owe to distant others, Leif Wenar 

¢ Equality, coercion, culture and social norms, Richard J. Arneson 

¢ Responsibility for past inustice, Chandran Kukathas 

Symposia 

e Ethics, Evolution and Economics: Ken Binmore, Dennis Mueller, 
Herbert Gintis, Alexander Rosenberg 

¢ Distnbutive Justce: Alen Buchanan, Marc Fleurbaey, Philip Pettit, 
Eric Rakowski, John Roemer, Samuel Scheffler 


* The Economic Implications of Republicanism: Philp Pettit, Richard 
Dagger, Geoffrey Brennan, Henry Richardson, Christian List 


Three Times a Year: February, June, October ° ISSN: 1470-594X 


QSAGE Publications 


1 Oliver's Yari, 68 Cay Road, Londos BOTY Ta www.sagepub.co.uk 


Subscription Hotline —* ee) .* Email i 1 ' 





Revised Edition 
NOW ON SALE 





NEUE 
for Political Scarce 


APSA is pleased to announce the release of the new 
edition of the Style Manual for Political Science. 
Recently revised, this edition reflects the current ` 
style of the American Political Science Review and - E, 
contains updated information on documenting D oe 
electronic and online sources. It also includes i 
submission guidelines for APSR and anew foreword 
by Lee Sigelman, editor of APSR. 


Single copy rate: $3 members; 
$6 non-members i 


Order APSA Publications online at 
www.apsanet.org/pubs or call 202.483.2512. 

For more information about these or other APSA 
publications, visit the online catalog at 
www.apsanet.org/pubs/catalog.cfm 























Red Sky at Morning 
America and the Crisis of the 
Global Environment 


James Gustave Speth 


“James Gustave Speth 
brought global environmen- 
tal concerns to the world’s 
attention nearly a quarter of 
a century ago His extraordinary new book 
is an impassioned plea to take these issues 
seriously before ıt is too late. We owe it to 
our children and grandchildren to read Red 
Sky at Mormng and take acuon while we 
can.”—Jimmy Carter, former prendent of 
the United States $24 00 





Ordinary Heroes and 
American Democracy 
Gerald M. Pomper 





and American polincal mst- — 
tutions foster the heroism of n 7" 
ordinary people.”—Bruce Miroff $30 00 


Milosevic 


A Biography 
Adam LeBor 


“The history of Europe 1s 
all too often the history of 
dictators, and Adam LeBor’s 
biography of Slobodan 
iaee wail and aer oeni Sele 
and Hitler. The study of monsters demands 
real innght and scrupulous detail, and 
LeBor’s book has all of that, but there 1s 
more—an exceptionally compelling and 
sharply written story of Europe at the end 
of the 20th century.”—Alan Furst $35 00 





Falun Gong 


The End of Days 
Maria Hsia Chang 


“Chmese interest m and 
practice of rehgion are 
extremely complex subjects, 
but Mana Hsia Chang isthe * 
best possible guide—a veteran pohucal 
scientst, a gifted linguist, and a totally 
independent observer . . . This ıs one of 
the best books on contemporary Ching 
in decades.”—Chalmers Johnson $25 00 





VALE tiiseasie Pree 


Political Science from Yale 


Arguing About War 


Michael Walzer 


“Walzer’s discussions about 
war and ethics are always 
provocauve, well argued, 
and insightful. He 1s able 

to transform complex issues 
into readable, understand- 
able, and persuastve prose ~ 
—Jean Bethke Elshtam 2500 


Why Globalization 
Works 
Martin Wolf 


A distinguished internanonal 
economist here offers a 
powerful defense of global 
market economy. Marun 
Wolf explams how globahra- “ 
tion works, critiques the charges agamst it, 
and offers a realistic scenario for economic 
mternationahsm in the post-9/11 age. $30 00 


. Spe 
WARE 





Pppp | 


ng Lavepaac| st, 
irmative Uo oy 1 
tion avyuuaiyyy : 


E. 

F 

1 

i 

3 
Lo 
A 
f 


dence bearing on them 
Dr. Faye Crosby, a leadmg expert on the 


action and its effects.”—Jack Dovidio 
Current Perspectres m Psychology $30 00 


Emergence of Non-State Authority 
Benjamin Cashore, Graeme Auld, 
and Deanna Newsom 


“A rich theoretical and empirical 
contribution to our understanding of 
private environmental governance in the 
forestry sector m several mdustnal coun 
tries. It will be of considerable interest to 
those mvolved in environmental manage- 
ment and governance.”—David J. Vogel 
$45 00 








SS ie eae 
influenval scholarly Hterature and preparing 
the way for more productve inquiry. No 
student of political parties can ignore it.” 
—Fred Greenstein $17 00 paperback 


One World 
The Ethics of Globalization; Second Edition 
Peter Singer 

With a new preface by the muthor 


“Many people have written 
about the economic mean- 
ing of globalizaton; in One 
World Peter Singer explams 
its moral meaning. No polit- 
ical theorist or moral philosopher, no. 
public official or poHtical activist, can 
afford to ignore his arguments.” 
—Michael Walzer 

314 00 A Yale Nota Bene paperback 


Partisan Hearts 


and Minds 
Political Parties and the 
Soctal Identities of Voters 
Donald Green, 
Bradley Palmquist, 
and Enc Schickler 


“A profound breakthrough in our under- 
standmg of partisan loyalties and makes a 
major contribution to the study of political 
attitudes and voting behavior.” 

—Paul Abramson $18 00 paperback 








Democracy in 
Modern Spain 


Richard Gunther, José Ramón Montero, 
and Joan Botella 





“The case of Spain 1s relevant to the whole 
range of problems of comparative poHtics 
m the many new democracies, and the 
collected expertise of Gunther, Botella, and 
Montero will fill an enormous gap in the 
fleld.”"—juan J. Lmz $4000 


















RUSSIA 


IN SEARCH OF ITSA&LT 


Momentum & modern politics 


Making Government 
Manageable 



















Explaining Foreign Policy 
U S. Decision-Making and the Persian Gulf War 
Stove A. Yotty 













































































































Executive Organization and Management in 
“Using the the Twenty-First Century 
risen cies EUPLAIRING FEREIGD POLICY | | edited by Thomas H. Stanton and 
crisis of 1 
` B | 15 hosm-tat ba hrm rir | | B00tamia Ginsberg 
Steve Yety ee y “Packed with good ideas, practical advice, 
ahoo Ne i and keen insights from observers and 
ımportant f ae REA altke, this a offers 1s a giant in the field of Russian studies To 
foreign policy oe = Pragmatic AOE have tackled a subject as complex as 
decisions are ANE E A Russian identity and the Russian national 
made His fresh R nan J. k, Shh idea with such concision, clarity, and 
and interesung paperback confidence 1s a major achievement.” 
contribution —Strobe Talbott 
Gamat het The Communitarian likod ow ALSS Caries 
competing Constitution $24.95 bardcover 
decision-making Beau Bros! 
theories—rational choice, cognitive, k i im Post-Soviet Women 
domestic policy, organization, and oraa Encountering Transition 
groupthink—to produce a nich interpreta- will sur up some Nation Building, E s l and 
tion of the event”—Ruchard Herrmann, dust and provoke Ci mi ng an 
Director of the Mershon Center at the academic S 
Ohio State Unrvernty cengovers) aaa Communitanen Constitution edited by Kathleen Keobnast and 
$19.95 paperback highly produc- Oe 1e8n8t 0016 Doou Desin Carol Nochemlas 
MRABHTAAARAEORAERETE 





uve way His ateakstahaamraeheeaenans 
strongly stated Aer eanarneoeearenion 
and well-argued testaanattaesaaaseed 
HR RHA AREREARAHEREOn 
thesis—that ACAHERT ERO ENeRHOREaRT 
communitar- weaseanere | 
wean eanen CEEE ZEEE E] 
lanism cannot reitataaaseseseneesed 


‘Women in the former Soviet Union, 
despite a legacy of high levels of education 
and labor force participation, face a host of 
new problems 

Woodrow Wilson Center Press 

$55.00 hardcover 





Liberal Democracy and the 
Social Acceleration of Time 
William E. Scheserman 


“Thus is a ‘must read’ for anyone concerned 
with democracy or consntunonalism ” 
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—Jean Cohen, Columbia Univernty conmigon]  insnmemsiistsee | Decentralization, Democratic 
PAZIO kariet SE EROE E E AE Governance, and Civil Society 
political theorists and students of public in Comparative Perspective 
A Theory of | ir aus D. Sarat, Amherst College SAMP 
' Global $48.00 bard Africa, Asia, and Latin America 
Capitalism edited by Philip Oxhom, Joseph S. Tulchin, 
Pean Ck, | Downsizing Democracy andl Andraw D. Saloa 
and: State ina How America Sidelined Its Cinzens and “A first-rate book It consists of well- 
Ti tional Privatized Its Public researched and well-written case studies 
Hee Matthew A. Creason and that contain plenty of useful new informa- 
tion.”—James Manor, Univernty of Sussex 
Willian I. Boujamin Ginsberg Wbodiow Wilson Center Press 
Robinson “A thoughtful and useful analysıs of $55.90 bardcover 
A “Thel present-day democratic dechne”—Kerry 
eee Lauerman, Washington Post Book Woild Learning from Foreign Models 
ee: igen in Latin American Policy Reform 
ormation 
provides a clear, straightforward, and Daniel Patrick Moynihan edited by Kurt Weyland 
























The Intellectual in Public Life 
edited by Robert A. Katzmann 
“Readers who think they understand 


“This book addresses a very important 
topic with an innovative and interesung 
research design ”—Martha Finnemore, 


convincing account of the economic, 
political, and social contours of contempo- 
rary capitalism ”—Craig N. Murphy, 


Wellesley College, and Chair, Academic A George Washington University 
Counal on the United Nanons System Moyniban’s public career are sure to find Whodrow Wilson Center Press 
Themes in Global Socal Change another dimension of 1t here. Such a book $22.95 paperback 





truly honors its subject ”— Foreign Affairs 
Vbodrow Wilson Center Press 


$17.95 paperbock 


Christopher Chase-Dunn, Senes Editor 
$18.95 paperback 
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THE BEST 


War Crimes and Realpolitik: 
International Justice from 
World War I to the 21st Century 
Jackson NYAMUYA MAoGoTO 

“Offering a welcome new perspective, 
Maogoto highlights the interplay 
between politics and law in the 
international arena.”—Kurt Mills, James 
Madison University » hc $55 © pb $22 


War Crimes 
-Realpolitik 








Nie NEV na i! 
merical Law in World Pomi 
An Introduction 
SHIRLEY V. SCOTT 
“Clearly written and organized in a way 
' that makes sense for students, Shirley 
Scott’s introductory text is one of the 
best I have seen.”"—John King Gamble, 
Pennsytvania State University 
An accessible introduction to the concepts, 
rules, and functioning of international law in 
the arena of world politics. © hc $65 ¢ pb $24.95 


The UN Security Council: From the 
Cold War to the 21st Century 
Davip M. MALONE, EDITOR 
“Encyclopedic in scope and authorita- 
tive in analysis... everyone who cares 
about the future role of the United 
Nations and the international rule of 
law will want to read this book.” 
—John G. Ruggle, Harvard University 
hc $65 © pb $29.95 


Globalization and Inequality: 
Neoliberalism’s Downward Spiral 
JOHN RAPLEY 

“An exciting, thoughtful, and extremely 
well written book... Rapley’s penetrating 
analysis of the multiple global forces 
producing both marginalization and resistance is a must 
read.”—Randolph B. Persaud, American University 

he $49.95 © pb $19.95 


CEEPBRAPING 20 YEARS OI 
. . | e toa . . 












ER PUBLISHERS 


IN POLITICS 
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add IRAN: 


‘CAMPAIGNS 
ELECTIONS 


Campaigns and Elections: 
Issues, Concepts, Cases 

Rosert P. WATSON AND 

COLTON C. CAMPBELL, EDITORS 

“An excellent combination of perceptive r 
scholarship and case studies written by inte 
scholars and practitioners, which places poca 
political campaigning in a realistic 

perspective” —James W. Witt, Perspectives on Politics 
hc $59.95 © pb $23.50 

























Pill Politics: Drugs and the FDA 

STEPHEN J. CECCOL 

“Clearly establishes the political nature of the FDA's drug 
approval process....a welcome addition.” § 
—Kant Patel, Southwest Missouri 
State University « hc $49.95 







Egyptian Politics: 

The Dynamics of 
Authoritarian Rule 

Maye KASSEM 

“A compelling and accessible 
explanation of why democracy is 
stillborn in Egypt today, and why we 
shouldn’t expect the situation to 
change.’—Dlane SIngerman, American | _ - 
University • hc $49.95 ¢ pb $19.95 


Democratic Reform in Africa: 

The Quality of Progress 

E. GyiMAH-BOADI, EDITOR 

“A rich and wide-ranging picture of Africa’s experiences with 
democracy and reform... invigorating and challenging.” 
—Frank Holmquist, Hampshire College 

e hc $59.95 ¢ pb $22.50 


Women and Power on Capitol Hill: 
Reconstructing the Congresslonal Women’s Caucus 
IRWIN N. GERTZOG 

“Well-researched, well-written, and incorporating a wealth 
of detail, this book fills a nearly empty niche.” 

—Karen McCurdy, Georgia Southern University ° hc $49.95 
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WORLD TRADE REVIEW 


...has been established at the initiative of 
the World Trade Organization (WTO) in close 
cooperation with Cambridge University Press. 
lt is an independent journal that includes 
articles written from economic, legal and 
interdisciplinary perspectives on issues of 
relevance to the multilateral trading system. 
Priority is given to papers that, along with being 
academically rigorous, are also accessible to 
government policy officials and the wider 
public. The journal also includes shorter articles 
seeking to rebut or challenge published papers. 




















EDITOR 
Richard Blackhurst, 


uate Institute of Int’! Studies (Geneva) 
EDITORIAL BOARD 
James‘Crawford, University of Cambridge 


J a François, Erasmus University 


Henk Horn, Stockholm University 

Retis Mavroidis, University of Neuchâtel 
mde Sapir, Free University of Brussels 
Wie ph Weiler, New York University 


At Winters, University of Sussex 
AN Warld Trade Organization 
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triannual. volume 3 in 2004 
+e): 4.00 (issn 1474-7456) 
Only): $200.00 (e-issn 1475-3138) 
fi only):/ $48.00 (issn 1474-7456) 
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New from KNOPF 


Disarming Iraq 
by Hans Blix 


The former director of the UN Inspection Commission 
gives his account of the search for weapons of mass destruction 
and the events leading up to America’s invasion and 
occupation of Iraq. His assessment 1s invaluable in trying to 
understand both what brought about the present state of 
affairs and what can be learned in the path toward peace 
and security ın the world after Iraq. 


Pantheon | Cloth | 304 pages | $24.00 


Power, Terror, Peace, and War 
America’s Grand Strategy in a World at Risk 
by Walter Russell Mead 


From one of the most brilliant and original writers on U.S. 
foreign policy comes this stunning and timely book on 
the policy of the Bush administration. In placing the 
administration in historical perspective and analyzing what 
America’s grand strategy should be moving forward, Walter 
Russell Mead in this short book has made a major contribution 
to the debate on America’s future. 


Knopf | Cloth | 240 pages | $19.95 


Bush Versus the Environment 
by Robert S. Devine 
Focumng in on the issues on which the Bush administration 
has been the weakest, and at a time when most Americans are 
caring more and more about environmental issues, Bush Vern 


the Enoronment is a deeply necessary investigation of the Bush 
adminustration’s generally disastrous environmental policies. 


Anchor | Paper | 272 pages | $12.00 


Stalin 
The Court of the Red Tsar 
by Simon Sebag Montefiore 


“Four years ago this book would not have been possible. 
Sebag Montefiore has drawn upon newly opened personal 
files of Party leaders that were transferred . . . in 2000.... 
[I]ts extraordinary revelation of the evil—the complete 
amorality—at the heart of the dictator's court will change 
the way historians approach the great historical questions 
about the Stalinist regime.”—Orlando Figes, Sunday Telegraph 


Knopf | Cloth | 816 pages | $30.00 


For desk and examination copies: 


Reason 
Why Liberais WHI Win the Battie for America 
by Robert B. Reich 
A passionate, cogent, and urgent statement of what liberalism 
is, why it 1s umportant for Americans, why it has been eclipsed 
by radical conservatism, and how it can regain momentum. 
Reason is a desperately needed set of principles, a program and 
a rallying cry for remmvigorating a true two-party system. 
Knopf | Cloth | 272 pages | $24.00 


The Language Police 
How Pressure Groups Restrict What Students Learn 
by Diane Ravitch 
With a New Afterword 
"Fiercely argued... Ms. Ravitch . . . writes with enormous 
authority and common sense. She shows how priggish, 
censorious and downright absurd ‘the language police’ can 


be, and she does so with furious logic. Every bit as alarming 
as it 1s iluminating.”—Michiko Kakutani, The New York Times 


Vintage | Paper | 288 pages | $13.00 


Bushwhacked 
Life in George W. Bush's America 
by Molly Ivins and Lou Dubose 
From Molly Ivins and Lou Dubose, authors of Shrub, 
Busheohacked is a hilarious, no-holds-barred look at George 


W. Bush and his administration, and an essential book for 
understanding the full, destructive umpact of his presidency. 


Vintage | Paper | 368 pages | $12.00 


Going Public 
An Organizers Guide to Citizen Action 
by Michael Gecan 
“Mike Gecan . . . has grven this nation a muscular manual for 


the century ahead. There is nothing ethereal about the moral 
vinon ın Gomg Publ. It is a book about doing right and 


making social change not by playing the pitiable victim but by 
wielding power against power. . . . the story and the strategy 
reside in this remarkable book.”—Samuel G. Freedman 


Anchor | Paper | 224 pages | $12.00 


Visit: www.randomhouse.com/academic 


Email: acmart@randombouse.com 


Write: Knopf Academic Marketing, 1745 Broadway, 20th Floor, New York, NY 10019 
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Left and Right Agree... 





SHATTERED DREAMS 


The Failure of the Peace Process in the 


SerUM RABE vidde East, 1995-2002 
DREAMS Charles Enderlin 


Susan Fairfield, translater 


SHARGEs Pepin 


The most complete inside account yet of what actually happened at 
Camp David. .. . [E]ssental reading.” 


— THE NATION 





“A fly-on-the-wall reconstruction of the top-table diplomacy of the entre 
peace process.” 


— THE NATIONAL REVIEW 





“Enderlin depicts with uncanny insight the major events that make up the 
pieces of this yet-unfinished mosaic, and portrays with much percepuveness 
the cast of characters that played such a key role m shaping the course of 
events. Shattered Dreams makes a compelling read, providing an m-depth 
record of a troubled peace process that has sull not come to fron.” 





— SHIMON PERES, former Prime Minister of Israel 


Hardcover + $28 





Now in Paperback... 
BREAKING RANKS 


Refusing to Serve in the 
West Bank and Gaza Strip 





BREAKING” SA 


Ronit Chacham RANKS : 
“This book should be read by those of all polrncal views concerning the 7. eae ots SeS 
Israeli-Palestinian conflict . . . . I was profoundly moved by rt.” K 5 Sur AN oor 
— DAVID GORDIS, President, Hebrew College dh eee hee 


Paperback - $15 





FF OTHER PRESS 


www.otherpress.com ° To order call toll free 1-877-THE OTHER or emal! orders @ otherpress.com 
To receive emal updates from Other Press: www otherpress.com/subscnbe 
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p Political Science Research Online 


APSA, In cooperation with a consortium of political sdence and related assodations, is pleased tc 
announce the launch of PROL: Political Sdence Research Online! PROL is a comprehensive electron 
ic archive of pre-published research in political sdence featuring over 10,000 conference papers, 
preprints, and center and institute reports all at one centralized web site. 





A multipronged effort to facilitate political science scholarship, the launch of PROL marks the firs 
time that a vast portion of pre-published research In political sclence, International studles, and 
public administration Is available In one central location. PROL consists of three major compo- 
nents: 

Conference Papers 

Conference papers are the comerstone of PROL and represent the greatest volume of pre-publlshed schol- 
arship In political science. Within a year, as many as 17,000 scholarly papers will be avallable through PROL 
In addition to APSA Annual Meeting papers back to 2001, APSA provides support for other associations to 
place thelr conference papers online for the first tme through PROL 


PROL also provides an outlet for Individual scholarly papers provided by scholars for comment or dissem 
nation at earty, pre-publlshed stages of thelr work. There Is no other common source like this avallable. In 
addition, PROL helps reduce the fragmentation occuring from sharing pre-publlshed research only among 
small subgroups of the discipline. 


Center and Institute Reports 

PROL also Incorporates, or polnts to, political sclence research done through centers and Institutes, and 
presented on thelr web servers. Often thls work Is accessible only through outlets that are unknown to 
most polltical scientists and those seeking political sclence research. PROL thus bridges multtdisciplinary 
and subfield divides. 


PROL represents a partnership between APSA and over a dozen scholarly organizations. The 
consortium continuously seeks out partners. The organizations below represent those who dis- 
play work on PROL or are participating in Its organization and development 


Partners 

American Society for Public Administration 
International Political Sdence Association 
Intemational Studies Association 

Midwest Political Sclence Association 
National Conference of Black Political Scientists 
New England Political Science Association 
Northestem Political Science Assoclation 
Political Studies Assoclation (UK) 
Southwestern Political Sclence Association 
Southem Political Sdence Association 
Westem Political Science Association 


PROL is made possible through the generous funding of the Andrew W. 
Mellon Foundation. 


p rol Keep abreast of the latest political sdence research, visit 


(C www.politicalscience.org 
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CAROLYN DEVER 


Skeptical Feminism 

Activist Theory, Activist Practice 
Powerful, illuminating, and galvanizing, 
Skeptical Feminism traces the strategies 
the women's movement has used to make 
theory matter—and points toward a new, 
politically engaged approach to feminist 
thought. 


$18 06 paper + $58 96 cloth + 258 pages 





CURTIS MAREZ 

Drug Wars 

The Ahi Economy of Narcotics 
Curtis Marez examines two hundred years 
of writings, graphic works, films. and 
music, providing a bold interdisciplinary 
exploration of drugs in the popular imagi- 
. nation. Throughout, Marez juxtaposes 
official drug policy and propaganda with 
subversive images that challenge govern- 
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Notes from the Editor 


For more than a century and a half, Democracy in 
America has been an indispensable starting point for 
understanding American politics. After the second vol- 
ume of Democracy was published in 1840, Tocqueville 
continued to monitor political developments in this 
country and committed many of his thoughts to paper, 
in letters to friends in America. Unfortunately, his epis- 
tolary ruminations have never been translated into En- 
glish, and their very existence seems to have largely 
been ignored until now. In “The Third Democracy: 
Tocqueville’s Views of America after 1840,” Aurelian 
Craiutu and Jeremy Jennings use the post-1840 letters 
to tell the rest of the story—or, as they put it, to “recon- 
struct what Volume Three of Democracy might have 
looked like if it had ever been written.” (The portrayal 
of party canvassing on the cover of this issue com- 
memorates that period in American politics.) Craiutu 
and Jennings’s creative reconstruction reveals a sig- 
nificant change in Tocqueville’s perspective, and more 
specifically a growing disenchantment with America 
occasioned by his sense that such problems as corrup- 
tion, slavery, imperialism, and the encroachment of the 
economic sphere upon the political would jeopardize 
freedom and stability in America. Thus, rather than 
the relatively sunny picture that emerged from the first 
two volumes of Democracy, Craiutu and Jennings ar- 
gue that the excesses of democracy in America would 
have been the central focus of the final volume of Toc- 
queville’s seminal work. 

In “Madison vs. Hamilton: The Battle Over Repub- 
licanism,” Colleen Sheehan takes another fresh look 
at the early period of the American republic, this time 
to shed light on a feud that “left a lasting impression 
on the American political landscape.” Sheehan’s focus 
is on the philosophical differences that led Alexander 
Hamilton and James Madison, erstwhile allies in co- 
authoring the Federalist Papers, to become politically 
estranged during the 1790s. Their dispute was not one 
of personal animosity or, as is often claimed, the re- 
sult of Madison giving in to his affection for Thomas 
Jefferson. Rather, Sheehan argues that the conflict be- 
tween Hamilton and Madison was propelled by their 
fundamental disagreement over the nature and role of 
public opinion in a republican government—an issue 
that has been played out, in America and elsewhere, 
over the succeeding two centuries. 

Echoes of that conflict can be heard in David 
J. Samuels’s “Presidentialism and Accountability for 
the Economy in Comparative Perspective.” Demo- 
cratic principles require that public officials be 
accountable to the voters for their actions. This is most 
easily realized when voters are able to attribute 
policy success and failures to specific political decision 
makers. Samuels’s contribution is to investigate the 
impact of institutional, electoral, and political rules and 
forces on accountability in 23 presidential democracies. 
Although the virtues and vices of presidential and 
parliamentary systems are often disputed, Samuels 
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finds that accountability is achievable in either type 
of system. The crucial factor, he concludes, is not a 
presidential versus a parliamentary structure as such, 
but rather the concurrence or non-concurrence of 
elections; in short, the timing is what matters—a new 
perspective on a long-running debate. 

In many newly industrializing states, the key to suc- 
cess is often seen as the liberalization of the state-led 
economy. In most, if not all, instances, liberalization has 
been based on an invariant set of assumptions about 
how individuals make economic choices. However, a 
body of cultural and psychological research indicates 
that people in different societies make choices accord- 
ing to different criteria. In “It’s Not Whether You Win 
or Lose, But How You Play the Game: Self-Interest, 
Social Justice, and Mass Attitudes toward Market Tran- 
sition,” Raymond M. Duch and Harvey D. Palmer 
use a combination of survey and experimental data 
collected in the West African nation of Benin to address 
the micro-foundations of market-based solutions to 
economic transition and development. Benin provides 
an ideal research locale for their purposes, Duch and 
Palmer argue, because the behavioral assumptions of 
classical micro-economics are least likely to hold in 
such a setting. It is thus striking that Duch and Palmer 
find that, although there are indeed variations in Benin 
in the way that “social capital” is employed, these dif- 
ferences do not extend to basic economic reasoning. 
This evidence, drawn from a most-likely case for the 
importance of cultural differences, offers a more nu- 
anced view of economic decision making and in doing 
so suggests ways that tailoring incentive programs to 
local social conditions could enhance their effective- 
ness and economic growth. 

In “Credible Commitment and Property Rights: Evi- 
dence from Russia,” Timothy M. Frye also delves in 
the micro-foundations of economic growth. Drawing 
on surveys of company managers in Russia, Frye con- 
siders the ideas that property rights are threatened by 
private agents and secured by state institutions or, alter- 
natively, attenuated by the arbitrary exercise of state 
power. Frye’s results suggest the need to reconsider 
such sharply contrasting perspectives, for the pattern 
that emerges is more subtly nuanced and has wide- 
ranging implication for policy reform in the Russian 
case and, by extension, elsewhere as well. 

From government regulation in Russia, this issue ex- 
ecutes a half-turn to consider government regulation 
in the United States. Scholars of bureaucratic and leg- 
islative politics have long wrestled with the challenge 
of how to test theoretical models asserting congres- 
sional influence. Charles R. Shipan, in “Regulatory 
Regimes, Agency Actions, and the Conditional Na- 
ture of Congressional Influence,” rises to the challenge 
and in so doing provides intriguing evidence that the 
real question is not whether Congress can regulate its 
bureaucratic agents, but rather when it is best posi- 
tioned to do so, Shipan presents a model that integrates 
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the role of both the executive and legislative branches, 
particularly the congressional committee system, to 
constrain bureaucratic actors and, in an especially nice 
example of the complementarity of formal and empiri- 
cal approaches, tests that model against the monitoring 
activities of the Food and Drug Administration. 

When bureaucrats are too smart or too capable for 
their principals’ good, the would-be principals face a 
dilemma that has served as a source of much theoretical 
and empirical work. But what if the opposite were true? 
That is, what would happen if bureaucrats were unable 
to do what was expected of them? In “Bureaucratic Ca- 
pacity, Delegation, and Political Reform,” John Huber 
and Nolan McCarty show that when agents are unable 
to fulfill their assigned tasks, the effects are not simple 
or straightforward. Their exposition promises to open 
up a whole new discussion on why agents will not—or 
cannot—fulfill their principals’ goals. 

Another modeling exercise was inspired by the “doc- 
trinal paradox” of law and economics, in which major- 
ity concurrence on decision premises does not neces- 
sarily produce majority concurrence on a resolution. 
Christian List’s “Model of Path-Dependence in De- 
cisions over Multiple Propositions” focuses on path- 
dependent decisions. For a system of law and policy to 
remain coherent, future rulings, amendments, and the 
like must accept earlier decisions as necessary founda- 
tions. In essence, the order in which propositions are 
considered may affect a decision’s outcome. This type 
of constraint occurs when individuals are boundedly 
rational and when decisions are collective. Such path- 
dependence allows strategic agenda-setters to manip- 
ulate outcomes by altering the order in which propo- 
sitions are considered. More subtly, the existence of 
alternative decision-paths provides a rationale for ac- 
tors to lie about the virtues of alternatives. 

Finally, the concluding article in this issue steps away 
from the abstract and highly formalized approach rep- 
resented in the Huber-McCarty and List analyses to 
consider some concrete and very familiar manifesta- 
tions of American politics. Are “soccer moms” and 
“NASCAR dads” real and significant groups in the 
American political landscape? Or are they journalis- 
tic hyperboles? According to Janet Box-Steffensmeier, 
Suzanna De Boef, and Tse-min Lin in “The Dynamics 
of the Partisan Gender Gap”, there are indeed endur- 
ing and important differences in the ways American 
men and women think and vote. By analyzing survey 
data over a two-decade span, these authors show that 
underlying demographic and cultural changes in the 
populace have made the differences between the gen- 
ders as pervasive as those between “red states” and 
“blue states.” 


INSTRUCTIONS TO CONTRIBUTORS 


General Considerations 


The APSR strives to publish scholarly research of 
exceptional merit, focusing on important issues and 
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demonstrating the highest standards of excellence 
in conceptualization, exposition, methodology, and 
craftsmanship. Because the APSR reaches a diverse 
audience of scholars and practitioners, authors must 
demonstrate how their analysis illuminates a significant 
research problem, or answers an important research 
question, of general interest in political science. For the 
same reason, authors must strive for a presentation that 
will be understandable to as many scholars as possible, 
consistent with the nature of their material. 

The APSR publishes original work. Therefore, au- 
thors should not submit articles containing tables, 
figures, or substantial amounts of text that have al- 
ready been published or are forthcoming in other 
places, or that have been included in other manuscripts 
submitted for review to book publishers or periodicals 
(including on-line journals). In many such cases, sub- 
sequent publication of this material would violate the 
copyright of the other publisher. The APSR also does 
not consider papers that are currently under review 
by other journals or duplicate or overlap with parts of 
larger manuscripts that have been submitted to other 
publishers (including publishers of both books and 
periodicals). Submission of manuscripts substantially 
similar to those submitted or published elsewhere, or 
as part of a book or other larger work, is also strongly 
discouraged. If you have any questions about whether 
these policies apply in your particular case, you should 
discuss any such publications related to a submission in 
a cover letter to the Editor. You should also notify the 
Editor of any related submissions to other publishers, 
whether for book or periodical publication, that occur 
while a manuscript is under review by the APSR and 
which would fall within the scope of this policy. The 
Editor may request copies of related publications. 

If your manuscript contains quantitative evidence 
and analysis, you should describe your procedures 
in sufficient detail to permit reviewers to understand 
and evaluate what has been done and, in the event 
that the article is accepted for publication, to per- 
mit other scholars to carry out similar analyses on 
other data sets. For example, for surveys, at the least, 
sampling procedures, response rates, and question 
wordings should be given; you should calculate re- 
sponse rates according to one of the standard formulas 
given by the American Association for Public Opinion 
Research, Standard Definitions: Final Dispositions 
of Case Codes and Outcome Rates for Surveys 
(Ann Arbor, MI: AAPOR, 2000). This document is 
available on the Internet at <http://www.aapor.org 
/default.asp? page = survey_methods/standards_and_ 
best_practices/standard_definitions>. For experiments, 
provide full descriptions of experimental protocols, 
methods of subject recruitment and selection, subject 
payments and debriefing procedures, and so on. Arti- 
cles should be self-contained, so you should not simply 
refer readers to other publications for descriptions of 
these basic research procedures. 

Please indicate variables included in statistical anal- 
yses by capitalizing the first letter in the variable 
name and italicizing the entire variable name the first 
time each is mentioned in the text. You should also use 
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the same names for variables in text and tables and, 
wherever possible, should avoid the use of acronyms 
and computer abbreviations when discussing variables 
in the text. All variables appearing in tables should 
have been mentioned in the text and the reason for 
their inclusion discussed. 

As part of the review process, you may be asked 
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ratic regime. Although Tocqueville never wrote a third volume, he continued to be interested in 


T ieran Democracy in America offered the image of an accomplished and successful demo- 
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American political events and exchanged a number of important letters with his American friends 
after 1840. Did Tocqueville change his views on America outlined in the two volumes published in 1835 
and 1840? If so, did the evolution of his views of America affect his theory of democracy? The paper 
answers these questions by examining Tocqueville's unduly neglected correspondence with his American 
friends. It seeks to reconstruct what Volume Three of Democracy in America might have looked like if it 
had ever been written. In these letters, Tocqueville addressed important topics such as the instability of the 
market and the immaturity of American democracy, issues that did not loom large in the two published 
volumes. The paper shows that in the last years of his life Tocqueville became very disenchanted with 
American political life and reassessed some of his previous views of American democracy. 


I, who am half Yankee. 
Tocqueville (1986, 185) 
A century and a half after his death, the writ- 


ings of Alexis de Tocqueville remain an in- 

exhaustible source of inspiration for political 
theorists, sociologists, philosophers, legal scholars, and 
historians. As one of the most quoted political theorists 
on both ends of the political spectrum, Tocqueville is 
celebrated as the theoretician of the modern demo- 
cratic revolution at the heart of which lies the equality 
of conditions. 

Our fascination with Tocqueville has deep roots 
and can be explained by the fact that his work, 
to quote Cheryl Welch (2001, 1), “seems to re- 
tain a greater measure of normative and exploratory 
power—and intellectual provocation—than that of 
many other nineteenth-century thinkers.” As Sheldon 
Wolin (2001, 564—65) has recently argued, Tocqueville’s 
universal appeal can be accounted for by the acuity 
with which he grasped and described the key features 
and dilemmas of modern democracy, the complexity 
of political culture, the importance of civil society, and 
the far-reaching consequences of high social mobility. 
In Wolin’s words “To reflect on present day American 
politics invites reflection on Democracy in America and 
vice versa” (3). 

Democracy in America (1835, 1840) offered the 
image of an accomplished and successful American 
democracy based on the equality of conditions and the 
sovereignty of the people. American democracy com- 
bined administrative decentralization and political cen- 
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tralization, allowed for self-government, and provided 
for a judicious separation of powers and a genuine sys- 
tem of checks and balances. Tocqueville contrasted the 
regular, stable, and effective institutions and mores of 
America with those existing in Europe at that time 
that contained relics of an aristocratic past that proved 
to be a stumbling block against political democracy. 
Thus, in his interpretation, the Americans had solved 
one of the oldest and most important questions in 
political philosophy: What is the best form of gover- 
nment? 

The extraordinary success of Tocqueville’s master- 
piece provided the lenses through which countless gen- 
erations viewed and interpreted the virtues and limi- 
tations of American democracy. It also explains the 
often idealized image of America that admirers of 
Tocqueville derived from reading his book. But was 
Volume Two of Democracy in America (1840) his 
definitive statement on America? Tocqueville lived 19 
more years, often despairing that his native country 
would ever be able to achieve a political regime com- 
parable to the American one. Even if he never wrote 
a third volume of Democracy in America and became 
more and more involved in French politics, Tocqueville 
continued to be interested in American political events 
and exchanged a number of important letters with his 
American friends, some of whom had been instrumen- 
tal in providing him with information about the United 
States while Tocqueville was completing Volume One 
of Democracy in America. 

These letters were published in Volume VI of 
Tocqueville’s Complete Works (1986) and have not 
been translated into English to this day. It is reveal- 
ing that Roger Boesche’s otherwise splendid selection 
from Tocqueville’s letters published two decades ago 
(Tocqueville 1985) contained no single letter from 
Tocqueville’s correspondence with his American 
friends after 1840. Thus, it is both important and 
necessary to ask the following set of questions. Did 
Tocqueville change his views on America outlined in 
the two volumes published in 1835 and 1840? If so, 
which of his views did change and why? What were the 
continuities and discontinuities? How did Tocqueville 
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come to view the American republic as it edged ever 
closer to civil war? Perhaps more importantly, did the 
evolution of his views of America affect his theory 
of democracy and what were the implications of any 
“failure” of American democracy? 

This essay answers these questions by examin- 
ing Tocqueville’s correspondence with his American 
friends after 1840 as well as his participation in the 
constitutional debates of 1848 in France. Our main 
purpose is to reconstruct from Tocqueville’s letters 
what the message of Volume Three of Democracy in 
America might have been if that book had ever been 
written. The importance of Tocqueville’s letters for un- 
derstanding his post-1840 views on American democ- 
racy has been underestimated or neglected in the aca- 
demic English- ing circles. A notable exception, 
however, is Sheldon Wolin. His Tocqueville between 
Two Worlds (2001) called our attention to the para- 
doxical fact that after a long silence about American 
democracy, Tocqueville “suddenly rouses himself to a 
renewed interest in America, follows its desperate de- 
bates over slavery, fears that if the cause of freedom is 
lost in America, its future in Europe will be lost as well, 
and, as though in a last effort to resolve a lifetime of 
ambivalence about democracy, declares himself at the 
end to be half-Yankee” (562) Given the prominence 
of Tocqueville’s work, the general lack of interest in 
exploring his later impressions of America is surpris- 
ing.! Even classic books such as Jardin’s biography of 
Tocqueville (1988), Pierson’s (1996), and Schleifer’s 
(2000) studies of the writing of Democracy in America, 
have virtually nothing to say about Tocqueville’s post- 
1840 views of America. 

Therefore, our goal is to initiate a dialogue that might 
be of interest not only to political theorists but also to 
historians and students of American politics in gen- 
eral. The essay is divided in two major parts. It begins 
by exploring the image of America in France before 
Tocqueville. This preliminary discussion provides the 
necessary background for interpreting Tocqueville’s 
correspondence with his American friends; it also gives 
us a better understanding of the nature and original- 
ity of Tocqueville’s analysis of American democracy 
placed in its original historical context. As we shall see, 
Tocqueville came to share some (though not all) of 
the opinions held by his predecessors who espoused 
a critical view of America. The discussion of the im- 
age of America in France also addresses, if obliquely, 
the issue of anti-Americanism, a highly relevant topic 
today when the rhetoric of American exceptionalism 
and America’s hegemony dominate the headlines? If 


1 Among historians, two notable exceptions are Brogan 1991 and 
Mélonio 1987. It is surprising that the issue of Tocqueville’s changing 
views of America after 1840 was overlooked by important exegeses of 
Tocqueville such as Eisenstadt 1988 and Masug: 1991. Welch (2001) 
devotes a footnote to this topic, while other recent mterpretations 
of Tocqueville such as Janara 2002 and Mitchell 2002 gloss over the 
evolution of Tocqueville’s views of America after the publication of 
Volume Two of Democracy. 

2 On this topic, see Mathy 1993 and Roger 2002. For a concise discus- 
sion of the history and significance of anti-Americanism, see Ceaser 
2003. 
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America must remain unique and thinks that it should 
offer an example to the entire world, then the implica- 
tions of a potential failure of the American model are 
as staggering today as they were for Tocqueville two 
centuries ago. 

The second part of our essay examines in detail the 
main points and concerns about America raised by 
Tocqueville in his letters to his American friends writ- 
ten after the publication of Volume Two of Democracy 
in America. As we shall see, his warnings against the 
effects of the instability of the market on the function- 
ing of political institutions and the damaging effects 
of unbridled materialism sound surprisingly fresh to- 
day when economics dominates politics to an unprece- 
dented degree and the relationship between democracy 
and the market has become a key topic in our public 
debates. More generally, Tocqueville’s remarks invite 
us to reflect upon the problems and dilemmas faced 
by an America where, according to many social critics, 
the vitality (as well as viability) of political and civil life 
seem to be on the decline. 

A methodological caveat is in order here. Recon- 
structing the possible message of a hypothetical Vol- 
ume Three of Democracy in America is a speculative 
theoretical attempt that must be taken with the neces- 
sary grain of salt. Such an endeavor is likewise subject 
to the possible criticism that the views expressed in 
private letters are not those that would have been ex- 
pressed in public and, thus, that no firm conclusions can 
be drawn about how Tocqueville might have changed 
his views on America after the publication of Volume 
Two of Democracy in America. If, it can be argued, 
in published texts his fears for America were likely 
to be expressed obliquely, then in personal correspon- 
dence there is less reticence about expressing views 
more openly and, therefore, more harshly. Similarly, 
it can be contended that Tocqueville might have tai- 
lored his comments to suit the views of his individual 
correspondents. If these methodological reservations 
are accepted, it would be easy to conclude that the 
criticisms voiced of America by Tocqueville in his let- 
ters after 1840 were not substantially different in ei- 
ther substance or tone from the sense of foreboding 
that is clearly evident in some parts of Volume Two of 
Democracy in America. 

Our response to these challenges is the following. 
Tocqueville’s letters represent an indispensable source 
of information for any student of his work. This is 
documented not only by the wide array of topics ad- 
dressed by Tocqueville’s correspondence but also by 
his request to some of his friends to keep his letters. 
One might go as far as to argue that in many re- 
spects, Tocqueville’s letters are as important as his pub- 
lished writings. Tocqueville writes so beautifully that 
his readers enjoy his letters for both their style and 
their substance. Much more than the published books, 
Tocqueville’s correspondence demonstrates his strong 
passion for liberty as well as the seriousness with which 
he sought to apply his political and philosophical ideas 
in the public realm. And unlike his published writings, 
Tocqueville’s letters also allow us to trace his hopes, 
fears, and disillusionment with politics so that, to quote 
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Roger Boesche, “when we know the man better, we 
find him no less political.” 

Volume VII of Tocqueville’s Complete Works lists 
133 letters in total, of which 28 were written before 
1840. Because Tocqueville’s correspondents remain 
substantially the same over the entire period, covering 
the 28 years from 1831 to 1859, we can assume that 
he felt no need to adopt new epistolary strategies or 
to tailor his comments to suit the views of his corre- 
spondents. Moreover, if the recipients of his letters re- 
mained unchanged, the views expressed by Tocqueville 
certainly did change over time, the most significant shift 
occurring in the 1850s. The early letters were usually 
designed to solicit information, advice, or guidance. 
It is only in the later exchanges that his deep mis- 
givings and criticisms became evident, and when they 
did so they became matters of central importance to 
his correspondence. Finally, Tocqueville’s reappraisal 
of America drew not upon a rereading of his earlier 
evidence of American life and institutions but upon a 
close observation of contemporary political and social 
developments in America that postdated the publica- 
tion of his earlier account. It was these developments, 
and not the reassessment of earlier evidence, that led 
him to the stark conclusion that America no longer 
held out hope for the friends of liberty. 


BEFORE TOCQUEVILLE 


A thorough analysis of Tocqueville’s assessment of 
American democracy before and after 1840 requires 
that we first survey various French attitudes toward 
America during the Bourbon Restoration (1814-1830) 
and the early years of the July Monarchy (1830-1835) in 
order to place Tocqueville’s views on American democ- 
racy into the larger historical and cultural context to 
which they belonged. His analysis of America has too 
often been detached from this background and the 
time has come for a more historical and contextual 
interpretation of Tocqueville’s writings. 

It is worth repeating that Tocqueville was not the 
first one to write on the nature of American democ- 
racy in France.* Before it came to be seen as a po- 
litical model for nineteenth-century Europe, America 
had previously acquired the almost-mythic image of a 
country unspoiled by luxury, a land where the main oc- 
cupation was agriculture and people lived peacefully in 
austerity, frugality, and virtuous simplicity. Crévecour’s 
(1957) letters describing the modes of farming and 
the manners and peculiar customs of the Americans 
contributed a lot to the dissemination of this pastoral 
and idyllic image of an American Arcadia.° As such, 


3 From Boesche's introduction to Tocqueville 1985, 20. 

4 Two names stand out: Crèvecoeur and La Fayette. The first was 
a Frenchman who emigrated to the United States and became a 
farmer in Orange Country, New York. He was the author of Letters 
from an American Farmer (first published in London in 1782), which 
depicted eighteenth-century American rural hfe. Because of his sup- 
port for the American republic, La Fayette was a sort of mythical 
hero in the United States and enjoyed a promment political status m 


postrevolutionarý France. 
See, for example, Crèvecoeur 1957, 7-8. 
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America appeared as a boundless and bountiful conti- 
nent, a virgin environment, and a territory of adventure 
and discovery that inspired the traveler’s imagination. 

Eventually, the ideal of a frugal, simple, pastoral, 
and virtuous life came to be replaced by the image of a 
frantic social and economic life in constant transforma- 
tion, bent on indefinite progress and innovation. This 
change led to the appearance of a new idea of happiness 
that was in stark contrast to the older one. Individuals 
no longer derived their happiness from keeping their 
desires limited, cultivating their plot of land, and living 
asimple and frugal life. On the contrary, their happiness 
was directly related to the rise of their living standards, 
comfort, and material prosperity as a result of booming 
commerce and navigation. America eventually came to 
be regarded as the hallmark of utilitarianism, a country 
in which everything and everyone were on the move, in 
constant search for a way of improving the conditions 
of daily life. The dynamic face of America was seen as 
an outcome of its democratic institutions and came to 
be regarded as a possible model for Europe. 

What is less known, however, is that in France the 
idea that the study of the political organization and 
institutions of the United States might profit those in- 
terested in the future of democracy had appeared be- 
fore the publication of the first volume of Tocqueville’s 
Democracy in America. As René Rémond (1962, 340, 
650-51) pointed out in his magisterial study of the im- 
age of the United States during the first half of the 
nineteenth century in France, from 1815 to 1830 the 
majority of French public opinion looked favorably 
at America and was relatively optimistic with regard 
to the stability and future of its political institutions. 
In his important history of the United States pub- 
lished in 1825, Charles-Arnold Scheffer (1796-1853), 
the secretary of La Fayette and, later, editor of Com- 
merce, Claimed that the American federation showed 
the entire world the “true popular government and 
liberty.”” The suggestion made by Scheffer was that 
the republican political experiment that succeeded in 
America had a universal importance because it paved 
the way for similar political developments in Europe. 
The same theme appeared in Revue américaine, edited 
by Armand Carrel from July 1826 to June 1827. The ar- 
ticles published in this journal focused their attention 
upon the Americas as a whole, with the intention of 
providing as much information as possible about the 
constitutions and governments of Latin, Central, and 
North America® 


6 For an excellent summary, see Rémond 1962, 650-51. The French 
Ideologues (Dupont de Nemours, Destutt de Tracy) admired 
America and it was not a mere coincidence that Thomas Jef- 
ferson translated into English Destutt de Tracy’s commentary on 
Montesquicu’s Spurt of the Laws (the book appeared first in English 
in 1811 and eight years later in French). In turn, Madame de Staal 
(1818) expressed her appreciation for the American political exper- 
iment and equated the government of the United States with the 
government of reason. For more details, see Madame de Stael 1818, 
part VI, chap. 7. Also see Rémond (1962, 552, 636-37). 
Scheffer as quoted in Rémond 1962, 532. 

8 For more details on references to the American model, see Rémond 
1962, 657, n 11. 
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Thus, sympathy for America and the idea that the 
government of the United States might be a model 
for France came in various nuances and served dif- 
ferent political agendas. For example, Armand Carrel 
described the American constitution as “a model con- 
stitution,” adding that this was not due to the fact 
that the Americans possessed an instinctive political 
genius. “The American constitution,” he commented 
(Carrel 1832) “is not foreign to us; it is the daugh- 
ter of eighteenth-century French political doctrines, 
fortunately combined with the most reasonable and 
tested elements of English institutions.” Defenders of 
America such as Madame de Staél, Birbeck, Barboux, 
Vinet, Scheffer, and Barbé-Marbois invoked the wis- 
dom of its laws, principles, and institutions.’ Vinet ar- 
gued that “this state seems to have solved the prob- 
lem which consists in reconciling the highest degree of 
individual liberty and the supreme conditions of secu- 
rity and order.”*? Other French writers claimed that 
in America power was limited, no one was above the 
law, and everyone obeyed the laws that were clearly 
formulated to serve the common good.!! The Amer- 
ican government, it was argued, imposed low taxes, 
and treated its citizens with respect. “As the hand of 
Providence, it governs without being felt and almost 
without being perceived,” wrote the author of an article 
published in Revue encyclopédique in August 1819. 
Other virtues of the American system highlighted by 
the French included the strong educational system,’ 
the abolition of hereditary power, the presence of ex- 
ceptional political leaders, and civic virtue (linked to a 
healthy family life and a solid work ethic). Last but not 
least, the French emphasized the connection between 
liberty and religion in the United States and pointed to 
the important role played by religious toleration, lib- 
erty of conscience, and freedom of the press in securing 
individual liberties. Worth noting in all these accounts 
is the identification of America with liberty and the 
seminal relation among freedom, reason, and virtue, 
themes that also appeared in Tocqueville’s work a few 
years later. 

Nonetheless, doubts about the proper functioning of 
American institutions began to appear gradually after 
1832 in France, as the United States entered a new 
stage of its political development with the coming of 
age of a young generation of politicians who imposed a 
new political style. The old, positive image came to be 
replaced by the image of a country with an uncertain 
future, facing a complex set of problems and challenges, 
and therefore unable to serve as a political model for 
Europe. What did the French dislike about American 


9 For more details, see Rémond 1962, 540-43. 

10 Vinet as quoted in Rémond 1962, 549, n. 32. Vinet’s Memoir in 
Favor of Liberty was published in 1825 

11 “Arbitrary power 1s unknown here; everyone enjoys a liberty mod- 
erated by the need to use it,” wrote Dumersan in 1822 (as quoted in 
Rémond 1962, 544). 

12 As quoted in Rémond 1962, 545. 

13 See, for example, the following statement by Dupont de Nemours, 
dating from 1812. “Concerning education, the United States are more 
advanced than most other political societies” (as quoted m Rémond 
1962, 554, n. 59). 


394 


democracy? The question is particularly important in 
light of our attempt to reevaluate Tocqueville’s criti- 
cism of American democracy against the background 
of earlier analyses of America. 

The concerns expressed in the French press of this 
period (1832-35) varied from doubts regarding the 
maintenance of the unity of the country confronted 
with the problem of slavery to calling into question 
the existence of a genuine national character or of 
the American people.'* The large and expanding ter- 
ritory of the Union also seemed a serious challenge 
in this regard. The French wondered if the American 
republic would be able to survive and surmised that 
it might be replaced by monarchy or a new form of 
military dictatorship. More importantly, the French 
began questioning not only the possibility of importing 
American institutions, but also their own raison d’étre. 
It is no small matter that the French came to view 
with skepticism two of the principles that had previ- 
ously been considered pillars of American democracy: 
bicameralism and federalism. American individualism 
was also regarded as a form of “narrow rationalism” 
and “Protestant egoism,”!° and was soon to be identi- 
fied with the doctrine of the Anglo-Saxon “Yankee.” It 
was described as a form of “solidly organized egoism,” 
“the evil regulated and systematized.”!” 

A common theme in the French press of that period 
was the image of America as a young and immature so- 
ciety. “The government of the United States,” claimed 
Guizot in a famous discourse from 1834, “is a good 
and beautiful government for the United States, in the 
circumstances in which this society found itself at the 
moment of its birth, because it is a society that has just 
been born, une société enfant.”'8 The American model 
came to be questioned even on the left wing of the 
political spectrum. The most progressive republicans 
expressed their discontent with a number of aspects 
of American democracy and made a fundamental dis- 
tinction between the moral aspects of the American 
democracy and its political institutions and principles. 
What is interesting is that they questioned the moral 

of American life, while accepting the political 
model. This dichotomy appeared, for example, in the 
following fragment of A. Marchais and J. F Dupont 
published in Revue républicaine in 1834. “From the 
moral point of view,” they wrote, “we do not belong to 
the American school. From the practical point of view, 
we belong to the American school... in this sense that 
we invoke the example of America as a practical proof 


14 «A merica,” wrote Gustave de Beaumont (1958), ın Marie “is the 

land of the free who cannot do without slaves America is the cradle 

of equality, and no country in Europe contains so much servitude” 

(57). 

15 The fact is, however, that doubts about the virtues and qualities 

of America were voiced even before the moment of mdependence. 

As Philippe Roger (2002) has recently shown, the “prehistory” of 

French anti-Americanism ls to be found in the Enlightenment. 

16 Marchais and Dupont (in Revue républicaine, April 1834) as 
uoted in Rémond 1962, 670, n. 17. 

17 Cense (in L’Européen, November 1835) as quoted in Rémond 

1962, 670. 

18 Guzot as quoted in Rémond 1962, 664, n. 20. 
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that demonstrates the application of the republican 
doctrine of universal suffrage and its consequences.”9 

Finally, the early 1830s saw the emergence of a new 
topic that had been somewhat muted before: slavery. 
The idea that the presence of slavery tended to under- 
mine the future of American democracy gained wide 
currency during this time. In an article published in 
Revue britannique in 1831, Saulnier denounced the 
hypocrisy of American democracy, which combined 
“this liberty without any limits on one side, and this 
abject servitude on the other.” The French were 
struck by the tension between, on the one hand, “this 
Congress where can constantly be heard the nice words 
independence, human dignity, imprescriptible rights, 
and the inviolability of natural rights” and, on the 
other hand, the miserable condition of the black pop- 
ulation.” If some critics pointed out that equality re- 
mained a mere ideal in a country with extreme forms 
of inequality, others went further to claim that Amer- 
ica had too much equality, and was developing, to use 
the words of Théodore Jouffroy, “a form of mediocre 
civilization.”~ 

The French also focused on the allegedly uncivilized, 
mercantile, and materialistic character of American so- 
ciety. Benjamin Saint-Victor, who visited America in 
the early 1830s, noted: “The main question here (and 
it’s the alpha and omega of life), is to gain money, and 
then to use this money to gain ever more. ... The entire 
world does not seem enough to satisfy to their cupidity” 
(Saint-Victor 1835, 26-27).”3 In the eyes of the hedonis- 
tic French, American life was monotonous and coarse 
because it lacked poetry and savoir-vivre. “America,” 
claimed La Mennais, “is struck by the plague of com- 
merce. The outcome is a spirit of calculation that stifles 
or weakens all generous sentiments.” Saint-Victor 
(1835, 30-31) deplored the restless competition and 
frantic commercial life that created “this perpetual 
and boundless movement of all cupid passions, the 
endpoint of all the worried thoughts of this multitude 
of people, who constantly turning their heads to the 
ground, exhaust themselves under the sun in order to 
build a treasure or to increase without measure the one 
that they have already amassed.” 

An interesting critique was advanced by Victor 
Jacquemont, who visited America in 1826-27, five years 
before Tocqueville. Educated in the pro-American 
milieu of the Ideologues that inspired Jefferson, 
Jacquemont came to America without any precon- 
ceived ideas. He was enthusiastic about learning En- 
glish and refrained from passing superficial judgments 
about the country. Yet, toward the end of his jour- 
ney, he could not conceal his dissatisfaction with the 


19 Marchais and Dupont as quoted in Rémond 1962, 668. 

2 Saulnier as quoted m Rémond 1962, 737, n. 41. 

Zi The words are from a discourse by Lamartine from 1835 as quoted 
in Rémond 1962, 741, n. 62. 

7 Jouffroy as quoted m Rémond 1962, 721. 

B Saint-Victor’s second letter described Amencan mores in detail. 
It is worth pointing out that Samt-Victor’s letters, written in 1832-33 
from America, were published in France in 1835, at the same time as 
Volume One of Tocqueville’s Democracy in America. 

% La Mennais as quoted in Rémond 1962, 763-64, n 3. 
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American way of life. “I dislike the American mores,” 
he wrote in a seminal, 40-page letter to Victor de Tracy 
in September 1827; “their aspect, sometimes severe, 
and then not entirely lacking in nobility, is most often 
cold, flat, and vulgar” (Jacquemont 1885, 153). Jacque- 
mont referred to the great American leaders as repub- 
licans by principle and aristocrats in their mores and 
concluded that, for all its virtues, the American way of 
life did not provide a civilized type of liberty. 

A similar critique appeared in Stendhal’s On Love 
(originally written in 1821), in which he drew a bal- 
anced, if ultimately equally unflattering, portrait of 
daily life in the United States. “All their attention,” 
wrote Stendhal (1975), referring to Americans, “seems 
to be concentrated on a sensible arrangement of the 
business of living, and on foreseeing all mishaps” (164). 
But these efforts come at a high price: “When at last 
they reach the point of harvesting the fruit of so much 
care and orderly planning, they find no life left with 
which to enjoy.” Yet, unlike other critics, Stendhal did 
not see American mores as rude. On the contrary, 
it was their politeness and rationality that he found 
problematic and unsatisfying. He acknowledged that 
America had a free government that “does its citizens 
no harm, butrather gives them security and tranquility” 
(163). But in Stendhal’s opinion, this was not enough to 
create true happiness; there is political happiness and 
true happiness and the two do not always coincide. The 
spirit of American liberty, concluded Stendhal (163), is 
“a coarse spirit” that gives the people the illusion of 
being happy simply because they enjoy security and 
tranquility. Yet, happiness means much more than be- 
ing free from the interference of a bad, harmful, and 
incompetent government. 

More importantly, the unflattering picture of Amer- 
ican mores led to the calling into question of some of 
the political principles underlying American democ- 
racy. Thus, one commentator claimed that American 
government was not as cheap as it had previously 
been argued.” Other French observers of American 
political life deplored its extreme form of partisan 
politics that triggered violent attacks on one’s oppo- 
nents and overheated electoral campaigns. President 
Jackson’s own political style was also criticized. By 
reflecting on the recurrent episodes of popular vio- 
lence such as the popular revolts of September 1834, 
the French observers of American politics feared that 
law-abidingness in America was waning and concluded 
that the American people were impulsive and bel- 
licose. Last but not least, the weakness of the fed- 
eral government, which had previously been seen as 
a plus, came to be interpreted as a major weak- 
ness. This perception fueled the image of a democ- 
Tacy prone to degenerate into mobocracy, a regime 
that chronically engendered violence and risked de- 
generating into popular autocracy. These themes 
loomed large in Tocqueville’s letters from the 1840s 
and the 1850s, in which he pointed to inadequate trials 





B See Saulnier’s critique m Rémond 1962, 690-91. 
26 For more details, see Rémond 1942, 698-703; 711-12. 
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and summary executions, lynching, and flagrant viola- 
tion of individual rights. 

All these arguments account for the growing percep- 
tion that came to dominate French public opinion in the 
late 1830s that America was, in fact, a degenerating so- 
ciety. The country no longer had the brilliant statesmen 
that had fought for its independence from England and 
had been instrumental in drafting its Constitution. It 
was around this time that the French came to believe 
that the presence of slavery was going to cause the 
breakup of the Union. 


TOCQUEVILLE’S VIEWS OF AMERICA 
BEFORE 1840 


By the time Tocqueville finished writing Volume One 
of Democracy in America, the viability of the Ameri- 
can political model was seriously questioned in France. 
Serious doubts emerged concerning the possibility of 
transplanting it onto French soil as well as regarding its 
own effectiveness. America, it was argued, could not 
serve as a political model for France. 

Tocqueville participated in this French debate. In his 
published writings as well as in his private correspon- 
dence, he advanced a series of arguments that were 
meant to respond to the ideas of his contemporaries. 
In 1835, Tocqueville presented a relatively optimistic 
image of American institutions and constitution. In 
memorable pages, he described how Americans prac- 
ticed the art of self-government, how they felt attached 
to their townships and participated in their administra- 
tion. More importantly, Tocqueville demonstrated that, 
contrary to what its critics argued, the sovereignty 
of the people could be channeled into effective and 
regular institutions that promote the common interest 
and the common good. Although the unusually long 
chapter that concluded Volume One of Democracy in 
America voiced a series of concerns about the proba- 
ble future of the country, Tocqueville did not suggest 
that the stability of the country itself was at risk. He 
foresaw the gradual and inevitable disappearance of 
the native races and predicted that the destiny of the 
blacks would be intermingled with that of the white 
population. “The most dreadful of all the evils that 
threaten the future of the United States,” surmised 
Tocqueville (2000, 326), “arise from the presence of 
blacks on its soil.” Although in a certain part of the 
country the legal barrier that separated the two races 
tended to fall, he was convinced that slavery remained 
a formidable challenge because the prejudice to which 
slavery had given birth was unchanged. 

The twin issues of race relations and slavery were not 
the only topics discussed by this chapter. Tocqueville 
also analyzed the relations between the federal govern- 
ment and the states and noted that the federal power 


27 The following statement appeared in Le bonhomme Richard, pub- 
lished in September 1832: “It is in vain that some try to offer as 
remedies to our problems these institutions which are shaky and 
rotten on the very next day followmg ther bith” (as quoted in 
Rémond 1962, 713). 

28 For a comprehensive analysis of this topic, see Mitchell 2002, 
132-81 
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was decreasing at the time he visited the country. He 
claimed that “Americans have much more to expect 
and to fear from the state than from the Union.... 
The federal government...is...by its very nature a 
weak government” (Tocqueville 2000, 352-53). Fur- 
thermore, Tocqueville expressed optimism regarding 
the survival of the republic in America, by which he 
meant that form of government based on “the slow and 
tranquil action of society on itself” (379). Because the 
institutions of the country were essentially republican 
and enjoyed “a sort of consensus universalis” (382), 
Tocqueville believed that it would be extremely diffi- 
cult to found a monarchy or an aristocracy in the United 
States. In spite of his musings about the tyranny of the 
majority, Tocqueville’s words offered a reassuring con- 
fidence in the future of the country, based on his image 
of America as an accomplished and mature democracy, 
capable of overcoming its challenges (slavery, territo- 
rial expansion, the growth of population), destined to 
rise to the status of the world’s premier power, and 
expected to serve as a political model for the rest of the 
world. The idea of an immature and unstable American 
democracy that loomed large in his later letters did not 
appear in Volume One. 

Published in 1840, Volume Two of Democracy in 
America qualified this optimistic view by introducing a 
series of new themes and concerns about the demo- 
cratic individual. Yet Tocqueville’s thoughts on the 
shortcomings of democracy did not call into question 
the viability of the American political model, which, 
in his opinion, remained a mature and solid one. He 
seemed less interested in making predictions about the 
future of American democracy than in drawing the 
portrait of the democratic individual living in the age of 
equality. In spite of his distrust of the bourgeoisie and of 
his anxieties about the pernicious effects of rampant in- 
dividualism in modern democratic society, Tocqueville 
did not question the foundational virtue of the Amer- 
ican middle class, the famous “self-interest rightly un- 
derstood.” The vices of individualism, he argued, can be 
successfully mitigated and combated by religion, free 
institutions, and local freedoms. Tocqueville (2000) also 
noted that l’intérét bien entendu “forms a multitude 
of citizens who are regulated, temperate, moderate, 
farsighted, master of themselves” (502) and concluded 
that self-interest rightly understood, although prevent- 
ing some individuals from mounting far above the or- 
dinary level of humanity, prevents many more from 
falling below it. 

Two things are worth noting in Tocqueville’s anal- 
ysis in Volume Two of Democracy in America. First, 
he suggested that the Americans were moderate and 
responsible individuals, who knew well their interests 
and rights, were capable of controlling their passions, 
and were not easily swayed by mob flatterers or dem- 
agogues. The idea that democracy might chronically 
engender violence and turmoil or that the market might 
become uncontrollable precisely because of the grow- 
ing multiplicity of interests seems rather implausible 
based on Tocqueville’s account that ascribed to Ameri- 
cans common sense, temperance, and moderation. Vol- 
ume Two of Democracy in America dealt surprisingly 
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little with the perils of prosperity and the limits of 
abundance (one finds more details about these issues 
in Tocqueville’s rich correspondence). ? 

Moreover, Tocqueville had little to say here about 
the possibility of the economic sphere invading the 
political realm and corrupting its institutions. Yet, in 
Volume Two of Democracy in America, he discussed 
the possibility of the rise of an “industrial aristocracy” 
in democratic America. This new type of manufactur- 
ing aristocracy, he argued, would be different from the 
old one, since its goal would be only to make use of 
the population rather than to govern the country. “It is 
one of the hardest [aristocracies] that has appeared on 
earth,” surmised Tocqueville (2000, 532), “but it is at 
the same time one of the most restrained and least dan- 
gerous.” All things considered, in Tocqueville’s opin- 
ion, the possible emergence of an industrial aristocracy 
was not enough to call into question the image of an 
accomplished American democracy. For the author of 
Democracy in America, this was to change after 1850, 
as he became more and more concerned about the 
consequences of the unbridled spirit of enterprise at 
the core of the American way of life. 

Second, in spite of his dislike of the French rising 
bourgeoisie of his time, Tocqueville was more or less 
silent about the limitations of the American middle 
class. He praised the latter’s outstanding spirit of en- 
terprise and showed how democracy favors new de- 
velopments in industry and multiplies the opportunity 
for various lucrative enterprises. He wrote sine ira et 
studio about the independent spirit of the (American) 
middle class, its restlessness, and the multiplicity of its 
(economic) enterprises, which he saw as a distinctive 
characteristic of democracy that opens up new vistas 
and careers to all individuals, regardless of their sta- 
tion in life. To be sure, one will find in Tocqueville’s 
balanced interpretation neither the vitriolic attacks 
on the boundless cupidity of the Americans that un- 
derlay the conservatives critiques of America such as 
Saint-Victor’s Letters from the United States of America 
(1835) nor the profound dislike for the American 
vulgar mores and the prosaic American way of life 
expressed by Victor Jacquemont and Stendhal in the 
1820s. 


TOCQUEVILLE’S VIEWS OF AMERICA 
AFTER 1840 


In spite of Tocqueville’s cautionary words about the 
possibility of the emergence of a new type of soft 


B See, for example, Tocqueville's correspondence with Royer- 
Collard, published in Tocqueville 1971. 

© It is worth pointing out that Tocqueville was also the author of 
a “Memoir on Pauperism,” written in early 1835 after his visit to 
England In this essay, which was primarily directed at England, 
Tocqueville addressed the paradox of the then most opulent coun- 
try, England, having the greatest number of paupers A defender of 
private charity, he argued against pubhc charity, which, in his view, 
had a series of unfortunate unforeseen consequences. Nonetheless, 
Tocqueville did not detect the same problems in America, which, 
for him, was the country of the middle class For more details, see 
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democratic despotism, Volume Two of Democracy in 
America offered the image of a relatively solid and 
stable democratic regime that, for all its inherent short- 
comings, managed to reconcile successfully the de- 
mands for equality, liberty, and justice. 

Tocqueville did not publish a sequel to Democracy in 
America or any major statement on American democ- 
racy between 1840 and his death in 1859. Fully in- 
volved in the French parliamentary politics of his time, 
Tocqueville’s research interests eventually shifted to 
studying the Old Regime and the origins of the French 
Revolution.” But what happened with Tocqueville’s 
interest in America? Did he ever change his views on 
American democracy? What were his views of Ameri- 
can democracy in the last years of his life as the country 
elected a new president (Buchanan) and was preparing 
for the Civil War? 

To answer these questions we turn to Tocqueville’s 
correspondence with his American friends after 1840 
and we also examine his participation in the consti- 
tutional debates of 1848 in France. Due to the inde- 
fatigable work of Françoise Mélonio and her collab- 
orators, we now have access to more than 100 letters 
that Tocqueville sent from October 1840 until his death 
to a number of his American friends (alas, a few let- 
ters written by the American correspondents were 
lost). Tocqueville’s addressees were mostly individuals 
whom he and Beaumont had met during their voyage 
to America. The group included Jared Sparks (1789~ 
1866), the president of Harvard University and editor 
of Washington’s and Franklin’s papers; Franz Lieber 
(1800-72), editor of the Encyclopaedia Americana 
and professor at South Caroliniana College; Edward 
Everett (1794-1865), ambassador to London, Secre- 
tary of State, and senator; Theodore Sedgwick (1811- 
59), who provided Tocqueville with essential informa- 
tion while he was writing Volume One of Democ- 
racy in America; Charles Sumner (1811-74), a Harvard 
law professor and Republican senator; and E. L. and 
E. V. Childe, members of an American family living 
temporarily in France, with whom Tocqueville had 
close relations in the 1850s. 

Some of Tocqueville’s letters are short and limit 
themselves to brief comments on political events in 
both America and Europe. Many deal with the mun- 
dane matters of everyday life (planned visits, illnesses 
and deaths within the family, and other such matters 
that figure prominently in the correspondence between 
friends as they get old). A regular topic of inquiry was 
the economic stability of the country (as Tocqueville 
owned shares in a number of American railway com- 
panies and was worried about them). Other letters, 
however, belong to the best letters he ever wrote and 
give us an invaluable opportunity to gaze into his mind 
and soul, touching as they do upon important issues 
regarding the evolution of American political institu- 
tions and the future of the Union. They raise a whole 





31 For a recent and orignal analysis of the writing of The Old Regime 
and the Revolution, see Gannett 2003. 

2 For an excellent new collection of Tocqueville's letters, see 
Tocqueville 2003. 
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series of new concerns and perspectives on America. 
References are made to an emerging American imperi- 
alism, the excesses of American democracy, the decline 
of mores and the rise of lawlessness, the revolutionary 
fervor of American politics, poor political leadership, 
and the reckless spirit of American capitalism. Lurking 
behind all of these issues was Tocqueville’s growing 
concern about the issue of slavery and the impending 
breakup of the Union as the country entered a new 
turbulent era paving the way for the Civil War. 

Tocqueville’s growing doubts about the vitality of 
American democracy were fueled by the political de- 
velopments that occurred in the late 1840s and during 
the 1850s such as the Kansas—Nebraska bill, the elec- 
tion of Buchanan as president, and frequent cases of 
mob violence and lawlessness. These events led him 
to utter harsh words about the excesses and corrup- 
tion of American democracy, harsher in any case than 
what one finds in either of the two volumes of Dernoc- 
racy in America. Tocqueville’s letters of this period 
show that he called into question the very image of 
an accomplished and successful American democracy, 
the conclusion of his two-volume work on the United 
States, Finally, in the 1850s, amidst growing skepticism 
about America’s political future, Tocqueville came to 
regard the corruption of American institutions and 
mores (along with slavery) as a major threat to Amer- 
ican democracy. It was in this context that Tocqueville 
began reflecting on the invasion of the political by the 
economic sphere as well as on the inevitable shortcom- 
ings of market society, two topics that had not loomed 
large in his previous writings. 

In his letters, Tocqueville reaffirmed his interest in 
American politics and administration. During the first 
half of the 1840s, he remained confident that American 
political institutions offered the best combination of 
liberty and security and continued to believe that the 
future of liberty in the world depended to a signifi- 
cant extent on the preservation of a democratic repub- 
lic in the United States. Describing himself as “half 
Yankee,”* Tocqueville argued that, in spite of tem- 
porary misunderstandings and rivalries between the 
two countries, France and the United States were two 
natural allies united by profound ties that needed to be 
strengthened. The implicit suggestion was that Amer- 
ica could serve as a political model for France. “Noth- 
ing interests me more than everything that concerns 
this great and powerful nation [America],” he wrote 
to N. Niles in 1843 (Tocqueville 1986, 88-89). “I have 
broken a bit the chain of my relations with the United 
States lately. I regret it and would like to renew these 
relations....One of the foundations of my politics is 
that, in spite of preconceptions and small quarrels, 
France and the United States are two allies so natu- 
ral and so necessary to each other that they should 
never lose sight of each other for a single moment.” 
Tocqueville’s ongoing interest in America is docu- 
mented by another letter sent to Lieber in September 
1844, in which he claimed that America was not only 


3 Letter to E. V. Childe, December 12, 1856, ın Tocqueville 1986, 
185. 
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the freest nation in the world, but also that in which the 
individuals were most protected from the interference 
of society in their daily lives (93). 

That Tocqueville remained convinced that the Amer- 
ican model had something to offer to France is also 
illustrated by reference to his remarks in the power- 
ful constitutional debates that accompanied the birth 
of the Second Republic in 1848.4 Those debates en- 
capsulated the revolutionary obsession with unity and 
equality and repeatedly returned to the hostility of the 
republicans toward both bicameralism and federalism, 
each of which had become intimately associated with 
the cause of counter-revolution. The genius of France, 
it was frequently stated, was “unitary” and there could 
be no grounds for dividing up the sovereign will of 
people by reference to artificial institutional mecha- 
nisms designed to thwart its wishes. More specifically, 
both bicameralism and federalism were associated with 
aristocratic power and, in the eyes of their opponents, 
could be easily associated (and therefore discredited) 
by reference to the English constitution. 

Tocqueville opposed these views both in parliamen- 
tary debates and in the discussions of the constitutional 
commission. Recognizing that his views would never 
secure majority support, he nonetheless persisted in ar- 
guing that France had much to learn from the American 
experience (Tocqueville 1990, 82). All of this was within 
a perspective that, at this early stage, saw Tocqueville 
expressing the view that one of the consequences of 
the 1848 revolution would be to give more liberty to 
individuals. In particular, Tocqueville opposed the pre- 
ferred republican model of a single chamber by ref- 
erence to the American system. There was, he pointed 
out (82-83), only one democratic republic in the world, 
America, and that two chambers. 

Tocqueville doubted that France could borrow indis- 
criminately from the American model, but made the 
curious point that America possessed 30 states “which 
are in a position similar to ours” and that each of these 
possessed two chambers. Not a single American, he 
remarked, wished the system to operate differently. 
Tocqueville was, in fact, echoing the views of his Ameri- 
can friends who also considered bicameralism as one of 
the main causes of the survival of a democratic republic 
in the United States. The presence of two chambers, it 
was argued, offered the best protection not only against 
mob flatterers and demagogues, but also against the en- 
croachment of the executive power. In a letter sent to 
Tocqueville in June 1848, his American correspondent 
J. C. Spencer emphasized all these points and added 
that the two chambers must also have different modes 
of election that would allow them to withstand better 
the whims of public opinion. The senators, elected for a 
longer time, can resist better the various demands of the 
masses and learn to form a healthy and sober second 
thought. “It is this second thought,” argued Spencer, 
“that has saved us from many disasters” (Tocqueville 
1986, 122, n. 2). Tocqueville agreed with this view. In 


H See especially Tocqueville 1990, 55-166. For a discussion of 
Tocqueville’s role, see Lamberti 1984 
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a letter to Everett from March 1849, Tocqueville de- 
fended bicameralism as a pillar of constitutionalism. 

Moreover, in the 1848 debates Tocqueville denied 
that the two chamber model used in America was 
an English invention. States such as Pennsylvania and 
Massachusetts had begun life with one chamber: “Pub- 
lic reason” had demonstrated the necessity of two. This 
example, Tocqueville remarked, is striking. Likewise, 
he disputed the claim that a two-chamber system auto- 
matically institutionalized aristocratic power, arguing 
that France required “as in America, the two cham- 
bers represent in a similar manner and through similar 
means, the same interests and the same classes of peo- 
ple.” He then outlined the three advantages of such 
a system—it prevented executive power from abuse, 
it strengthened the executive in its relations to parlia- 
ment, and it prevented “legislative intemperance”— 
commenting that “two chambers do not prevent rev- 
olutions but they prevent the bad government which 
leads to revolutions” (Tocqueville 1990, 83). 

Tocqueville’s next reference to the American model 
was in the context of the heated debate that sur- 
rounded the election and powers of the president. First, 
Tocqueville opposed the principle of reelection at the 
end of a President’s mandate. This, he acknowledged, 
was what existed in America, and even there it posed 
an increasing problem, but fortunately the American 
President possessed “little power.” In France, by con- 
trast, a President would undoubtedly abuse his power 
in order to secure election (Tocqueville 1990, 99, 106- 
7). Next, Tocqueville recommended the American sys- 
tem of electoral colleges as the best means of securing 
the election of the French president. 

On all these issues (and many more), Tocqueville was 
defeated by the majority republican party, intent on 
reliving the days of the Convention. For Tocqueville it 
was a deeply troubling experience, and one that left him 
convinced that France would again fall prey to political 
instability. He did not believe that the American model 
could be copied without due consideration of France’s 
own particularities, but he did believe that politicians 
in France should have the wisdom to learn from the 
experience and rules of a constitution that he did not 
hesitate to refer to as “a work of art.” This, as we shall 
see, was all the more remarkable given that he himself 
was having growing doubts about the direction in which 
America itself was moving. 

After 1848 Tocqueville’s confidence in the vitality of 
American institutions was going to be challenged by 
subsequent political developments that brought to the 
fore more than ever before the corruption of Amer- 


ican democracy. True, his own political experience in ` 


France made him prone to pessimism, as his own coun- 
try demonstrated once more that the French nation 
lacked a political tradition of moderation and practical 
sense. Tocqueville’s own tendency to discouragement 
and his declining health also played a certain role in 
deepening his sense of isolation that constantly fright- 
ened him. “J would have liked to shake off for good my 
tendency to discouragement,” confessed Tocqueville 
in an important letter to Madame Swetchine from 
January 1856 (Tocqueville 2002, 272). “I have had this 
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ailment, however, for most of my life... . I have worked 
hard to get rid of it for some years, and I have cer- 
tainly decreased its intensity.... You cannot imagine, 
Madame, how painful and often cruel it is for me to 
live in this kind of moral isolation, to feel excluded 
from the intellectual community of my times and my 
country. To be alone in the desert often strikes me 
as being less painful than being alone among men.” 
Although Tocqueville’s pessimism and tendency to po- 
litical spleen can be explained to a certain degree by 
his own physical and spiritual constitution (he was a 
truly restless mind in constant search for certainty and 
intellectual companionship), political developments in 
America and France contributed to his disillusionment 
and to his acute sense of distance from his contempo- 
raries.> 

In short, Tocqueville’s views of American democracy 
did not change independently of other events in the last 
decade of his life. Consumed by a permanent revolu- 
tionary fervor, the French proved to be incapable of 
changing the constitution through peaceful parliamen- 
tary reform. They were also unable to conceive long- 
term political plans that would reconcile the immediate 
goals of all political actors with the interests of the 
country at large. In the late 1840s and early 1850s, 
Tocqueville continued to believe in the practical sense 
of the Americans, as displayed by the country’s strong 
tradition of self-government, and expressed again and 
again his hope that, in spite of its problems, American 
democracy would survive and prosper.” Yet the tone of 
Tocqueville’s letters on America became more severe 
and pessimistic as he approached the end of his life and 
saw the future of liberty threatened and compromised 
in both France and the United States. 

He expressed serious concerns regarding the emer- 
gent American imperialism, which he described as a 
sign of the bad health of American democracy. For 
Tocqueville, the idea of conquering new territories to 
be annexed to the already existing ones was absurd 
because, in his view, the country had already acquired a 
vast territory. Stressing again his American credentials 
as “half American citizen,” Tocqueville (1986) wrote 
to Sedgwick on December 4, 1852: “As one of your 
compatriots, I have not seen without apprehension this 
spirit of conquest and even a bit of rapine, which has 
been on display among you for several years. It is not 
a sign of good health for a people which already has 
more territories than it can fill” (146-47). A related 
problem in Tocqueville’s eyes was the growing immi- 
gration to America that brought into the country a 


35 For a statement of Tocqueville's (2003) sense of moral isolation, 
also see his letter to Madame Swetchine of October 20, 1856 (1218). 
% See, for example, Tocqueville’s letter of February 15, 1850, to 
Everett (as well as Everett’s response) in Tocqueville 1986, 133-34. 
37 In an important letter of April 1857 to E. V. Childe, Tocqueville 
(1986, 193) wrote about the American people m the following terms. 
“Like all sovereigns, it naturally loves courtesans and flatterers, but I 
have enough confidence m its practical sense to believe that in times 
of crises ıt would better place its trust. At least, I hope so, because I 
passionately desire that the great experience of self-government that 
is currently going on in America succeeds It it fails, political liberty 
on earth would be finished for ever.” 
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considerable number of individuals who did not be- 
long to the English race and had a different political 
culture. 

A week later, on December 11, 1852, Tocqueville 
wrote to Sparks that American democracy had to fear 
nothing but from its own excesses. He also argued that 
an immoderate spirit of adventure, risk, and conquest 
would endanger the stability of democratic institutions. 
The American democracy, wrote Tocqueville (1986, 
148-149), “has nothing more to fear than from itself, 
from the abuse of democracy, the spirit of adventure 
and conquest, from the sense and exaggerated pride 
in its strength, and from the impetuosity of youth.” 
The solution recommended by Tocqueville was moder- 
ation: “I cannot recommend the virtues of moderation 
enough. Nations, as much as individuals, have need 
of it” (149). What is remarkable in these passages is 
Tocqueville’s emphasis on the excesses of democracy, a 
theme that was in stark contrast with the earlier image 
of an accomplished and stable American democracy, 
more or less immune to chronic turmoil and violence. 
In 1852, Tocqueville realized much better than in 1835 
or 1840 that it is in the very nature of democracy itself, 
even in the most advanced democratic regime in the 
world, to tend to transgress its limits and to subvert its 
own foundations. 

To be sure, the cacophonic spectacle of American 
political life in the early 1850s only fueled his increas- 
ing skepticism and provided numerous examples of 
the growing corruption of its democratic institutions. 
The image of a law-abiding and tranquil citizenry that 
Tocqueville presented in Volume One of Democracy 
seemed a relic of the past or, at best, a mere theoretical 
ideal. In a letter to Sedgwick from September 19, 1855, 
Tocqueville (1986) denounced “the violent, intolerant, 
and lawless spirit” (163) manifested in some parts of 
the country. He reiterated the same concern two years 
later, in a letter to Lieber from October 9, 1857, in 
which he deplored the acts of personal revenge and vio- 
lence that tended to subvert the authority of the law and 
the judicial system in America. The point is illustrative 
of Tocqueville’s concern with the rapid degradation of 
mores that he had previously described as the pillar of 
American democracy. Tocqueville (215) suggested that 
if political parties offered the spectacle of a ruthless and 
rude competition for power, punctuated by uncivilized 
electoral campaigns and unacceptable attacks on one’s 
political opponents, it was because political mores and 
habits had become violent and rude. 

This conclusion allowed Tocqueville to make a rather 
surprising claim that cannot be found in Democracy 
in America. The deterioration of mores, he argued, 
fostered a certain form of revolutionary fervor and 
the country seemed to be heading toward revolution. 
“What is certain,” Tocqueville (1986) wrote in 1856 
(echoing the image of America in Europe at that time), 
“is that in Europe the idea that you are approaching 
rapidly the time of revolution gains wide acceptance 


38 See, for example, Tocqueville's letter of Angust 6, 1854, to Beau- 
mont, in Tocqueville 1967, 228-29. Also see Tocqueville's letter of 
October 14, 1856, to Sedgwick, m Tocqueville 1986, 182. 
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and spreads very quickly” (183). Tocqueville stopped 
short of endorsing this prevailing opinion. Yet certain 
doubts had already entered his mind, and they seemed 
strong enough to call into question some of his pre- 
viously held opinions about the stability of American 
democracy. 

Furthermore, confronted with the spectacle of a 
democracy that tended to transgress its limits and to 
engender chronic violence, Tocqueville came to rethink 
what made America original. In 1835, Tocqueville had 
stressed self-government and the successful alliance 
between the spirit of liberty and the spirit of religion. 
In 1840, he had emphasized the Cartesianism, indi- 
vidualism, and religious beliefs of Americans and had 
shown how self-interest rightly understood underlay 
their actions. In the later letters, the story was some- 
what different, as religion no longer played a major 
role in his account of American democracy and was 
conspicuously absent from his correspondence with 
his American friends.*? Although Tocqueville earlier 
believed that religion properly practiced could provide 
the foundation for sound mores and a virtuous form 
of materialism in an age of individualism and skepti- 
cism, by the 1850s this belief seems to have waned. Put 
simply, in a situation where he sees growing evidence 
of “popular disorder,” Tocqueville no longer makes 
mention of America’s originality in combining liberty 
and religion.” 

Rather, Tocqueville now claimed that America’s par- 
ticularity lay in combining a certain degree of prim- 
itivism with the most advanced elements of civiliza- 
tion. This primitivism was manifested by the extreme 
forms of the spirit of adventure and risk fostered by 
the growing abundance and the limitless opportuni- 
ties for gain, the same elements that Tocqueville had 
previously considered as the causes of Americans’ 
restlessness. In an important letter to Sedgwick from 
October 14, 1856, Tocqueville (1986) expressed his con- 
cern regarding “this race of anxious gamblers to which 
your prosperity, in a land that is half empty, has given 
birth, a race which combines the passions and instincts 
of the savage with the tastes, needs, vigour and vices of 
civilized men. The world, I think, has never seen any- 
thing like it before. Who can say where this might lead 
if ever they gain the upper hand in your affairs?” (182- 
183). What is particularly striking in this passage is the 
suggestion that the endless quest for increased comfort 
and material prosperity is the most important cause of 
these excesses. Far from softening human nature, as 
Tocqueville argued in Volume Two of Democracy in 
America, abundance fosters, in fact, an excessive spirit 
of enterprise, adventure, and risk by arousing bound- 
less desires and fueling an unbridled materialism. Also 


3 It is worth pointing out that in an important letter to Corcelle 
from July 29, 1857, in which Tocqueville (2003) spoke admiringly of 
England, he praised that country’s “perfect accord between religious 
and political morality” (1253). A smilar reference to the seminal 
relationship between religion and liberty in England can be found in 
Tocqueville’s (2003) letter to Kergolay from August 4, 1857 (1256; 
translated in Tocqueville 1985, 355-57). 

4 Fora recent analysis of Tocqueville’s views of religion, see Antoine 
2003. 
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implicit in this passage is the idea that the market can 
hardly be controlled once free reign is given to individ- 
ual ambitions and interests. The inevitable outcome of 
this development is the perversion of the political that 
is gradually invaded by the economic and turns into 
an arena for a chaotic display of personal ambitions, 
rivalries, and interests. 

Tocqueville did not draw, however, the conclusion 
that this was the stage of an advanced democracy. On 
the contrary, he came to believe that this was the symp- 
tom of a young and immature democracy that, like 
a child unable to control his emotions and passions, 
takes the liberty of indulging in its own whims and 
desires without considering their pernicious long-term 
effects. In taking up the issue of the alleged immaturity 
of American democracy—a more or less new theme 
in his writings—Tocqueville only returned, in fact, to 
the metaphor of Guizot, who in 1834 had referred 
to America as a young and inexperienced society. 
Tocqueville’s reference to Hobbes’s puer robustus in 
a letter to Sedgwick from August 29, 1856, is revealing 
in this regard. “What is certain,” Tocqueville (1986) 
wrote, “is that, for some years now, you have strangely 
abused the advantages given to you by God which al- 
low you to commit great errors with impunity. Viewed 
from this side of the ocean, you have become the puer 
robustus of Hobbes. By being so, you distress all the 
friends of democratic liberty and delight all of its op- 
ponents” (177). America, Tocqueville wrote in 1856, 
risked disappointing the hopes of millions of people 
for a better future, because it offered in reality the 
disquieting spectacle of an unstable regime led by in- 
competent and dishonest leaders,“ relying on corrupt 
institutions, and incapable of controlling the excesses 
of the spirit of enterprise and adventure. 

In a series of letters, Tocqueville also raised the issue 
of the physical composition of America. In the letters 
he wrote to Sedgwick in 1854, he spoke of the lack of 
public life in Germany, attributing this to the historic 
absence of free institutions in that country. America, 
Tocqueville went on, was annually receiving around 
140,000 German immigrants. How could they possibly 
be assimilated into American life?** Two years later, 
again in a letter to Sedgwick, he took up this theme 
once more, this time sensing the future breakup of the 
Union. The large number of immigrants now present in 
American society made it impossible to talk of the na- 
ture of America. “Unfortunately,” Tocqueville (1986, 
177) commented, “each new day brings to you so many 
foreign elements that soon you will no longer be your- 
selves.” 

These concerns for America’s future, however, paled 
into insignificance by the side of those he voiced con- 
cerning slavery. Here we must remember that upon his 
election as a member of the Chamber of Deputies in 
1839, one of the principal responsibilities allotted to 
Tocqueville had been that of drafting a law abolish- 


41 Writes Tocqueville (1986), “The greatest plague of America, after 
slavery, is the government of the country by the least honest, if not 
the least capable part of the nation” (192-93) 

© See Tocqueville 1986, 156-59. 


ing slavery in the French colonies. The report that he 
prepared was subsequently translated and published in 
America, leading him to hope that it might have some 
useful impact, especially in the South. In the context of 
this report he wrote to Sparks, on October 13, 1840, that 
“slavery has on your soil roots that are more profound 
and more enduring than anywhere else” (Tocqueville 
1986, 83). It was a source of great sadness to him that 
America had incorporated slavery into itself in the way 
that it had. In his view, it was because these roots were 
so deep that slavery would survive longer in America 
than elsewhere. Indeed, he predicted that neither he 
nor Sparks would see its end. 

Tocqueville’s profound concern about the impact of 
slavery continued long after his retirement from ac- 
tive politics. This was made abundantly clear when, 
in 1856, he published an open letter in The Liberty 
Bell. Describing himself as “the persevering enemy of 
despotism everywhere,” Tocqueville (1986) indicated 
in the strongest terms that he was “pained and aston- 
ished” by the fact that “the freest people in the world 
is, at the present time, almost the only one among 
civilized and Christian nations which yet maintains 
personal servitude” (163-64). Slavery, he went on, re- 
tarded America’s progress, gave support to her detrac- 
tors, and threatened the future of the Union. As a man, 
and not just as “an old and sincere friend of America,” 
he was moved by “the spectacle of man’s degradation 
by man” and hoped to see the day when the law would 
grant equal civil liberty to all (163-64). Less than a year 
later, Tocqueville struck a different tone. The election 
of Buchanan as President significantly increased his 
fears that slavery would be extended. Such a possi- 
bility, he indicated, could only be viewed as “one of 
the greatest crimes that men can commit against the 
general cause of humanity” (189-90). 

It was this theme that came to dominate the remain- 
ing letters he wrote to his American friends prior to his 
untimely death in April 1859. For example, although 
he appeared to agree with Sedgwick that the greatest 
danger facing the Northern states was political corrup- 
tion, the entire prospect of the extension of slavery 
filled him with horror and despair. Distancing himself 
from the abolitionist position” (being fearful of the 
consequences of a policy that he described as “prema- 
ture and dangerous”), he believed that the extension 
of slavery would constitute “a crime against humanity” 
(Tocqueville 1986, 194-95). He could not accept that a 
great people had no right to express an opinion about 
the extension of what Tocqueville did not hesitate to 
describe as “the most horrible of all social evils” (195). 
Fortunately, he commented, he himself did not have 
to face the terrible dilemmas that such action would 
entail for the Union. 


© An analysts of Tocqueville’s position on colonization, abolition, 
and emancipation goes beyond the scope of this article. In 1835, 
Tocqueville became a member of the French Society for the Abolition 
of Slavery and later called for the abolition of slavery in the French 
West Inches. Yet, unlike radical French abohtonists, he argued that 
the state ought to give the former slave masters some form of indem- 
ification. For more details on this issue, see Tocqueville 2001. 
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Increasingly, however, Tocqueville felt that the 
Union was in danger. An equivalent level of political 
agitation and passion would have been such, he re- 
cognized, as to lead to civil war in Europe. That same 
year he identified the cause of the Union with “that 
of liberty across the world” (Tocqueville 1986, 206). 
To the end, he continued to hope that slavery would 
be vanquished. Nevertheless, it is hard not to conclude 
that the debate about the possible extension of slavery 
added to the mounting political pessimism that charac- 
terized the final decade of Tocqueville’s life. 


CONCLUSION 


If Tocqueville’s ideas have become extremely influen- 
tial today among academics and elected politicians, it 
is also true that the richness of his writings has some- 
times generated a certain tendency to over praise his 
work that turned the Frenchman into a form of intel- 
lectual guru. Reacting against this trend, a number of 
perceptive critics of Tocqueville have raised a series 
of important questions regarding the accuracy of his 
analysis of American democracy. 

For example, in his essay “Beyond Tocqueville, 
Myrdal and Hartz,” Smith (1993) argued that “the 
Tocquevillian story is deceptive because it is too nar- 
row” (549). In Smith’s opinion, the relevance of 
Tocqueville’s interpretation is limited by the fact that 
he used “categories derived from the hierarchy of po- 
litical and social statutes men have held in Europe” 
(549). As such, concluded Smith, Tocqueville’s analysis, 
at the heart of which lies the concept of the equality of 
conditions, fails to give due consideration “to the ine- 
galitarian ideologies and conditions that have shaped 
the participants and the substance of American politics 
just as deeply” (549). 

The examination of Tocqueville’s (often neglected) 
correspondence with his American friends after 1840 
tells a different story, however, and one that is in 
some ways more poignant than the account offered 
by the two volumes of Democracy in America. The 
Frenchman did not overlook the issue of slavery and 
he was not oblivious to the fact that the equality of con- 
ditions extended only to a part of the population. The 
perceptive letters sent to his American friends clearly 
show that Tocqueville’s image of American life was 
neither “deceptive” nor too narrow. It is both ironic 
and telling that toward the end of his life, Tocqueville 
reached a set of conclusions about American democ- 
racy that differed from those he drew in Volumes One 
and Two of Democracy in America. Twenty years ear- 
lier, he had seen “in America more than America” 
(Tocqueville 2000, 13), the image of democracy itself, 
and wanted to find lessons there from which everyone 
in his native country could profit. Tocqueville was fully 
convinced that the future of democracy and freedom 
in the world depended on the success of the demo- 
cratic experiment in the United States. He also be- 
lieved that the development and progress of American 
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democracy responded to a secret design of Providence. 
Tocqueville’s subtle and elegant analysis of American 
mores and institutions sought to convince his compa- 
triots that American democracy was stable and orderly 
and represented a great step forward for the cause of 
justice in the entire world. 

For all his nuanced and sometimes critical portrait 
of the democratic individual, it can be argued that 
Tocqueville did not radically change his views 
in Volume Two of Democracy in America (compared 
to Volume One), although his focus eventually shifted 
from what was distinctively American to what was 
democratic in general. This is not meant to under- 
state the originality of Volume Two of Democracy, in 
which he expressed uneasiness about the mediocrity of 
democratic life, bent on commerce and imbued with 
utilitarian values, and wanted instead to see more indi- 
viduals capable of conceiving loftier plans than to gain 
money and to make their lives more comfortable. As 
illustrated by the last book of Volume Two of Democ- 
racy in America, in the late 1830s Tocqueville (2000) 
believed that the greatest dangers to democracy were 
not anarchy and the ensuing collapse of political and 
social order, but a new form of “democratic” despo- 
tism, one that strikes the mind rather than the body and 
transforms all citizens into a mass of docile individuals 
in perpetual need for an enlightened tutor who relieves 
them “entirely of the trouble of thinking and the pain 
of living” (663). The great importance that he attached 
to the topic of democratic despotism is conveyed by a 
letter to Royer-Collard in which Tocqueville wrote: “It 
is true that I am now at the most difficult and delicate 
place in the whole work....I sense that I am treating 
here the most important idea of our time. To be 
sure, the main idea of Tocqueville’s analysis in the last 
book of Volume Two of Democracy in America is that 
democratic despotism is the inevitable companion of 
centralization and the outcome of the multiplication of 
the functions and agencies of government demanded 
by the growing equality of conditions. 

Yet the final image of American mores and insti- 
tutions presented in Volume Two offered a few sig- 
nificant reasons for optimism, in stark contrast to the 
more negative assessments of American democracy of- 
fered in his post-1840 correspondence. In his book, 
Tocqueville offered four important conclusions that 
are worth restating. First, the instincts and passions 
of democracy ought to be constantly moderated and 
held in check because democracy goes hand in hand 
with materialism, tyranny of the majority, rampant in- 
dividualism, and centralization. Second, this task of 
moderating and purifying democracy can be success- 
fully accomplished by a regime such as the Ameri- 
can one.“ Third, Tocqueville believed that “political 


45 See Tocqueville 1971, 67. The letter is also quoted in Schleifer 
2000, 212. 

46 In his notes, Tocqueville wrote: “Use democracy to moderate 
democracy. It ıs the only path to salvaton that 1s open to us. To 
discern the feelings, the ideas, the laws which, without being hostile to 
the principle of democracy, can nonetheless correct its troublesome 
tendencies” (quoted in Schleifer 2000, 234). 
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liberty is the greatest remedy for nearly all the evils 
with which equality menaces man.” Fourth, he ar- 
gued that because “there is only Democracy (by this 
word I understand self-government) which can lessen 
and make bearable the inevitable evils of a democratic 
social state...it is necessary with all one’s might to 
hasten to give enlightenment and liberty to people who 
have already such a social state”“*® These points show 
that Tocqueville arrived at the conclusion that soft 
(bureaucratic) despotism and centralization of power 
posed much greater threats to democracy than unruly 
legislative assemblies, power-driven individuals, or the 
process of industrialization. 

How can one explain, then, that a decade and a 
half later, Tocqueville came to doubt some of these 
conclusions and claimed that the hopes pinned on 
American democracy and freedom had been severely 
compromised? Volume Three of Democracy in 
America, if it had ever been written, would have mir- 
rored his disenchantment and skepticism and would 
have called into question some of the most significant 
ideas of his widely acclaimed book that, according to 
Tocqueville’s critics, presented an unduly embellished 
image of America. The high stakes and implications of 
a possible failure of the American model for Tocquevil- 
le are unambiguously illustrated by a letter sent to 
Lieber on September 1, 1856, in which he expressed in 
memorable words his disenchantment with American 
political life. “I have passionately desired to see a free 
Europe,” wrote Tocqueville (1986, 179) 


and I realize now that the cause of true liberty is more 
compromised today than it was during the epoch when 
I was born. I see around me nations whose souls seem to 
weaken as their prosperity and physical force grow, nations 
which remain, to borrow Hobbes’s phrase, robust children 
who deserve only to be treated by means of the stick and 
the carrot. Your America itself, to which once turned the 
dream of all those who lacked the reality of liberty, has 
for some time in my view given little satisfaction to these 
friends. 


The implicit assumption is that America should re- 
main exceptional and must continue to offer a hopeful 
example to all those who lack liberty. The reference 
to the cause of “true liberty” being discredited by po- 
litical developments in Europe shows the magnitude 
of Tocqueville’s disappointment and disillusionment 
in his last years of his life. It is well known that he 
wrote Democracy in America for French readers in the 
hope that they would profit from the lessons taught by 
the American democracy. The unflattering remark in 
the passage above about America’s alleged failure to 
fulfill the dream of all the friends of liberty suggests the 
extent to which Tocqueville was still seeing in America 
more than America. What is so striking, however, in 
the letters written in the decade following his retreat 
from political life is a growing and generalized pes- 
simism that touches all aspects of his political thinking. 
Tocqueville expressed a series of doubts about 


47 Tocqueville as quoted m Schleifer 2000, 232-33. 
* Tocqueville as quoted in Schleifer 2000, 200, 233 
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American political life and mores that were strikingly 
similar to the doubts of his predecessors. 

His letters to his American colleagues were at once 
perceptive and despairing about the character of the 
Second Empire as well.” All of those, he commented, 
“who had received a liberal education and who have in- 
volved themselves either directly or indirectly in public 
affairs understand and clearly see that, in the name of 
the sovereignty of the nation, all public liberties have 
been destroyed, that the appearance of a popular elec- 
tion has served to establish a despotism which is more 
absolute than any of those which have appeared before 
in France” (Tocqueville 1986, 144). As he neared the 
end of his time, there appeared little cause for optimism 
that the cause of liberty would triumph and now not 
even the inspiration derived from America remained.” 

The degree of Tocqueville’s disenchantment can only 
be explained by the magnitude of his hopes. When writ- 
ing Volume One of Dernocracy in America, he swam 
against the current of French public opinion as America 
lost its previous Arcadian image and was regarded in 
France as a decaying society. Tocqueville revealed to 
his compatriots a different image, that of a healthy 
and prosperous society, which, according to one of his 
critics, Victorien Sardou, was an idealized or “sweet- 
ened” version of America, “l'Amérique en sucre.”*! 
By drawing upon his case study, Tocqueville suggested 
that democracy should and could be moderated and 
channeled into regular institutions that promote the 
common good and successfully reconcile the demands 
for equality and liberty. The American model, he ar- 
gued, showed that this aim could be achieved thor- 
ough a combination of healthy and stable mores, fed- 
eralism, self-interest rightly understood (that softens 
the negative effects of extreme individualism), reli- 
gion, and the art of association. Twenty years later, 
he feared, however, that he might have been mistaken 
and that his critics might have been right after all. In 
Tocqueville’s (1986) opinion, American public and po- 
litical life was dominated by individuals “who lacked 
moderation, sometimes probity, above all education” 
(227) and resembled mere political adventurers, vio- 
lent, gross, and devoid of principles. The institutions 
in place were able to do relatively little to stave off 


“9 It 1s interesting to note that m spite of his disillusionment about 
French politics in the late 1850s, Tocqueville (2003, 1266) refused to 
believe that despotism was the normal and definitive state of French 
society. The dismal state of affairs in French politics under Napoleon 
MI did not affect Tocqueville's strong belief in political liberty. For 
more evidence regarding his passion for liberty and his views on 
French society, see Tocqueville's important letter to Beaumont from 
February 27, 1858 (1292-96). An English translation can be found in 
Tocqueville 1985, 365—70. 

2 Tocqueville similarly came to express some doubts about the one 
other country that had attracted French liberals of his generation: 
England. Writng at the time of the Crmean war, he commented 
on the extent to which England suffered from “the weakness and 
incapacity of its government” (Tocqueville 1986, 163). Yet in some 
of his letters to Corcelle, Beaumont, and Kergolay written in 1857, 
Tocqueville expressed greater appreciation for England, rts political 
culture, and its tradition of self-government. For more details, see 
Tocqueville 2003, 1255; 1260. 

31 Sardou as quoted in Roger 2002, 88. 


The Third Democracy 


August 2004 





this growing corruption. As such, American politics 
appeared to Tocqueville more and more as an arena 
for brute instincts and appetites, a point that, oddly 
enough, had been made by Cerise, one of Tocqueville’s 
reviewers, in 1835. 

What a disappointment must have felt the man who 
once claimed to have perceived in America more than 
America and who, in the last years of his life, saw the 
cause of freedom compromised not only in Europe but 
in the United States as well! We should be grateful, 
then, to the vagaries of French political life (which 
absorbed his energies and occupied his attention in 
the last two decades of his life) that Tocqueville never 
came to write Volume Three of Democracy in America. 
Reading the correspondence with his American friends 
after 1840 gives us a bitter taste of what the message of 
that volume might have been. 
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Madison v. Hamilton: The Battle Over Republicanism and the Role 


of Public Opinion 


COLLEEN A. SHEEHAN Princeton University and Villanova University 


article examines the causes of the dispute between James Madison and Alexander Hamilton 
in the early 1790s. Though Hamilton initially believed that Madison’s opposition to the Federalist 
administration was probably motivated by personal animosity and political advantage, in later 
years he concluded what Madison had long argued: the controversy between Republicans and Federalists 
stemmed from a difference of principle. For Madison, republicanism meant the recognition of the 
sovereignty of public opinion and the commitment to participatory politics. Hamilton advocated a more 
submissive role for the citizenry and a more independent status for the political elite. While Madison 
did not deny to political leaders and enlightened men a critical place in the formation of public opinion, 
he fought against Hamilton’s thin version of public opinion as “confidence” in government. In 1791— 
92 Madison took the Republican lead in providing a philosophic defense for a tangible, active, and 
responsible role for the citizens of republican government. 


he feud between James Madison and Alexander 

Hamilton that began early in the Washington 

administration left a lasting impression on the 
American political landscape. It led to the formation 
of the first political parties in the United States, to the 
decisive victory of the Republicans over the Federal- 
ists in the election of 1800, and to the establishment 
of participatory politics in the American republic. Al- 
though it is one of the most noted political battles of 
American history, the cause of the dispute remains to 
this day a source of controversy among scholars. In 
1792 Hamilton himself was unclear about the reasons 
for the quarrel, expressing surprise at Madison’s sys- 
tematic opposition to his fiscal program. After all, they 
not only had worked in tandem to produce The Fed- 
eralist Papers, but also had spent considerable time at 
the outset of the new government exchanging ideas 
and friendly advice. They must have appeared to those 
around them, and to themselves as well, as political 
allies. What, then, occasioned the divergence between 
them? Was the quarrel grounded in a difference of 
principle, or was it merely personal or political in the 
ordinary sense of the term? 

The political battles of the 1790s between the pri- 
mary co-authors of The Federalist have often been 
viewed by scholars within the context of the “Jefferson 
v. Hamilton” thesis regarding the origins of Ameri- 
can party politics. This interpretation owes it origins to 
Hamilton’s initial assessment of the feud. At the com- 
mencement of the new government, Hamilton claimed, 
there existed a similarity of thinking between him and 
Madison. Despite their disagreement on debt discrim- 
ination and the assumption of state debts at the outset 
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of the new government, Hamilton remained disposed 
to believe in Madison’s honesty, fairness, and goodwill. 
By the spring of 1792, however, he became convinced 
that Madison acted in cooperation with Jefferson, that 
he was actuated by “personal and political animosity” 
against him, and that his character was in fact subtle, 
complicated, and artificial in a way that the Treasury 
Secretary had not previously understood (Hamilton 
1961—87 [hereafter PAH] X1:432—34). Either Jefferson 
had so influenced Madison that the latter had under- 
gone a material change of mind or Madison was simply 
a common political calculator, pursuing measures to 
feed his own political popularity and/or the advantage 
of his particular state. 

Despite Hamilton’s initial speculations, he later ac- 
knowledged what Madison had long claimed—that the 
war between Republicans and Federalists stemmed 
from a difference of principle. “[I]n reality the foun- 
dations of society, the essential interests of our nation, 
the dearest concerns of individuals are staked upon 
the eventful contest,” Hamilton wrote in 1801 (PAH 
XXV-:352—53). “[T]he contest between us is indeed a 
war of principles,” though not a war “between monar- 
chy and republicanism” but “between tyranny and lib- 
erty” (PAH XXV:370). Hamilton’s modification of his 
earlier perspective is often overlooked by scholars, per- 
haps because it is easy to see it as just another partisan 
shot at his political opponents. Yet this is precisely what 
Hamilton warns his contemporaries against: Those who 
persist in seeing the conflict as nothing more than zeal- 
ous partisanship and a struggle for power are deceived. 

Hamilton’s more mature and, I would argue, more 
trenchant assessment of the party contest provides 
a valuable insight into the democratic implications of 
Madison’s and the Republicans’ agenda. By 1801, and 
probably earlier, Hamilton recognized that Madison’s 
opposition to him and the Federalists was propelled 
by a fundamental philosophic disagreement over the 


1 See, for example, McDonald’s endorsement of this thesis (1979, 
175, 199-200, 254, 1974, 80-81). In contrast, the more recent schol- 
arship of Banning (1995), Elkins and McKitnck (1993), and Read 
(2000) views the battle as a real disagreement over constitutional 
and political ideas. 


405 


Madison v. Hamilton 


nature and role of public opinion in a republic. Tied 
to Madison’s and Hamilton’s differing perspectives 
on public opinion were conflicting interpretations of 
the Constitution and divergent visions of America’s 
economic future. These disagreements between the 
two leading Publii shattered their Roman alliance of 
1787—88. 

In 1791—92 Madison took the lead in providing a 
philosophical defense of the republican opposition to 
Hamilton’s policies.? Madison’s opposition to the per- 
petuation of the debt, the national bank, and govern- 
mental support of manufactures were tied together by 
a single philosophic principle—the sovereignty of pub- 
lic opinion. His battles against the Neutrality Procla- 
mation and the Alien and Sedition Acts later in the 
decade stemmed from the same philosophic source. In 
Madison’s mind, the principle of popular sovereignty 
meant the recognition of the supremacy of the Con- 
stitution, understood and administered in a manner 
consistent with the sense of the people who ratified 
and adopted it. It also meant the ongoing sovereignty 
of public opinion, which requires the active partici- 
pation of the citizenry in the affairs of the political 
community. For Madison, public opinion was not the 
sum of fleeting passions and uneducated views, nor 
was it merely an aggregate of the sentiments of the 
populace. The “public” was not a mythical entity and 
public opinion was not a disembodied theoretical con- 
struct reflecting a “ghostly body politic” (Miller 1988). 
Rather, public opinion was the tangible product of a 
time-consuming process of communication and delib- 
eration throughout the community, grounded in and 
reciprocally influencing the minds and mores of the 
people. Like that of other eighteenth century theorists 
of the subject, Madison’s conception of public opinion 
must be sharply distinguished from the current one 
which equates public opinion with the results of daily 
polling aggregates. 

Madison did not believe that participatory poli- 
tics ends with the constitutional ratification process, 
the amendment process, or even elections. Rejecting 
Hamilton’s and the Federalists’ narrow dependence 
on the wealthy few to produce political stability and 
strength, Madison advocated the formation of an en- 
lightened public voice that would control and direct 
the measures of government. Hamilton feared that the 
Republican agenda embraced the naive democratic op- 
timism of his age, that in fact it had close connections 
across the seas to the “vain reveries of a false and new 
fangled philosophy” of the French Enlightenment. In 
contrast, he advocated a less active, more submissive 
role for the citizenry and a more energetic and indepen- 
dent status for the executive and his administration. For 
him, public opinion was the reflection of the citizens’ 
“confidence” in government. While Madison did not 
deny to political leaders and enlightened men a critical 





2 See Madison’s “Party Press Essays,” identified m PJM as “Es- 
says for the National Gazette” (PJM 14:117—22, 14:137—39, 17:559— 
60, 1302-10, 14.170, 14:178—-79, 14:191-92, 14:197-98, 14.201— 
2, 14.206—-9, 217-19, 14:233-34, 14:244—46, 14:257—-59, 14:266—68, 
14°274—75, 14:370-72, 14:426—27). 
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place in the formation of public opinion, he fought 
against Hamilton’s thin version of the politics of pub- 
lic opinion. In opposition to the Hamiltonian view of 
an economically distracted and politically subservient 
people, Madison advanced the image an active and re- 
sponsible citizenry with a substantial role in republican 
government. 

Both Madison and Hamilton considered the con- 
test between Republicans and Federalists to be one 
that would essentially determine the character and fate 
of republicanism in America. The ultimate victory of 
the Republicans meant the triumph of the Madisonian 
commitment to the sovereignty of public opinion and 
participatory republicanism in the United States. The 
outcome of the battles of the 1790s had far-ranging 
implications for the future of democracy in America 
and the West, as Tocqueville ({1835 and 1840] 2000, 
166—70) recognized and astutely analyzed a genera- 
tion later. Although Madison’s particular conception 
of participatory politics was intended to circumvent the 
problem of majority tyranny, it nonetheless encouraged 
the communication of the citizens’ views and the for- 
mation of a united public voice, thereby widening the 
path of opportunity for the power of public opinion. In 
Hamilton’s view, this threatened the checks on majori- 
tarian politics contrived by the framers; it asked more 
of the people than they could realistically contribute 
to political life. Madison too was well aware of the 
potential dangers associated with majority opinion— 
surely no one of the Founders was more mindful of 
such dangers. Nevertheless, he consciously took upon 
himself the role of chief philosophic architect and po- 
litical coleader of the republican effort to institute the 
politics of public opinion in America. 

The extent to which government should be influ- 
enced by public opinion is a perennial question of 
American politics and a central question of democratic 
theory. According to contemporary “deliberationist” 
theorists, the respect due to public opinion depends 
on whether the processes and conditions of political 
communications produce an informed and reasonable 
public opinion. As Benjamin Page and Robert Shapiro 
(1992) argue, the process of forming public opinion 
through “collective deliberation is essential to the re- 
alization of democratic ideals” (363). Madison would 
have agreed. Indeed, he was the first democratic the- 
orist in America to make explicit the central impor- 
tance of public opinion to free government and the 
conditions that are needed for its proper formation 
and articulation. Despite its centrality to his political 
analysis, Madison’s theory of public opinion has been 
neglected by many political scientists and historians. In 
fact, Madison is often attributed with virtually the op- 
posite view on the subject than the one he actually held. 
The Founders, including Madison, “tended to take the 
idea of deliberation in an elitist direction, disdaining 
public opinion and attempting to insulate leaders from 
it,” Page and Shapiro contend (1992, 363; c£ Jacobs and 
Shapiro 2000, 299). 

The issue of the respect due to public opinion was 
at the core of the disagreement between Madison and 
Hamilton in the 1790s. It stamped their divergent views 
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of the expectations for the new constitutional order 
they helped to frame and the new nation they were 
working to build. The disagreement shook the foun- 
dations of the nascent political order and gave defi- 
nition to the challenge of self-rule in America. Their 
insights and analyses concerning public opinion are no 
less relevant to contemporary American citizens than 
they were to citizens of the early republic. In fact, with 
extraordinary advances in communications technology 
over the past few decades, the potential power of pub- 
lic opinion in the United States is today at its historic 
height. Yet, as Daniel Yankelovich has perceptively 
noted, in our age there is little attention given to how 
we might identify and enhance the quality of public 
opinion. There is a critical difference between “mass 
opinion” and “public judgment,” Yankelovich (1991, 
15—23) argues, and while we “have learned a great 
deal about how to measure public opinion (and how 
to manipulate it) [we]...have almost nothing to say 
about how to improve it” (Yankelovich 1991, xi—xii). 

Yankelovich, Shapiro, Page, and others have sparked 
a renewed concern in our day over the quality of 
civic understanding and the content of democracy in 
America. At the same time, however, they tend to di- 
vorce the idea of a rational public from the substantive 
moral content of public judgment. Both Madison and 
Hamilton would have considered such an approach 
insufficient to the achievement of republican ends. 
In contrast, they consciously sought to overcome the 
problem of majority tyranny and anchor public opinion 
in the moral principles of republicanism, albeit with 
competing visions about the locus of its substantive 
content, the mode of its formation, and the extent of 
its influence on government. 


MADISON’S OFFENSE 


Beginning in October 1791 and continuing through 
December of the following year, Madison published 
a series of 19 articles in Freneau’s newly established 
National Gazette (Sheehan 1990, 356, et passim). In 


3 Although Yankelovich (1991) clams to consider the ethical as 
well as the cognitive dimensions of public opinion, the lack of a 
moral standard by which to measure the quality of pubhc opinion 
1s lacking in his discussion. Rather, he defines the quality of public 
opinion by its degree of firmness and consistency and the public’s 
willingness to take responsibility for the consequences of it views 
(5, 24). Page and Shapiro (1992) justify their claim that collective 
pubHc opinion 18 “‘reasonable,’ ‘responsible,’ and ‘rational”’ on the 
basis of its “general stability, differentiation, and coherent patterning 
of collective policy preferences, and. responsiveness to new situ- 
ations and new information” (388) The authors concede, however, 
that even if public opinion is stable and predictable, this does not 
“dispose of the Founders’ concern that majority opinion might be 
dangerous to ‘rights”’ or that some demands of the majority might 
be “‘amproper or wicked”’ (438). “But in our secular times,” they 
argue, “skeptical of absolutes and sensitrve to trade—offs, it is not easy 
to specify rights that deserve complete protection against majority 
Tule.” The unwillingness of many contemporary political theorists 
to make a substantive moral distinction between just and tyrannical 
public opinion undermines the defense of popular government and of 
the United States Constitution set forth by Madison and Hamilton, 
leaving it with no greater claim of right than any other form of 
government. 
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these Party Press Essays Madison attacked certain 
policies of the administration as “anti-republican” and 
presented his alternative “republican” conception of 
government. Although Hamilton is never mentioned 
by name, his role in initiating measures such as the 
funding system, the national bank, and governmen- 
tal support of manufactures is clearly implicated in 
the alleged trend toward monarchy or aristocracy in 
America. Only a few years earlier at the Constitu- 
tional Convention Hamilton had remarked that in his 
“private opinion” he considered the British govern- 
ment to be the best in the world and doubted whether 
anything short of it would secure good government 
in America (PAH IV:192; cf. IV:184, 200, 204, 207). 
Madison had listened to and recorded this daylong, 
rather brazen speech favoring a high-toned govern- 
ment for America, just as he noted Hamilton’s endorse- 
ment of the British practice of “influence” and “corrup- 
tion” in government (Koch 1966, 131—32, 175). Perhaps 
all this would have been forgotten or chalked up to 
savvy political maneuvering had not Hamilton’s public 
deeds and unguarded words later revealed otherwise. 
At the legendary dinner party hosted by Jefferson and 
attended by John Adams and Alexander Hamilton in 
April 1791, Hamilton once again demonstrated how 
audacious he could be. In response to Adams’s pedan- 
tic remarks on the near-perfection of the British consti- 
tution, which, he said, needed only to be purged of its 
corruption and equality of representation established 
in its popular branch, Hamilton’s riposte must have 
tested the bounds of his host’s civility: “Purge it of 
its corruption, and give to its popular branch equality 
of representation,” Hamilton purportedly said, “and 
it would become an impracticable government: as it 
stands at present, with all its supposed defects, it is the 
most perfect government which ever existed” (Koch 
and Peden 1972, 126). Almost certainly Jefferson’s 
good Madeira was flowing at table that spring evening, 
loosening Hamilton’s already sassy tongue, and just as 
surely Jefferson repeated Hamilton’s provocative re- 
marks to his friend Madison the next time they talked. 

Whether Hamilton actually sought to establish 
hereditary distinctions in America was not the cen- 
tral issue—though some Federalists probably did, 
and Hamilton’s financial program played into their 
schemes. Regardless, Hamilton’s program provided the 
chief impetus toward new-modeling the American gov- 
ernment on the British system. Hamilton’s measures 
were “more accommodated to the depraved exam- 
ples” of monarchy and aristocracy than to the genius 
of republicanism, and, whether intended or not, might 
well “smooth the way to hereditary government” in 
America (Madison 1962—91 [hereafter PJM] 14:274). 
In contrast to Jefferson’s accusations of monarchism 
leveled against Hamilton, Madison’s implicit attacks 
on the Treasury Secretary in the Party Press Essays 
are more circumspect; they are couched in terms of the 
impetus or tendency of Hamilton’s measures toward the 
establishment of a British-style system in the United 
States. By early 1791 Madison saw a pattern emerging 
in the administrative measures Hamilton avidly advo- 
cated, and by the end of that year, with an advance 
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copy of Hamilton’s “Report on Manufactures” in hand, 
Madison’s worst fears had been realized. Hamilton 
meant, by administrative fiat, to undermine the Con- 
stitution as ratified and adopted by the American peo- 
ple and to alter the substance, and perhaps the form, 
of American republicanism. In Madison’s perspective, 
Hamilton’s funding system, the national bank, and 
governmental support of manufacturing were linked 
together in a clever scheme that mimicked the British 
financial system and, if successful, would increase the 
powers of the national government and establish a 
powerful and influential monied class in America. The 
pages of the “Report on Manufactures” revealed to 
Madison that Hamilton intended nothing less than the 
transformation of the economic and political life of 
America. 

Madison believed that the foundation of Hamilton’s 
plan rested in measures that invested the national gov- 
ernment with “influence,” thereby enabling it to dis- 
pense money and emoluments (PJM 14:427, 371, 233). 
This was accomplished by the institution of a funding 
system, which would continue to provide the source for 
political influence as long as the debt was perpetuated. 
He suspected that Hamilton intended to fund the debt 
in perpetuity (PJM 13:106, 317; c£ 15:474, 14:208, 274— 
75). Madison regarded the establishment of a national 
bank as an unconstitutional usurpation of power by 
the national government, believing it neither necessary 
nor proper according to the Constitution, though he 
fully recognized that it was a necessary element of 
Hamilton’s scheme to establish a class of wealthy indus- 
trialists who would wield political power in America. 
Taken together, the national bank and funded public 
debt encouraged a “spirit of speculation within and 
without the government” (PJM 14:274). Hamilton’s 
system of public finance appealed to the avidity of 
public officials, tempting them to substitute the motive 
of private interest in the place of public duty (PJM 
14:233). It encouraged a servile dependency on the 
British, whose discriminatory trade policies with the 
United States amounted to the continuing treatment 
of America as a colonial territory (PJM 14:164—65; 
17:559—-60). It directed governmental measures to the 
interest of the few, providing the monied men with 
irresistible opportunities for further enrichment (PJM 
14:371). The wealth accumulated by the frenzy of spec- 
ulative activity was to be channeled into the manufac- 
turing industry, again by an unconstitutional exercise 
of power. Governmental manipulation of the choice of 
occupations via the artificial encouragement of manu- 
factures would promote the interest of this class at the 
expense of other interests in the society, particularly the 
agricultural interest. Landholders would be burdened 
with arbitrary taxes while rich merchants were granted 
new and “unnecessary opportunities” to capitalize on 
their wealth (PJM 14:197). This show of partiality to 
the wealthy few, though touted as advancing the pros- 
perity and happiness of the nation as a whole, would 
in time, Madison argued, actually give “such a turn to 
the administration, [that] the government itself may 
by degree be narrowed into fewer hands, and approxi- 
mated to an hereditary form” (PJM 14:371). Designed 
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to simulate the practices of the British system, it would 
introduce corruption and venality into government and 
encouraged self-interest as its driving force. Madison 
contemptuously described this governmental model in 
“Spirit of Governments”: 


A government operating by corrupt influence; substitut- 
ing the motive of private interest in place of public duty; 
converting its pecuniary dispensations into bounties for 
favorites, or bribes to opponents; accommodating its mea- 
sures to the avidity of a part of the nation instead of the 
benefit of the whole: in a word, enlisting an army of in- 
terested partizans, whose tongues, whose pens, whose in- 
trigues, and whose active combinations, by supplying the 
terror of the sword, may support a real domination of the 
few, under an apparent liberty of the many. (PJM 14:233) 


Despite Montesquieu’s categorization of this type of 
government as a republic, Madison argued, it is in 
reality “an imposter.” Such a government is not yet 
“on the west side of the Atlantic,” and “it will be 
both happy and honorable for the United States, if 
they never descend to mimic the costly pageantry of its 
form, nor betray themselves into the venal spirit of its 
administration” (PJM 14:233—34). 

Madison believed that Hamilton’s measures were 
intended to reproduce the equilibrium of the British 
model, if not by the creation of hereditary class dis- 
tinctions, then by a mimetic equivalent that provided 
additional checks on the demos and presumably en- 
hanced the stability of the political order (PJM 14:197— 
98). This is a perverse understanding of the republican 
solution to the problem of parties, he argued in the 
Party Press Essay “Parties.” Since parties exist natu- 
rally in all political societies, legislators and statesmen 
must find ways to alleviate their baneful effects. The 
art lies in preventing or accommodating parties to 
the extent possible and, when not possible, making 
them mutual checks upon one another. By contrast, 
the notion of promoting the creation of new parties 
or strengthening existing ones, in order to achieve ad- 
ditional mutual checks in society, to add “more scales 
and...more weights to perfect and maintain the equi- 
librium,” Madison declared, is “absurd.” Though this 
is the theory that undergirds balanced government, it 
is not the republican way. Such a political model is 
analogous to promoting vices in ethics so that they may 
be used to counteract other vices, and it “is as little the 
voice of reason, as it is that of republicanism” (PJM 
14:198). 

Madison further pursued the faulty analysis that he 
believed underlay Hamilton’s (and Adams’s) praise 
of the British model with his direct critique of it in 
the essay, “British Government” (PJM 14:201—2). The 
“boasted equilibrium” of the British government, so 
far as it is even true, is not primarily due to “the form 
in which its powers are distributed and balanced” (PJM 
14:201—2). Stability and liberty are not secured by lim- 
iting the share of the people to a third of government 
and counteracting their influence by two grand heredi- 
tary orders with conflicting and hostile feelings, habits, 
and interests (PJM 14:427), or by any simulation of the 
British model of class warfare or party contestation. 
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The stability of the British government “is maintained 
less by the distribution of its powers, than by the force 
of public opinion” (PJM 14:201; cf. 11:298). The Feder- 
alists, he believed, failed to recognize the dominant 
role played by public opinion in the British system 
and, moreover, denied public opinion its rightful place 
as sovereign in a free polity. Instead of heeding the 
authoritative voice of the public, the antirepublicans 
demanded that the people simply have confidence in 
their government and submit to its acts (PJM 14:426— 
27). By promoting a political design that would make 
the government independent of the will of the public, 
they were denying the right of a republican people to 
govern themselves. 


HAMILTON’S DEFENSE 


Hamilton’s financial system consisted of three essential 
elements. First and foremost was the need to establish 
public credit in the United States. The initial step in 
accomplishing this was the establishment of an ade- 
quate system of funding the national debt. Whereas an 
unfunded debt is the object of excessive speculation, 
drains the nation of capital, and diverts funds from 
useful and productive industry, a properly instituted 
funding system supplies active capital in a country de- 
ficient in capital. Once public securities have acquired 
an adequate and stable value and the confidence of the 
community is established, the debt may serve as an en- 
gine of credit by promoting the transfer and exchange 
of funds. With additional capital in circulation, interest 
rates decrease; the stabilization of public stock mod- 
erates the spirit of speculation and directs capital to 
more useful channels. In Hamilton’s view, the depreci- 
ated condition of landed property in America resulted 
from the scarcity of money. The increased quantity and 
circulation of capital would contribute to improve the 
state of agriculture. Further, it would unclog the wheels 
of commerce, thereby promoting commerce and man- 
ufacturing as well (PAH VI:70—72). While Hamilton 
conceded that his program benefited the monied men 
of America, he denied that it created a special monied 
interest adverse to other citizens. Rather, he argued, 
investment in public stock promotes the economic 
growth of the nation, including all the useful indus- 
tries in which the citizens are engaged. Productivity is 
increased and employment rises, further increasing the 
active and actual capital of a nation. Industry in general 
flourishes, “and herein,” Hamilton declared, “consist[s] 
the true wealth of a nation” (PAH II:618). 

The second prong of Hamilton’s financial program 
involved the establishment of a national system of 
banking that would fortify the establishment of pub- 
lic credit. The institution of a national bank was in 
his opinion more than an optional supplement to the 
funding system. Whereas banks are “useful in Coun- 
tries greatly advanced in wealth,” he argued, they are 
absolutely “necessary in Countries little advanced in 
wealth” (PAH VIII:220). The advantages derived from 
a national bank include (1) the augmentation of the ac- 
tive and productive capital of the nation; (2) a greater 
facility by the government to obtain financial support, 


especially in times of emergency; and (3) the assistance 
in the payment of taxes (PAH V1I:306). A national 
bank increases the supply of active capital by its ability 
to lend and circulate greater amounts of capital than 
the actual sum of its stock in coin. For all practical 
purposes, then, industry and trade would receive an , 
absolute increase in capital infusion, and economic en- 
terprise would be enlarged. In this way, banks are “the 
nurseries of national wealth” (PAH VI1I:306). Hamilton 
defended the constitutional authority of the national 
government to establish a national bank on the grounds 
that the right to erect corporations is inherent in the 
very definition of government. The intent of the Con- 
stitution was not to be sought in the Framers’ intent, 
Hamilton believed, but in the “instrument itself,” based 
on established rules of textual interpretation that co- 
here with the “nature and reason of the thing” (PAH 
VIl:111; Hamilton, Madison, and Jay [1788] 1999, 
78:436 [hereafter FP]). In later battles with Madison 
over constitutional interpretation, Hamilton would 
consistently employ his understanding of the nature 
of government and the practical necessities of political 
life in construing the United States Constitution. In 
a private letter to Washington, Hamilton couched his 
case for the bank in the most practical terms: “[T]he 
most incorrigible theorist among [the bank’s] oppo- 
nents would in one month[’]s experience as head of 
the Department of the Treasury be compelled to ac- 
knowle[d]ge that it is an absolutely indispensable en- 
gine in the management of the Finances, and would 
quickly become a convert to its perfect constitutional- 
ity” (PAH XJ:251). 

It was in response to the third prong of Hamilton’s 
financial scheme that Madison mounted a full-scale 
opposition against his “anti-republican” program and, 
with his political allies, adopted the appellation, the 
“republican party.” Hamilton’s “Report on Manufac- 
tures” was premised on the idea that the accelerated 
growth of manufacturing in the United States was es- 
sential to the national interest (PAH 230—340). The 
manufacturing industry, Hamilton argued, enhances 
the produce and revenue of the community, contributes 
to the diversification and division of labor, increases 
employment and productivity by engaging persons not 
ordinarily working, promotes foreign emigration, fur- 
nishes a broader scope for the differing talents and 
dispositions of persons, increases the demand for agri- 
cultural produce, and makes the United States less de- 
pendent on foreign markets. Despite the clear and cer- 
tain economic benefits that the growth of manufactures 
would produce in the nation, this does not guarantee 
that it will naturally occur, or occur as quickly as the 
country requires. Human beings are creatures of habit 
and tend to adopt untried industries reluctantly and 
slowly. “To produce the desirable changes, as early as 
may be expedient,” he wrote, “may therefore require 
the incitement and patronage of government” (PAH 
X:267). The supply of active capital needed to encour- 
age manufacturing in the new republic was already in 
place via the funded debt and the national bank. Spec- 
ulation in public stocks could thus be directed to use- 
ful purposes and away from its sometimes pernicious 


409 


Madison v. Hamilton 


effects. Although the encouragement of manufactures 
in America would be disadvantageous to the other 
classes of society and to consumers in the short term, 
Hamilton argued that the long-term, permanent effect 
would be to the benefit of all classes of society and the 
nation as a whole. 

Hamilton’s economic program was designed to sta- 
bilize the fiscal situation of the country, stimulate pro- 
ductivity, and set America on the course of prodi- 
gious material prosperity. His intent was to establish 
the economic foundation on which political stability 
and greatness depended. He had no wish, he repeat- 
edly claimed, to establish monarchy or aristocracy in 
America or to introduce hereditary distinctions of any 
kind. That he was bent on corrupting a portion of the 
legislature he pronounced false and malignant. He re- 
buffed the charge that he was attempting to overturn 
the state governments or pervert limited government; 
there is a good deal of ambiguous ground concerning 
the demarcation between the general and the state 
governments over which honest men might disagree, 
he asserted. Finally, he flatly denied that he and the 
Federalists were conspiring to overthrow republican 
government in the United States, or even that their 
measures would tend to subvert the republican form 
or prepare the way for monarchy (PAH XII:248—53; 
cf. X1I:131—33). In exasperation Hamilton could only 
ask in regard to his opponents’ accusations: When ever 
were “men more ingenious to torment themselves with 
phantoms?” (PAH XII:209). 

Hamilton’s economic blueprint for America was de- 
signed to achieve both individual security and national 
strength. His conception of the connection between 
political stability and economic prosperity was pre- 
sented most explicitly in his daylong speech of 18 June 
at the Constitutional Convention. In societies where 
industry is encouraged, Hamilton argued, individual 
security is often threatened by the clash of the distinct 
and rival interests between the few and the many, i.e., 
between the wealthy, well-born, educated citizens and 
the mass of the people. If either one group has all the 
power, it will oppress the other. “Both therefore ought 
to have power that each may defend itself agst. the 
other” (PAH IV:192). Moreover, given the “violence 
& turbulence” of the democratic spirit, it is particularly 
crucial to establish a separate and permanent body to 
check the unsteadiness and imprudence of the mass of 
the people (PAH IV:185, 193, 200, 204). The principle 
of representation is not sufficient to resist “the popular 
current,” for the most popular branch of the legislature 
will predominate, and within it a few individuals tend to 
prevail (PAH IV:185). Dependent on the favor of the 
people for the continuation of their position and power, 
these leaders often sacrifice the permanent interest of 
the nation to the passionate and partial interests of the 
many. 

The problem of the force of majority faction is there- 
fore not solved by the representative principle. Nor 
is the difficulty overcome by the establishment of a 
government over a large extent of territory. Although 
representatives chosen from larger districts may be of 
some benefit, frequently a small portion of a large dis- 
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trict carries an election (PAH JV:166). The represen- 
tatives of an extensive nation still meet in one room 
and are liable to the same influences of those in a small 
country, including the charm of a powerful demagogue. 
The determinant influence of the size of a nation to 
deter the formation of majority faction is, Hamilton 
claimed, of doubtful veracity. Combinations on the ba- 
sis of interest will not be as difficult or unlikely as some 
may suppose. Geographical and economic factors can 
and will influence the people and their representatives, 
and “it is easy tsconceive a popular sentiment per- 
vading” one portion, even a major portion, of the leg- 
islature (PAH IV:165). In essence, Madison’s analysis 
of the problem of majority faction and his proffered 
solution of the extended republic and representation, 
which he presented on June 6 on the Convention floor 
and later summarized in the tenth Federalist, was inad- 
equate to the task of remedying the defects of popular 
government. In Hamilton’s view, Madison’s proffered 
solution was not a well-considered solution to the prob- 
lem at all. 

Hamilton contended that the problem of majority 
tyranny necessitates the establishment of a “perma- 
nent barrier” in government that would counteract the 
passionate demands of the many, particularly their cov- 
etousness toward the property of others (PAH IV:192). 
The British provided for this barrier in their House 
of Lords. Hamilton believed that an equally effec- 
tual check on the turbulent and changing multitude 
was needed in America. Accordingly, he proposed a 
Senate for life or during good behavior, arguing that 
the seven-year Senate term supported by some dele- 
gates, including James Madison, was not sufficient to 
answer the purpose sought.‘ But just as there ought 
not be too much dependence on the popular senti- 
ments, neither ought there be too little (PAH IV:214). 
Hamilton recommended a House of Representatives 
of enlarged numbers, elected directly by the people 
every three years. The two branches of the legislature 
would balance each other in terms of the many versus 
the few, turbulence versus inertia, and protection of 
equal rights versus security for property rights. One 
chamber manifests the “sensibility” of the populace; 
the other, “knowledge and firmness” in public affairs 
(PAH V:81). It is a kind of balance and “happiest mode 
of conciliating” contraries, anticipating Jane Austen’s 
felicitous equipoise of Sense and Sensibility. 

The two-weighted scale protects the few and the 
many from oppression by each other, thereby con- 
tributing to the security of individual rights. Hamilton 
advocated adding a third weight to the scale in the 
form of a single elected executive serving for life or 
good behavior. The executive would possess an abso- 
lute negative on legislation and, in turn, would himself 
be subject to counterbalancing checks by the legis- 
lature. Accordingly, the executive would provide an 
additional check against the passage of laws based on 
partial interest. In positive terms, Hamilton’s executive 
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was to serve as the dominant active agency in govern- 
ment. Characterized by unity, duration, and energy, his 
ambitions would be virtually one with the interests of 
the nation. He would move government to act with 
vigor, dispatch, and regularity, providing a sense of na- 
tional character, strength, and permanency of will. An 
independent judiciary supplements the checks against 
the legislature and its natural tendency to dominate in 
popular governments. This check on legislative power 
would further increase the proportionate authority of 
the republican executive. 

Hamilton’s central objective in his June 18 speech 
was to demonstrate the need for a “permanent will” 
in the government (PAH JV:186). His plan was partly 
modeled on the British constitution, particularly with 
regard to a balance between the two great and distinct 
interests in society and an energetic executive who em- 
bodies the interest of the nation as a whole. However, 
unlike the British model, Hamilton claimed that his 
plan was fully consistent with the principles of repub- 
licanism: In it “the Executive and Legislative organs 
are appointed by a popular Election, and hold their of- 
fices upon a responsible and defeasible tenure” (PAH 
XXV:537). Granted, subsequent to (indirect) election 
by the people, the Senate and Executive would be as 
far removed from popular will as republican principles 
would allow. A democratic assembly simply cannot be 
properly checked by a democratic senate, and both 
of these by a democratic executive, Hamilton argued 
(PAH XXV:537). Gouverneur Morris described the 
problem in earthier tones: 


[T]he members of both Houses are creatures which, 
though differently born, are begotten in the same way 
and by the same sire. ... The President can...do what he 
pleases, provided it shall always please him to place those 
who lead a majority of the Representatives. (Flaumenhaft 
1992, 186) 


Hamilton urged his colleagues to see that the only ef- 
fectual method to secure the ends of republican gov- 
ernment was to overcome the contest between the few 
and the many. Like a host of renowned thinkers be- 
fore him, Hamilton saw in the British constitution a 
model that effectually neutralized this struggle at the 
governmental level. He borrowed from the vaunted 
British model the idea of achieving an equilibrium of 
the predominant and rival passions and interests within 
the legislature, albeit without deriving the competing 
humors from an hereditary 

The key to the success of the British political system 
was the creation of institutions and practices that neu- 
tralized the destabilizing effects of the rival passions 
in society and at the same time utilized those passions 
to energize and bolster the government. Hamilton be- 
lieved that if the American republic was to succeed, 
it too must incorporate a political scheme that chan- 
nels men’s selfish passions and interests and utilizes 
them to support the government (PAH V:85). Besides 
force, Hamilton listed four other factors that prompt 
men to the support of government, viz., interest, opin- 
ion, habit, and influence (PAH IV:180). Of these, self- 
interest is “the most powerful incentive of human ac- 
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tion,” he argued, explicitly following Hume in his as- 
sessment of human nature (PAH 1:92). No regime de- 
rives benefit from neglecting to utilize this dominant 
force in man, Hamilton declared in 1775. He restated 
this idea at the Constitutional Convention: The key 
to constructing a stable and good government is to 
interest the passions of men and make them serve the 
public (PAH IV:187, 217). 

The conjunction between Hamilton’s economic and 
political philosophy occurs at two principal axes. First, 
Hamilton believed that economic diversification is nec- 
essary to the security of individual rights. Second, he 
held that economic prosperity leads to confidence in 
government, thereby providing the foundation for pub- 
lic strength. The diversification of occupations through- 
out the union, he predicted, would contribute signif- 
icantly to overcoming the rivalry between northern 
and southern interests, i.e., between industry and agri- 
culture, between free and slave-holding states (PAH 
X:293; see also Brookhiser 1999, 97). Economic diver- 
sification would help to control the problem of major- 
ity faction by diminishing the most powerful engine 
of faction in America—interests grounded in geo- 
graphic/occupational distinctions, Moreover, increased 
diversification would lead to a preponderance of mem- 
bers of the learned professions—especially the legal 
profession—in Congress. Unlike men of industry and 
agriculture, men of the professional ranks “form no 
distinct interest in society” and are likely to be impar- 
tial arbiters between the others (FP 35:183; see also 
Allen 2000, 167—74). Economic diversification also fu- 
els prosperity—and vice versa. Economic prosperity 
instills in the people an opinion of the benefit of gov- 
ernment to their own well-being and inspires in them a 
confidence in its measures. Public confidence in govern- 
ment stabilizes the regime and endows it with public 
strength. This is particularly true in republican gov- 
ernment, which, even more than other political forms, 
depends on opinion (PAH V:37). 

In 1787 the United States was predominantly an 
agricultural nation. To achieve Hamilton’s goals of 
economic diversification and prosperity meant that 
America must become a commercial republic. This 
transformation depended on the institution of his 
three-pronged fiscal program, beginning with the esta- 
blishment of public credit and a national bank and 
culminating in governmental support of manufactures. 
Accordingly, Hamilton sought to connect the interests 
of the monied men to the interests of the nation— 
an idea he never dispensed with. The first wave of 
his economic program depended on this connection. It 
would stabilize public credit, wean men from state at- 
tachments to support of the national government, and 


5 McDonald (1985, 137) claims that in his maturity, Hamilton re- 
jected the idea of tying the interests of the wealthy to the interest 
of government, pointing particularly to his 1795 “The Defence of 
the Funding System” (see PAH XIX:40—41; cf. 11:248) Hamilton’s 
argument in “The Defence,” however, more nuanced. Although 
Hamilton claims that the bonding of the interests of the monied men 
to the national interest was not his primary aim in his plan to fund 
the debt, indeed that it was the consideration upon which he relied 
the least, it was nonetheless included in his calculation. 
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provide the avenue for economic prosperity and the 
train of events that would usher in a new economic and 
political era in America. Like Montesquieu, Hamilton 
believed that in a republic, where all the passions are 
free and unmodified, it is natural that the passion 
for material aggrandizement dominates men’s souls. 
A commercial republic allows the passionate pursuit 
of economic gain and rewards it with success. Com- 
mercial prosperity multiplies “the means of gratifi- 
cation,” promotes the circulation of charming, shiny 
metals—“those darling objects of human avarice and 
enterprise”—and increases prosperity throughout the 
society (FP 12:59). The multiplication of the means of 
gratifying the acquisitive desire is much more the result 
of commercial prosperity than the mere size of the ter- 
ritory. By interesting the monied men in the prosperity 
of the nation, Hamilton sought to start a chain reaction 
that would promote the commercialization of the entire 
nation. The consequences of this economic metastasis 
were far reaching on the political front. By multiplying 
and diversifying occupations and interests in America 
the age-old battle between the haves and the have-nots 
would be replaced by a new and much less dangerous 
rivalry in society. The likelihood of a majority faction 
forming would be greatly reduced and the stability of 
the political order would be significantly enhanced. 
At the start of the second Washington administra- 
tion and the outbreak of war between France and 
Great Britain, Hamilton feared that the Republicans’ 
emotional attachment to the new French republic and 
animus against England could result in an American 
foreign policy that would destroy his entire financial 
program. Hamilton took measures to prevent this from 
happening. He was a major force behind Washington’s 
issuance of the 1793 Neutrality Proclamation, he de- 
fended the proclamation in a series of “Pacificus” 
essays, and in 1795—96 he published numerous pieces 
defending the Jay Treaty, which clarified and ensured 
continued commercial relations with Great Britain. 
Going head to head with Madison in the paper wars, 
he argued for a construction of the Constitution that 
recognized the conduct of foreign policy as essen- 
tially executive in nature; he allowed for the consti- 
tutional role of the Senate in making treaties and of 
the Congress in its power to declare war. He would 
not agree with Madison, however, that the constitu- 
tional powers granted to Congress delimit the consti- 
tutional and practical duties of the executive to con- 
duct foreign policy. Once again, in Hamilton’s mind 
the proper construction of the Constitution intersected 
with political and economic realities. As Stanley Elkins 
and Eric McKitrick (1993) have argued, for Hamilton 
the continuance of stable political relations and a dy- 
namic commerce with Great Britain were critical to 
America’s future (123-31). Hamilton considered 
America’s trade with Great Britain “absolutely essen- 
tial” to the success of his economic plan (Rakove 1990, 
118). Great Britain provided a major market for Amer- 
ican agricultural produce, and approximately three- 
fourths of U.S. imports came from Britain. American 
prosperity—and the civic confidence it inspired in gov- 
ernment—depended heavily on the revenues brought 
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into the United States Treasury from impost duties on 
British goods. If American dependence on commerce 
with England were to lessen with the rise of a diver- 
sified domestic economy, this would only occur over a 
period of time. Until then, a significant decline or loss 
of British trade would ruin the United States economy, 
destroy public credit, and shake the political founda- 
tions of the fledgling nation. The policy of commercial 
discrimination against the British—which Madison had 
been pushing for in Congress since 1789—would result 
in British retaliation against the United States and be 
devastating to the new nation. In a word, it would mean 
an end to the Hamiltonian dream of commercial great- 
ness for America. 

A commercial republic possesses the advantage over 
other forms of government, Hamilton believed, be- 
cause it tends “to interest the passions of the commu- 
nity in its favor [and] beget[s] public spirit and public 
confidence” (PAH IV:163). Hamilton viewed human 
nature as consisting of two very different types of men: 
the mass of men who are motivated largely by self- 
interest, and an exclusive class of men whose souls 
are dominated by the desire for distinction. Hamilton 
accepted the generality of human nature as it was and 
did not attempt to transform it into something it could 
not become. He relied on the average republican cit- 
izen to pursue his own economic advantage, neither 
expecting nor encouraging him to develop a public 
spiritedness unconnected with his perception of self- 
interest. The vast majority of citizens were not called 
to participate actively in the affairs of government, the 
extent of their peacetime responsibilities essentially 
limited to electing the better sort of men to political 
office and supporting the government they had chosen 
(PAH 111:102—3, 54445; cf. Flaumenhaft 1992, 15—16, 
216). Their attachment to the new American repub- 
lic, Hamilton believed, would result largely from their 
opinion of its necessity and utility. 

A train of prosperous events, brought about by a 
wise and energetic administration, would result in an 
attachment of the people to their government and in- 
still in them a confidence in its measures (PAH V:39— 
40). Indeed, “the confidence of the people will be eas- 
ily gained by a good administration,” Hamilton con- 
tended (PAH V:39). “Confidence” results largely from 
the gratification of men’s acquisitive desires, produc- 
ing habits of obligation and obedience to government. 
Since all governments, particularly free republics, are 
dependent on public opinion, the wise republican 
statesman will cultivate an opinion of confidence by 
promoting measures that gratify the average citizens’ 
passion for material gain, thereby increasing the stabil- 
ity and strength of the nation. In turn, the statesman 
himself is rewarded by the favor of public opinion, 
i.e., by the confidence and esteem of his fellow citizens, 
thereby gratifying his distinctive desire for fame. In this 
way the most powerful passions of the many and the 
ruling passion of the noblest minds are directed toward 
the support of government. 

Hamilton learned from Jacques Necker the im- 
portance of directing public opinion to the support 
of government by means of publicity, particularly 
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publicity in the area of national finance. Necker’s the- 
ory emphasized the influence of public ministers on 
public opinion to produce unity, confidence, and obe- 
dience to the government. “A skilful administration,” 
he wrote, “has the effect of putting in action those it 
persuades, of strengthening the moral ideas, of rousing 
the imagination and of joining together the opinions 
and sentiments of men by the confidence it inspires” 
(Necker [1784] 1785, L-xii). “Confidence” is “that pre- 
cious sentiment which unites the future to the present” 
and “lays the surest foundation of the happiness of 
the people” (Necker [1784] 1785, Lx). Hamilton took 
Necker’s advice and wrote prolifically for the public 
press in an effort to influence public opinion and inspire 
a spirit of confidence in the government and obedience 
to its measures. Although Hamilton believed that the 
citizens generally possess the ability to perceive their 
interests with sufficient clarity, he also recognized that 
they are sometimes misled by opinions built on false ap- 
pearances of the advantageous (Stourzh 1970, 92—93). 
During the 1790s his earlier sanguinity about the effects 
easily gained by a good administration was dashed by 
the successes of opponents who misjudged and misled 
the common man. 


HAMILTON’S OFFENSE 


Hamilton believed that systematic opposition to his 
economic measures was instigated by naive projectors 
and ambitious demagogues. Aaron Burr was clearly 
of the latter description (PAH XXV:321). Jefferson, 
Hamilton thought, had some of the demagogue in him, 
but was fundamentally a man whom nature had ill en- 
dowed with a “sublimated paradoxical imagination” 
(PAH XI:544). Having drunk too much of French 
philosophy, his “mind [was] prone to projects... 
incompatible with the principles of stable and system- 
atic government” (PAH XII:581, XI:439). Madison’s 
character was more subtle, complex and difficult to 
discern. In 1792 Hamilton accused him of changing 
his mind concerning the public debt. He was not en- 
tirely sure, however, about Madison’s motivations for 
the switch. Were personal animosity and the desire for 
political advantage the cause of Madison’s newfound 
opposition? Or had Madison fallen under the influence 
of Jefferson and undergone a sincere change of mind? 

In later years, Hamilton undoubtedly concluded that 
Madison was sincere in his attachment to the principles 
he espoused in the 1790s. He saw that Jefferson was not 
alone in his “vain reveries of a false and new fangled 
philosophy” and attachment to a “wild and fatal” po- 
litical scheme that would destroy sound government in 
America (PAH X11:249, XXVI1:740). Like the French 
writers from whose well of speculative philosophy they 
were imbibing, the Republicans were bent on a fanati- 
cism in political science that miscalculated the force of 
the human passions and was “unsuited to the nature of 
man” (PAH XXVI:739). They were simply “too much 
in earnest” about “democracy” (PAH XXV-319). Pros- 
trating themselves before the opinion of the majority, 
as if vox populi were vox dei, they encouraged a spirit of 


anarchy and flirted with tyranny, its natural ally. They 
stimulated the restless passions of the people and ex- 
cited a reckless censure, destroying public confidence 
in the government and its leaders (PAH XIII:394—95). 
Following in the path of their Jacobin cohorts, the Re- 
publicans worshiped at the altar of the “Goddess of 
Reason,” rejecting the “mild reign of rational liberty, 
which rests on the basis of an efficient and well palanced 
government” (PAH XXV:353, 370). 

Men are for the most part ruled by their EN 
Hamilton believed, and rather more “reasoning tha[n] 
reasonable animals” (PAH XXV:605). Yet his oppo- 
nents were intent on molding “a wise, reflecting and 
dispassionate people” (PAH XXV:605). They eulo- 
gized reason, but in reality they courted men’s vanities 
and cheated the people out of their confidence. Left 
unchecked, the Republican brand of politics would 
succeed in “corrupting public opinion till it becomes fit 
for nothing but mischief” (PAH XXV:605—6). More- 
over, they claimed for public opinion an unwarranted 
status and invoked its authority to circumvent the pre- 
scribed constitutional amendment process—the only 
legitimate channel of appeal to the people in their 
collective capacity (PAH XXV:606). The Republican 
politics of public opinion threatened to undermine all 
the hard work done by the men at Philadelphia in 1787, 
and the source of their new creed was none other than 
the fanatics of the French Enlightenment. Hamilton 
named names: 


In vain was the collected wisdom of America convened 
at Philadelphia In vain were the anxious labours of a 
Washington bestowed. Their works are regarded as noth- 
ing better than empty bubbles destined to be blown away 
by the mere breath of a disciple. of Turgot, a pupil of Con- 
dorcet. (PAH XXV-501) 


Whatever diminution of respect Hamilton had felt in 
the early 1790s for the force of Madison’s mind and 
soundness of his judgment, a decade later his opinion 
of the Virginian’s political sagacity sunk lower still. 
From Hamilton’s perspective the loss of Madison as a 
political and philosophic ally must have been a genuine 
disappointment. This was the mind that had conspired 
with him at the Convention, penned with him The 
Federalist, and seemed to understand, if not fully, at 
least better than most of his colleagues, the age-old 
dilemma of the few versus the many and the republican 
road that could overcome it. 


MADISON’S DEFENSE 


Madison believed that, if successful, the Treasury Sec- 
retary’s financial plan to perpetuate the national debt, 
establish a national bank, and enact a policy of govern- 
mental favors for select interests would subvert pop- 
ular government in America. Hamilton’s part in the 
President’s Neutrality Proclamation and the Jay Treaty 
during Washington’s second term was part and parcel 
of the same antirepublican agenda. Madison’s assaults 
on Hamilton’s program were not merely measures of 
resistance to the Federalist agenda, however; his aim 
was also to pave an alternate economic and political 
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route that accorded with the principles of republican- 
ism, as he understood them. He attempted to prevent 
measures he believed were contrary to the sovereign 
authority of public opinion as expressed in the Con- 
stitution and to establish and secure a political system 
conducive to the ongoing formation of public opinion, 
on which government remains dependent in its ordi- 
nary operations. If Hamilton believed the attainment 
of American glory to be contingent on increasing the 
power of the national government and tying the inter- 
ests of the monied class to it, thereby achieving eco- 
nomic prosperity and political stability and strength in 
one fell swoop, Madison believed the glory of America 
to consist in her discovery of the way to educate public 
opinion in a republic so that power and right would 
be on the same side. This fundamental challenge was 
one Madison wrestled with in the 1780s and contin- 
ued to think through in the 1790s. He was convinced 
that the republican solution depended on modifying 
the sovereignty (PJM 9:357, 10:214). 

In the early 1790s Madison argued that the insti- 
tution of a national bank was contrary to the Con- 
stitution, as understood by the people who ratified 
and adopted it (PJM 13:372—87, 395—96; Farrand 1966, 
3:533—34). He viewed Hamilton’s proposal to establish 
the bank as the attempt to use unconstitutional means 
to accomplish legitimate ends. Hamilton’s “Report on 
Manufactures” went even further: it proposed the na- 
tional exercise of power to achieve ends not mandated 
by the Constitution (PJM 14:180, 193). Madison viewed 
the Constitution of the United States as the embodi- 
ment of the highest expression of the opinion of the 
public. No opinion in the regime, however widespread 
and popular, is superior to the voice of the people ex- 
pressed in its most sovereign capacity in this document. 
Only the extraconstitutional invocation of the right of 
revolution can claim moral superiority. The idea of con- 
stitutionalism is derivative of the principle of popular 
sovereignty, which forms the democratic basis for the 
doctrine of originalism. No one took this doctrine more 
seriously than Madison. He viewed Hamilton’s broad 
construction of the Constitution as more than a point 
of legal debate. It struck at the very philosophical basis 
of republican government. The idea of consent of the 
governed means that something was consented to— 
understood and agreed to—by the people in their most 
sovereign capacity. The people are “the only earthly 
source of authority,” Madison wrote. The charters au- 
thenticated by their seal in the solemn act of founding 
constitute the most sacred of trusts. Constitutions are, 
in essence, the holy writs of this world, the “politi- 
cal scriptures” of faithful citizens. “They are bound on 
the conscience by the religious sanction of an oath..., 
[transcending] all other landmarks, because every pub- 
lic usurpation is an encroachment on the private right, 
not of one, but of all” (PJM 14:191). The American 
founding represents a charter of power granted to the 
government by a free people. It was a revolution in 
the annals of human history without parallel, as mo- 
mentous a part of the American Revolution as the 
shots fired at Lexington and Concord. Probably even 
more so. Hamilton’s interpretation of the Constitution 
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effectively removed the limitations on the power of 
government placed there by the sovereign authority of 
the people, undermining the core principles of repub- 
lican government. 

It has been argued that Madison’s altered position 
on the issue of the national bank during his presidency 
represents an abandonment of the doctrine of con- 
stitutional superiority, that in this instance Madison 
trumped the authority of the Constitution with the 
power of ordinary public opinion and legislative prece- 
dent (Meyers [1973] 1981, 389-90; Rosen 1999, 140). I 
believe this is an erroneous reading of the explanation 
Madison provides for his change of view. Madison was 
not arguing that ordinary public opinion—even when 
settled over a course of many years and precedent 
established—is ever superior to the Constitution. His 
argument was that for over 20 years public opinion 
had acquiesced in the decision to establish a national 
bank, demonstrating that the generation who ratified 
the Constitution were in fact not adverse to it and did 
not understand it to be contrary to the Constitution— 
even if Madison, in “his solitary opinion,” had (Meyers 
[1973] 1981, 390-93). Accordingly, the bank was not 
nor ever had been unconstitutional. Madison is not here 
confessing to any weakening of his dedication to the 
Constitution as the supreme authority in all cases, nor is 
he admitting to any inconsistency of principle. Rather, 
he is conceding that he had misread public opinion on 
the issue in the early 1790s. The establishment of a 
national bank was not, as he had earlier thought, con- 
trary to the Constitution, as understood by the public 
who ratified it. Thus, as president, he could respect leg- 
islative precedent because the institution of the bank 
was not an unconstitutional exercise of power, but only 
an ordinary, legitimate legislative act. His action as 
president did not represent an exception to the idea 
of the fundamental authority of the Constitution, and 
indeed he was, without fail, committed to the doctrine 
of constitutionalism throughout his life. “A Constitu- 
tion being derived from a superior authority,” he said in 
1831, “is to be expounded and obeyed, not controlled or 
varied, by the subordinate authority of a Legislature” 
(Meyers [1973] 1981, 391, emphasis added). 

Given Madison’s commitment to the doctrine of con- 
stitutionalism, the representatives of the people are 
bound by oath and sacred trust to abide by the provi- 
sions and principles of the Constitution even when a 
majority of citizens demand measures to the contrary. 
Nonetheless, Madison’s theory of republicanism was 
no more an elite theory of statesmanship devised to 
circumvent majoritarian politics than it was a theory 
constructed to stymie democracy or substitute plural- 
ism in place of justice as the end of government (see 
Dahl 1956, 1—33; Diamond 1977; Sharp 1993, 2; Wills 
1981, 179—264; Wood 1987, 91—93). He had as little 
confidence that enlightened statesmen would always 
be at the helm as he had that a simple or aggregate ma- 
jority of the community would always and only demand 
those things consistent with natural and political right. 
Majority faction is the greatest threat and requires the 
most intense theoretical scrutiny in all polities in which 
majority opinion actually does reign supreme. In the 
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1780s he focused his mental energies more on solving 
the problem of majority faction than minority faction 
because he was committed to the principle of majority 
rule and he envisioned the majority as ultimately de- 
termining the law in America. Madison did not change 
his mind about this in the 1790s. In the battle with 
Hamilton and the Federalists he fought against 
schemes that would undermine the formation and force 
of the public voice and substitute an independent gov- 
ernmental will. And he fought to establish in practice 
what he had conceived at his writing desk. I doubt that 
he was as surprised about the political realities of the 
new administration with men such as Hamilton and 
Adams in power as is often thought. He knew a fair 
amount about their views, though he did not know for 
certain how they would play out their ideas vis-à-vis 
the decisions that had been made at Philadelphia and 
endorsed by the people. Once he saw that Federalist 
policies ignored the authority of the citizen-founders 
and threatened to sever the government from the peo- 
ple, he reacted to their “anti-republican” agenda. At 
the same time, he remained proactively committed 
to achieving and vindicating majority rule, the archi- 
tectonic challenge that marked his long career as an 
American Founder. 

The rift between Madison and Hamilton in the 
Washington administration was not caused by inconsis- 
tency on Madison’s part, nor was Jefferson responsible 
for their political estrangement. Madison did not, as 
Elkins and McKitrick (1993, 266) and Rakove (1990, 
100) have contended, seek to insulate national pol- 
itics from public opinion in the 1780s and then de- 
velop “a new feeling for the legitimacy of majorities” 
and embark on a “new course of theorizing” in the 
1790s. Rather, in the 1790s he engaged in a further 
and more “thorough investigation” of the remedy for 
majority faction in order to justify popular government 
and the authority of public opinion (see PJM 14:159; 
cÊ 10:212). The philosophical divergence between 
Madison and Hamilton did not originate in the 1790s, 
though their prior differences were clearly exacerbated 
by political events in the formative years under the 
new Constitution. Certainly, the decisions made in 
Philadelphia in 1787 and ratified by the people influ- 
enced Madison’s understanding of the American polit- 
ical system (Banning 1995, 171, 191), but this is fully in 
accord with his unerring commitment to the idea of the 
Constitution as the encapsulation of the most sovereign 
voice of the people. The accusation of inconsistency 
would in fact be warranted if he had taken the reverse 
tack, that is, if he had not heeded the authoritative 
intent of the people, who alone infused the Constitu- 
tion with life and validity. From Madison’s perspective, 
Hamilton’s lack of respect for the authoritative opinion 
that informs the Constitution, and his determination to 
substitute his own economic and political vision despite 
the decree of the sovereign public, was the crux of their 
political division. 

Throughout his life, Madison’s practical efforts were 
based on two equally important theoretical maxims: 
the majority must ultimately rule, and it must have 
right on its side. In republican government, Madison 
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wrote in preparatory study for the Constitutional Con- 
vention, “the majority however composed, ultimately 
give the law” (PJM 10:355). The problem of course is 
that the majority may have power but not right on its 
side. Madison insisted on both. Majority rule is a nec- 
essary but not sufficient condition of free government. 
Its legitimacy depends on the respect and protection 
the majority accords to the rights of the minority. In 
the manifestation of their freedom the citizens have a 
moral obligation to extend “that debt of protection” 
they mutually owe each other in the exercise of natural 
and positive rights, and for which they as a “public” 
pledged their “faith ... by the very nature and original 
conditions of the social pact” (PJM 14:267). A govern- 
ment independent of the will of the society is unre- 
publican and illegitimate, but so too is a government 
that has force, but not right, on its side (FP 51:292—93; 
cf. PJM 9:350, 355). 

The will of the society is manifested in govern- 
ment through the constitutionally prescribed 
which give to the legislature preeminence in public 
policymaking. A frequently elected legislature is more 
closely aligned with the will of the people than are the 
other branches of government. In a large republic it 
is less likely to be the pawn of majority faction than 
in a small one. Nevertheless, Madison recognized the 
problem that worried Hamilton: Whatever the size of 
the nation, assemblies are to some degree susceptible 
to the influence of demagoguery and the heat of capital 
politics (PJM 9:354, 14:165—66, 13:93—-94; FP 58:328— 
29). Madison did think that the clash of arguments in 
public bodies can contribute substantially to the de- 
liberative process, but he also acknowledged that the 
advantages are often outweighed by false reasoning 
and the easy contagion of opinion and passion in a body 
that meets under one roof in a politically charged city. 
The problem of securing “the benefits of free consul- 
tation and discussion” is especially great in numerous 
assemblies, where proceedings are often marked by 
the confusion and immoderation that generally accom- 
pany mass gatherings (FP 55:310). To counteract these 
dangers Madison endorsed the auxiliary precaution of 
separation of powers, including the division of the leg- 
islature into two houses and the attendant devices of 
checks and balances. However, the “primary control” 
on the government, he declared in Federalist 51, is “a 
dependence on the people” (FP 51:290). 

Publius’ explicit declaration of a reliance on the peo- 
ple as the chief control on government deserves greater 
attention by scholars. As Madison would later argue in 
the Party Press Essay “British Government,” separa- 
tion of powers and checks and balances are important 
prudential devices to control the will of the govern- 
ment and protect liberty, but they are auxiliary to a 
primary dependence on public opinion (PJM 14:201; 
cf. 14:218). This is true both empirically and norma- 
tively. Public opinion is more powerful than parch- 
ment barriers and institutional arrangements. Madison 
believed that public opinion is also the fundamental 
authority in republican government. In the conclusion 
of the fifty-first Federalist he restates his case for a de- 
pendence on the people in even broader terms than he 


415 


Madison v. Hamilton 


August 2004 





initially had in the essay: The will of the government 
must be dependent on the will of the society. Accord- 
ingly, the public is not only the primary guardian whose 
watchfulness keeps government within its prescribed 
boundaries, but also the active agency upon which the 
movement of government depends. When the asser- 
tions in Federalist 5] are attended to in the context 
of the two preceding Federalist Papers, a nascent idea 
beats in the ear of Publius’ audience. It is reason, not 
passion, which ought to prevail over legislative deci- 
sions. Specifically, it is the reason of the public that 
ought to control the government (FP 49:285, 50:287). 
In the Party Press Essay “Spirit of Governments,” 
Madison pounded the republican drum to a rolling 
cadence, Contrasted with the imposter republican gov- 
ernment advanced by some, which is actuated by pri- 
vate interest and avidity and pretends to operate by 
the liberty of the many, but in fact is supported by 
the domination of the few, Madison set forth the true 
republican model: 


A government, deriving its energy from the will of the 
society, and operating by the reason of its measures, on 
the understanding and interest of the society. Such is the 
government for which philosophy has been searching, and 
humanity sighing, from the most remote ages. Such are the 
republican governments which it is the glory of America 
to have invented, and her unrivalled happiness to possess. 
(PIM 14:234) 


In the same vein of thought that runs through Federalist 
49, 50, and 51, “Spirit of Governments” reinforces and 
intensifies the claims of Publius. The spirit of republi- 
canism, Madison emphatically pronounced in the Party 
Press Essays, requires that the will of the government 
be dependent on, “or rather the same with,” the will 
of the society, and the will of the society be subject to 
“the reason of the society” (PJM 14:207). The process 
of subjecting the public will to the precepts of reason 
directs popular government toward the ends of justice 
and the general good. In turn, the resulting laws inform 
the citizens’ understanding and influence their percep- 
tion of the public interest. This has been the ambitious 
quest of philosophy and the ardent longing of human- 
ity for time immemorial, Madison declared. America 
has answered humanity’s call, and upon her soil the 
greatest of political aspirations are to be realized. 

In The Federalist Madison argued for a political sys- 
tem that regulates the interests and passions within 
society, and that itself is dependent on the will and rea- 
son of the public. As he continued to think through and 
hone his theory of public reason in the ensuing years, 
he gave it added emphasis and clarity. In the 1780s and 
into the 1790s Madison avidly read French texts on the 
subject of public opinion. French interest in the subject 
had emerged about 1770 and captured the minds of 
the French intelligentsia in the 1780s° Due in large 
measure to Jefferson’s generous shipment of crates 


6 These authors included Raynal, La Bruyère, Necker, Turgot, 
DuPont de Nemours, Le Trosne, Louis-Sébastien Mercier, Le 
Mercier de La Riviére, comte de Mirabeau, Brissot de Warville, 
Condorcet, Barthélemy, and Peuchet. Madison had in his possession 
in the 1780s works by all of these writers, and in fact he packed 
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of books from Paris to Montpelier during his tenure 
as ambassador to the French court, Madison studied 
a host of French authors on the subject of l’opinion 
publique and agreed with them about the undesirability 
of the British model of corporate political conflict so 
admired by Montesquieu (and Hamilton and Adams). 
Though unlike most of the French theorists he did not 
reject the doctrine of separation of powers and checks 
and balances, he did agree with them that the key to 
achieving political stability and individual liberty is not 
by a system that pits the interests of the few against 
those of the many, but by the force of an enlightened 
public opinion that results from a continuing process 
of communication among men in society. 

Madison was in general agreement with the French 
writers who moved substantially beyond Necker’s con- 
ception of public opinion as confidence in government 
and envisioned a more energetic role for the pub- 
lic in the political life of a nation. Theorists such as 
Turgot, Condorcet, and Peuchet emphasized both the 
influence of the enlightened men on public opinion 
and the directive influence of public opinion on gov- 
ernment, conceiving of public opinion as being both 
acted upon and itself an active agent. In a complex and 
layered process of civic participation and communica- 
tion, the diverse views of citizens are modified to form 
a united public reason.’ The enlightened members of 
society bear an important responsibility to shape the 
public views, but it is equally important that the public 
be enlightened, active, and united. When their opinion 
is fixed and their voice united, it directs the decisions 
of government. Madison subscribed to this idea of an 
activated public whose opinion carries political force. 
Contrary to those who would devise schemes that de- 
tach the government from the people, warn it to be 
vigilant against the centrifugal tendency of the people, 
and ask of the people only that they obey their wise 
and enlightened rulers, Madison argued that the goal 
is to awaken and enlighten the people, warn them to be 
united and vigilant, and to obey the government that is 
of their own making (PJM 14:426—27). 

Hamilton’s allegations of ties between American Re- 
publican theory and French Enlightenment thought 
were not unfounded. Though no mere follower or 
devotee of French theory, Madison was influenced by 
the works he read and even adopted key language 
from them. For example, Condorcet argued that over 
time public opinion derives force from the effect of 
“fixed principles” and unites society under “an empire 
of reason” (Baker 1976, 58). Peuchet said that in the 


and shipped works by most of these authors to his residence in the 
temporary capital city of Philadelphia in the summer of 1790 (PJM 
13:286—-89) For a more extensive treatment of the French theones 
of pubhc opinion, see Sheehan 2002. 

7 Madison believed that, when properly formed, pubhc opmion is 
tantamount to the reason of the public, but he chsagreed with some 
French authors about what constitutes public reason and how it 1s 
achieved, e.g., the physiocrats’ and Condorcet’s reliance on évidence 
and ma i calculations to produce public reason. See 
Sheehan 2002, 939—40, 954-55 In contrast, McLean (2003) and 
Schofield (2003) see Madison as accepting Condorcet’s early form of 
rational choice theory. 
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modern world Christian morality has united men as 
brothers; scientific discoveries have led to an increase in 
communication and the circulation of knowledge 
among men and “extended the sovereign empire of rea- 
son” (Peuchet 1789, viii). The optimism of the French 
and their wont for the felicitous expression captured 
Madison’s ear and imagination. “Let it be the patriotic 
study of all,” he declared, “to erect over the whole 
[society], one paramount Empire of reason, benevo- 
lence and brotherly affection” (PJM 14:139, emphasis 
added). 

“The great desideratum” in government, Madison 
wrote in preparation for the Constitutional Conven- 
tion, is the establishment of a “disinterested & dis- 
passionate umpire” which renders impartial judgments 
between the different passions and interests of the so- 
ciety (PJM 9:384; cf. 9:357, 10:214). The achievement 
of reasonableness and impartiality in republican gov- 
ernment, he believed, hinged on a “modification of the 
Sovereignty.” Leading up to the Philadelphia conven- 
tion Madison conceived of this just umpire determining 
national policy as well as officiating over state legisla- 
tion and exercising the power of veto in the case of 
unjust local laws. Subsequent to the Convention’s re- 
jection of his proposed national negative on state laws, 
Madison continued to seek the establishment of an im- 
partial referee that would sit in judgment on national 
concerns (PJM 10:214). The great problem in popular 
government, he argued in the tenth Federalist, is that 
the parties to the case must themselves also be the 
judges (FP 10:48). Madison’s theory of the extended 
republic and representation in the tenth Federalist has 
been analyzed by many scholars to demonstrate how 
he intended to achieve a “disinterested & dispassion- 
ate umpire” in republican government. Gordon Wood 
(1987) attributes a theory of elite statesmanship to 
Madison and the Founders via their creation of large 
electoral districts from which the more talented and 
noble sorts of men will tend to be elected. The use of the 
term “popular sovereignty” was a democratic rhetori- 
cal device used to supplant democracy with aristocracy 
(Wood [1969] 1998, 562). Joshua Miller (1988) and 
Robert Wiebe (1984) go further, treating Madison’s 
and the Founders’ ascription of sovereignty to the pub- 
lic as a sleight of hand. The Federalists’ “public” was 
“a mere abstraction,” “a mythical entity,” “a ghostly 
body politic,” which does not participate, deliberate, 
or take action (Miller 1988, 99, 104, 114; Wiebe 1984, 
38—39). 

In contrast, Lance Banning (1995) argues that 
Madison was “adamant that once the proper checks 
had been imposed and passing passions had been 
cooled, the will of the majority must rule (372). In or- 
der for Madison’s republican theory to work, Banning 
claims, representatives in Congress must reflect the di- 
verse views of their constituents (209). Alan Gibson 
(1991) finds both of these interpretations lacking and 
argues that the Madisonian remedy of an impartial 
umpire consists in the formation of just majorities 
in Congress, made possible by (1) the inclusion of a 
multiplicity of interests in a large territory with a di- 
verse population, which obstructs the communication 


of factious views and leaves representatives fairly in- 
dependent in the exercise of their trust, and (2) large 
electoral districts from which impartial representatives 
are more likely to be chosen. Gibson criticizes Banning 
for failing to account for the achievement of impartial- 
ity in public decision making and ultimately succumb- 
ing to the pluralist model he tries to avoid (267-68). 
Conversely, Banning argues that Gibson does not take 
account of the importance of the practicable extent of 
territory and the role it plays in maintaining the rulers’ 
responsibility to the people in Madison’s theory (212, 
n. 61). “Madison never argued that the national legis- 
lators would be capable of acting as impartial referees 
over clashing interests at the national level,” Banning 
declares (470, n. 54). I would add that Gibson’s combi- 
nation model does not solve the difficulty of preventing 
the communication and spread of factious views among 
the national representatives, who meet in person in the 
capital city and have open, easy lines of communication 
and ample opportunity for the formation of factions, 
which could well prove an overmatch for contrived 
institutional rivalries. If Gibson’s delineation of public 
opinion is meant to counter the problem of representa- 
tives who act from passion and partial interest, then it 
too ultimately yields to the pluralist thesis. In Madison’s 
mind, Gibson contends, public opinion “was simply 
a public consciousness formed from the aggregate of 
individual sentiments” (1991, 285, 2002, 287, 282); his 
goal was not to reform the citizens of an unjust majority 
or to educate and form civic character (300—1). 

I take Madison’s remedy of a “dispassionate and 
disinterested umpire” to be something more complex 
and much more deeply republican. The “great desider- 
atum” for which Madison is searching is informed by 
the principle that “the majority ... alone have the right 
of decision” (PJM 9:384; cf. 9:357). When the various 
components of Madison’s thought are viewed as part 
of a single design informed by an overarching, positive 
theory of participatory politics, his arguments fit to- 
gether to form a coherent philosophical vision. He en- 
capsulated this vision in his theory of the politics of pub- 
lic opinion. “Public Opinion,” Madison declared, “sets 
bounds to every government, and is the real sovereign 
in every free one” (PJM 14:170; cf. 14:161—62). In all 
governments public opinion operates as a force that 
limits the power of government. In all free govern- 
ments public opinion is the ground of all legitimate 
authority; it functions as both a defensive agency that 
controls government and an active agency that directs 
the will of the government. As the embodiment of the 
highest expression of public opinion, the Constitution 
provides a standard to which officials and citizens can 
appeal to limit the power of government, as well as a 
source of instruction concerning individual rights and 
responsibilities. The latter is what Madison meant by 
the beneficial effects of a bill of rights over time, as 
it is sanctified and incorporated into public opinion 
(Meyers [1973] 1981, 169; PJM 14:162—63, 170). In 
addition to its manifestation in the Constitution, pub- 
lic opinion has three other modes of expression: as 
the censor of governmental acts, as the constitutional 
majority, and as the general spirit that permeates 
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the nation (and perhaps beyond). The censorship of 
governmental measures by public opinion finds expres- 
sion via state political organs and by educated men via 
the print media (FP 44, 46; Meyers [1973] 1981, 262—64; 
PJM 10:214). These are essentially defensive measures 
against political usurpation. The appeal may be to the 
people of the states or even directly to the people as 
a collectivity. However, public opinion in these cases 
does not carry the force of law, though it may well “lead 
to a change in the legislative expression” of the public 
will or even to a change in judicial opinion (Meyers 
[1973] 1981, 270). The directive agency of public opin- 
ion manifests itself through the constitutional mecha- 
nisms of free elections and representation, by which 
“the will of the largest political body may be concen- 
tered and its force directed to any object which the 
public good requires” (FP 14:68—69). In this expres- 
sion of public opinion by the constitutional majority 
the people’s agency is not direct, but it is nonetheless 
their will, and not a government insulated from the 
actual views of the people, that directs public measures. 
Finally, public opinion in its broadest sense consists of 
the settled views and general convictions of the peo- 
ple. Its potential power is prodigious: It can preserve 
or alter public morality; it can support or scorn the 
laws. The formation of constitutional majorities occurs 
within a sphere permeated by an overarching and ubiq- 
uitous public opinion. When settled, the opinion of the 
constitutional majority is absorbed by public opinion, 
contributing to the ongoing modification and construc- 
tion of public opinion in a republic (PJM 9:355). 
Madison discovered the remedy for the apparent 
dilemma between the problem of majority faction and 
the ultimate right of the majority to rule in republican 
government by an analysis of the politics of commu- 
nication in an extensive republic. On the one hand, a 
nation should be large enough to include a multiplicity 
of interests and sects, thereby neutralizing the effects of 
interest or passion by denying any one of them majority 
status. The extensive size of the territory makes it less 
likely that a majority activated by a common passion 
or interest will be able to communicate effectively and 
unite for unjust ends. In addition to the general chal- 
lenge of forming a united voice over so large and popu- 
lous a land, the consciousness of dishonorable motives 
and unjust purposes provides an obstacle to the forma- 
tion of a majority faction, for “communication is always 
checked by distrust in proportion to the number whose 
concurrence is necessary” (FP 10:51). On the other 
hand, the territory must not be so large that it precludes 
the communication of ideas and the formation of public 
opinion on the basis of justice and the general good. 
Indeed, Madison’s repeated insistence on limiting the 
size of the territory to a practicable sphere—in both 
the 1780s and the 1790s—is logical only if he intended 
a positive political role for a national majority united by 
a common opinion. The opinion of the constitutional 
majority is a modification of the views of a latent major- 
ity. This is achieved through established constitutional 
processes in an extensive, representative, federal gov- 
ernment, which provides the arena in which to collect, 
temper, and refine the public views into a collective 
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opinion that accords with the rights of others and good 
of the whole. 

When public opinion is fixed, Madison taught, it must 
be obeyed by the government. When not settled, it may 
be influenced by those in government. The extensive 
size of the territory makes it difficult for a faction to 
“counterfeit” the opinion of the public; the limitation 
on size to a practicable sphere enables the “real” opin- 
ion of the public to form and carry effect (PJM 14:170). 
Madison argued in both The Federalist and the Party 
Press Essays that the practicable boundaries of a repub- 
lic can be stretched without sacrificing the formation of 
the public voice if conditions that ease intercommuni- 
cation among the citizens are present. These conditions 
include good transportation routes, improvements in 
interior navigation, the free circulation of newspapers, 
and representatives traveling to and from the capital 
city, all of which act as equivalents to a contraction 
of the territorial size (FP 14:70—71; PJM 14:170, 161). 
In contrast to a nation that is too small and where a 
majority faction easily arises, or to one that is too large 
and in which the public voice cannot be collected, a ter- 
ritory of practicable extent provides the conditions for 
the communication of ideas, the proper formation of 
public opinion, and its appropriate degree of influence 
on the representatives. Under these circumstances the 
representatives are effectively distanced from the influ- 
ence of the ephemeral passions and partial interests of 
the diverse factions within their districts, while simul- 
taneously kept dependent on the will of the society. 

Madison’s goal was not merely the distillation of the 
people’s will by representatives in Congress, but even 
more importantly, the establishment of “an equilib- 
rium in the interests & passions of the Society itself” in 
order to create the conditions necessary to refine and 
enlarge the opinion of the society (PJM 14:158—59). 
His advocacy of a large but “practicable sphere” (FP 
14:68, 51:293; PJM 14:170) should be understood within 
this context. Madison’s insight into how territorial size 
contributes to the positive achievement of the just ma- 
jority consists of more than a technical dependence on 
the people via their representatives in Congress. It also 
entails, to the extent possible, the tempering of factious 
impulses and the elevation of opinion within the society 
by means of a dynamic process of communication and 
deliberation throughout the land. Madison’s “modifi- 
cation of the sovereignty” is not merely the alteration 
of individual state interests and popular views by the 
national legislature. It is also the modification of public 
opinion itself. 

Madison believed that the authority of public opin- 
ion is limited by the act of constitutional ratification. 
But he rejected the idea that it is limited to the act 
of constitutional consent, or that it is merely an inter- 
mittent expression of authority at times of elections. 


8 Gary Rosen (1999) 1s correct that for Madison the Constitution 18 
the manzfestation of the sovereign “sense of the community” (165) 
However, his insistence that “Madison’s solution was a kind of con- 
stitutional passion, an unthinking attachment to the Constitution 
as an end ın itself” (127) neglects Madison’s concern for public 
enlightenment and fails to take into account bis conception of the 
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While the authority of the Constitution is fixed and 
its provisions unalterable except through the modes of 
amendment prescribed in the document itself, there are 
a myriad of political decisions that do not involve con- 
stitutional questions. In these, the government ought 
to be informed by the considered views of the public. 
“In no case,” Madison declared, “ought the eyes of the 
people to be shut ...nor their tongues tied.” If left 
uncontrolled by the people, government “ever will be 
administered by passions more than by reason” (FP 49: 
285, 50:287; PJM 17:238—39; cf. 14:426—27). Contrary 
to the notion that Madison wanted the people’s in- 
volvement limited to voting—to kicking the bums out 
of office when they got out of line (Matthews 1995, 
159; cf. 16263), Madison explicitly argued the reverse. 
The doctrine that has “so ardently been propagated by 
many, that in a republic the people ought to consider 
the whole of their political duty as discharged when 
they have chosen their representatives” and “that the 
people ought at all times to place an unlimited confi- 
dence in rulers” they have chosen, is false, he protested. 
Just as he had indicated in The Federalist a decade 
earlier, in “Political Reflections” Madison insisted that 
the people are the primary control on the government, 
that they have a real and ongoing role in the political 
life of their country, and that the manifestation of the 
reason of the public results from their active political 
participation and the communication of ideas. 

In the contest with Hamilton, Madison routinely 
applied his theory of the ongoing authority of public 
opinion to the practical issues of the day. With respect 
to the issue of the public debt, Hamilton was correct 
to think that he had Madison’s general support for 
funding, and indeed Madison argued on the floor of the 
House of Representatives in early 1790 that the debt 
incurred in the war for independence must be funded. 
However, Madison’s general view was that although 
funding was at times necessary in the life of a nation, it 
was nonetheless an evil (PJM 13:75). While he assented 
to those measures necessary to reestablish public credit 
and retire the debt, he was adamantly opposed to a per- 
petuation of it and, in fact, had been so for many years 
(PJM 13:106, 317; cf. 13:37; 6:272, 298). The extension of 
the debt would only further the distance between the 
national government and the interests of the people. 
Public debt generally results from the costs of running 
war and fitting an army, all of which tends toward the in- 
crease in executive discretionary power, corruption in 
government, and governmental independence from the 
popular will, he argued in the Party Press Essay, “Uni- 
versal Peace” (PJM 14:206—9, 274-75, 15:474, 518). 
Such has been the ploy used by governments to extend 
and perpetuate arbitrary power throughout human his- 
tory. The cure for this, Madison declared, is to make the 


dynamic character of public opmuion and 1ts continuous operation and 
influence in the everyday life of the polity. Roger Sharp (1993) also 
gives Madison’s conception of public opinion a static quality, arguing 
that although Madison called for a dependence on an enlightened 
and watchful public, in the early 1790s he did not suggest how public 
opinion would be collected and articulated, regarding it as “a fixed 
entity that was supportive of republicanism but essentially inert” 
(45). 
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will of the government “subordinate to, or rather the 
same with, the will of the community” (PJM 14:207). 
Furthermore, to the extent possible, each generation 
should bear the financial burdens of debt it has taken 
on, thereby prompting “avarice ...to calculate the ex- 
pences of ambition” and “in the equipoise of these 
passions, [leaving] reason... free to decide for the pub- 
lic good” (PJM 14:208). By “permanent and constitu- 
tional maxims of conduct” the executive temptation 
for war must be moderated by the legislative represen- 
tatives’ willingness for war, contingent on the opinion 
of their constituents. The people’s temptation to war is 
controlled by “subjecting the will of the society to the 
reason of the society” (PJM 14:207). 

In the exchange with Hamilton writing as Pacificus, 
Madison as Helvidius insisted on the legislative na- 
ture of the power to declare war and make treaties, 
as delineated in the Constitution. “Under colour of 
vindicating an important public act,” Helvidius wrote 
in his first installment, Pacificus “advanced [principles] 
which strike at the vitals of [the nation’s] constitution, 
as well as at its honor and true interest” (PJM 15:66). 
The violation of separation of powers manifested in 
the President’s proclamation of Neutrality in 1793 was 
a travesty with respect to “the simple, the received, and 
the fundamental doctrine of the constitution, that the 
power to declare war including the power of judging 
of the causes of war is fully and exclusively vested in 
the legislature” (PJM 15:108). The Helvidius essays 
are in part a continuation of the argument Madison 
presented in “Universal Peace.” “War is in fact the 
true nurse of executive aggrandizement” (PJM 15:108), 
Helvidius wrote. Quoting one of Hamilton’s contribu- 
tions to the Federalist, Helvidius argued that a “hered- 
itary monarch ...[is] often the oppressor of his peo- 
ple,” though generally he has too much personally at 
stake in his government to be corrupted by a foreign 
power (PJM 15:109). An elective magistrate, on the 
other hand, may be tempted both by avarice to sacri- 
fice the interests of his fellow citizens and by ambition 
to betray his country. Madison feared that Pacificus’ 
construction of the American Constitution drew upon 
British theory in a most dangerous way, threatening 
to destroy the rudimentary constitutional conditions 
necessary to the achievement of an impartial umpire in 
republican government. 

In Madison’s analysis of interest and disinterested- 
ness he argued that the advantage of absolute monar- 
chy is that the king is sufficiently neutral towards the 
different interests and parties of his country, whereas in 
a republic the will of the majority may sacrifice the in- 
terests of the minority (PJM 9:357, 9:384). Conversely, 
the advantage of republics is that the sovereign will is 
sufficiently restrained from making decisions contrary 
to the interests of the society; in monarchy it is not, 
and the king may sacrifice the interests and happiness 
of his subjects to his own personal ambition and gain. 
The arbitrariness of republican government is reme- 
died by enlarging the sphere, thwarting the ascendancy 
of majority faction and providing adequate conditions 
for the refinement of public views. This remedy, how- 
ever, is contingent on maintaining the beneficial effects 
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of republican government, i.e., that the will of the 
government is dependent on the will of the whole soci- 
ety and controlled from setting up an interest adverse 
to it. The United States Constitution lodges the ques- 
tion of war and peace with the legislature, and not with 
the president, and gives the latter only partial, and 
not the sole, power to make treaties, precisely to 
weaken the executive temptation to betray the interest 
of his nation. 

Madison clearly understood that continued commer- 
cial relations between the United States and Great 
Britain were critical to the success of Hamilton’s fi- 
nancial program. If Hamilton was willing to main- 
tain America in a position of economic subordination 
to the British and sacrifice national honor and inter- 
est in order to advance short-term economic gains, 
Madison was not. In the long run, Madison believed, 
the economic, political, and moral strength of the 
United States was tied to achieving a nonsubservient 
economic position. At the outset of the first Congress, 
and time and again in subsequent years, Madison ar- 
gued that the establishment of a beneficial or at least 
more equitable commercial policy with the British 
would take fully into account American preeminence 
in agriculture and Great Britain’s dependence on 
American produce. While England depended on the 
United States for the raw materials used in her man- 
ufacturing industry, her West Indian colonies—from 
which she drew an immense income—depended on us 
for the necessaries of life. There was no good reason 
to adopt Hamilton’s servile response to British com- 
mercial dominance and allow the Empire to treat the 
United States as a British colony. Commercial retalia- 
tion against the British would force a change in trade 
policy; other markets, particularly France, could sub- 
stitute for losses in Anglo-American commerce. Dur- 
ing the war between England and France Madison at- 
tempted to counteract the “Anglican Party” and the 
false appearance that public opinion endorsed its prej- 
udices for England and against France. He and James 
Monroe produced a model resolution to be distributed 
at country meetings, the object of which was to pro- 
vide a means to mobilize, collect, and manifest “the 
genuine sense” and “real sentiments of the people”— 
that is, “the agricultural” and “commanding part of 
the society,” and to negate the counterfeiting of public 
opinion coming from the nation’s commercial centers 
(PIM 15:92—93; cf. Banning 1995, 377—78). 

A nation whose citizens depend for their livelihood 
on the manufactured production of superfluities and 
the whims of fashion and fancy, Madison claimed, is 
one in which one class of citizens lives in servile depen- 
dence on another. “In proportion as a nation consists 
of that description of citizens, and depends on external 
commerce, it is dependent on the consumption and 
caprice of other nations” (PJM 14:258; cf. 14:164—65; 
17:559—60). Madison did not share Hamilton’s dream 
that America become an industrial prodigy. Instead, 
he believed that agriculture was the most beneficial 
object of human employment in the United States and 
the industry most productive of real wealth in a nation. 
The way of life of the husbandman, he argued in 1792, is 
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“the most truly independent and happy” (PJM 14.246). 
A nation predominant in agriculture is most favorable 
to the health, virtue, intelligence, competency, liberty, 
and safety of the greatest number of individuals A 
manufacturing nation, by contrast, courts the dangers 
of wantonness and waste, inviting into its environs the 
wretchedness of the Bridewells and Bedlams (PJM 
14:244—46; cf. 14:186, 257-59). As the population in- 
creases, a proportion of the inhabitants of a nation 
will gradually and naturally shift their employment 
from agriculture to the manufacturing, mechanical, and 
commercial industries, but this diversion ought not be 
artificially encouraged. Rather, “it ought to be seen 
with regret as long as occupations more friendly to 
human happiness, lie vacant.” Domestic manufactures 
would develop naturally, he said, at the stage when 
“hands [are] not called for by agriculture” (FP 41:230). 
Governmental encouragement of manufactures artifi- 
cially diverts human industry from a more to a less 
beneficial course and therefore ought to be limited to 
considerations regarding existing establishments that 
would otherwise perish (PJM 12:70—72). 

Madison believed that privileging one industry over 
another violates both the rights of property and the 
rights of persons (PJM 14.266-67). Property is not 
secure, he asserted, when unequal taxes burden one 
kind of property and reward another; nor is it pro- 
tected when a part of the citizenry is denied the free 
exercise of their faculties and the free choice of their 
occupations. Building on Federalist 10’s claim that the 
rights of property originate in men’s free exercise of 
their diverse faculties, he claimed that the individual’s 
free use of his faculties and choice of occupation not 
only constitute his property, but also are the “means 
of acquiring property” (PJM 14:267). Viewed in this 
context, Madison’s alarm at Hamilton’s Report on 
Manufactures seems understandable. The protection 
of these different faculties, Madison had written in The 
Federalist, “is the first object of government” (FP 
10:46). 

Stemming from the free exercise of his faculties, man 
has a property “in his opinions and in the free commu- 
nication of them” (PJM 14:166). When the power of 
government is excessive and unjustly interventionist, 
no man is secure in his opinions or in the effective 
communication of them. This is a particular danger in 
a large republic, since the size of a nation has the ef- 
fect of making intercommunication and the discovery 
of a united purpose more difficult. If public opinion 
is to exert adequate and proper control on the gov- 
ernment it must, Madison contended, have sufficient 
channels through which it can be expressed, formed, 
and enlightened. The process of collecting, coalescing, 
and shaping public opinion is accomplished by a va- 
riety of conditions and processes, including state and 
local governmental bodies, educational institutions and 
the learned professions, the circulation of newspapers 
throughout the nation, and the exchange of views be- 
tween representatives and their constituents as well as 
among themselves at the seat of government. 

That the people had, by their sovereign authority, 
established a partition between the national and the 
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state governments was sufficient to insist on respect 
for the constitutional limitations on power. Madison’s 
defense of the powers reserved to the states is properly 
understood as part of his commitment to constitutional 
government, the anachronistic account that attributes a 
Calhounian view of states’ rights to Madison, notwith- 
standing. But as Adrienne Koch and Harry Ammon 
(1948) long ago pointed out, Madison had an additional 
reason to stress the importance of the federal character 
of the American republic: he considered the state and 
local governments essential to the collection and artic- 
ulation of the public voice (see PJM 17:247). Without 
a due degree of power at the state and local levels of 
governments, the extent of the territory would make it 
impossible for the people to communicate effectively 
and convey a united voice by which to control govern- 
ment (PJM 14:138; cf. FP 46). Conversely, “the most 
arbitrary government is controuled where the public 
opinion is fixed” (PJM 14:192). Federalism is a critical 
element in maintaining governmental responsibility to 
the people; it contributes significantly to shaping an 
environment conducive to the communication of ideas 
and the mobilization and expression of public opinion 
in a large republic. 

Madison would apply his long-held conception of the 
importance of the states in marshalling public opinion 
later in the 1790s, in his battle to overturn the Alien and 
Seditions Acts. Both acts, he declared, constituted clear 
violations of the United States Constitution; in the one 
case the national government assumed a power not 
granted by the Constitution, and in the other it exer- 
cised a power expressly forbidden by the First Amend- 
ment. What is particularly interesting in his discussion 
of the Sedition Act is that over and above his charge 
of unconstitutionality—which he believed must decide 
the matter—he also provides an explanation of the rea- 
soning that informs the American Constitution in this 
matter. In free governments, he argued, “it is the duty 
as well as right of intelligent and faithful citizens, to dis- 
cuss and promulgate [the proceeding of government] 
freely, as well to control them by the censorship of the 
public opinion, as to promote a remedy according to the 
rules of the constitution” (PJM 17:342). In an extensive 
republic in which the central government possesses a 
magnitude of powers and where the great body of the 
people is far removed from the seat of government, 
the state governments serve as “intermediate” bodies. 
The purpose of the Virginia Resolutions, he explained, 
was to utilize the states as vehicles to excite public re- 
flection and mobilize public opinion (PJM 17:348). Fur- 
thermore, the difficulty of circulating knowledge about 
governmental proceedings throughout the large na- 
tion and of maintaining responsibility to the people by 
public officials requires a particularly high degree of 
liberty of the press (PJM 17:341). The Federalist mea- 
sure restricting the freedom of the press was based 
on a different and nonrepublican political model, yet 
another manifestation of their proclivity to imitate the 
British. Driven by a desire to “extend the ground of 
public confidence” (PJM 17:346), Hamilton, Adams, 
and their cohorts would place a censorial power in 
the government over the people. Madison’s concern 


was that the government demonstrate responsibility to 
the people; in “republican government... the censo- 
rial power is in the people over the government, and 
not in the government over the people” (PJM 15:391; 
cf. 11:163; Read 2000, 69—70). A free press “alone can 
give efficacy to [the national government’s] respon- 
sibility to its constituents,” he wrote. It is the means 
for freely examining public characters and public mea- 
sures, and for the free communication of these opin- 
ions, that is “the only effectual guardian of every other 
right” in a free society (PJM 17:189—90, 345). 

The advent of circulating newspapers significantly in- 
creased communications among men and contributed 
to the power of public opinion in the eighteenth cen- 
tury, a phenomenon clearly grasped by Madison and 
many of his contemporaries abroad. The rise of the 
mass media also made communication over a large ter- 
ritory possible for the first time in history. It was now 
possible to found a nation large enough to impede the 
formation of majority faction and at the same time es- 
tablish the circumstances that make possible a genuine 
“commerce of ideas” throughout an extensive territory. 
To my knowledge, this original, momentous insight be- 
longed to James Madison. Madison envisioned newspa- 
pers serving as vehicles for the circulation of the ideas 
of the literati to the people of the extensive American 
republic, resulting in the refinement and enlargement 
of the public views and the emergence of an enlight- 
ened public opinion. The literati, in fact, occupy a cen- 
tral place in the process of civic education and pub- 
lic enlightenment Madison hoped to see in America. 
They are “the cultivators of the human mind—the man- 
ufacturers of useful knowledge—the agents of the com- 
merce of ideas—the censors of public manners—the 
teachers of the arts of life and the means of happiness” 
(PJM 14:168). In Madison’s view, their role is abso- 
lutely indispensable to the proper formation of public 
opinion. Their influence on the ideas and manners of 
the people can serve to anchor a republican citizenry in 
the moral principles of free government. Madison’s use 
of the language of agriculture and manufacturing in his 
description of the highest aims of the new republic was 
clearly no accident. He intentionally meant to contrast 
his vision of the American commercial republic and its 
hero, the merchant of ideas and mores, with the nar- 
rower Hamiltonian emphasis on commerce as material 
exchange and profit. 

The energy Madison expended to stop Hamilton’s 
economic and political policies was proportionate to 
the threat he perceived: Hamilton’s program would 
destroy the limitations on government established by 
the Constitution and undermine the rightful authority 
of public opinion in the American republic. The thrust 
of Hamilton’s financial package was the creation of a 
system that promoted inequality of property by gov- 
ernmental flat and tied the interests of the favored 
opulent class to the national government. Madison be- 
lieved that this clever scheme would have the effect of 
strengthening and consolidating the powers of the na- 
tional government and undermining the constitutional 
and practical limitations placed on its authority. The 
concentration of power at the national level would 
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diminish the power of the state governments. Since a 
single national legislature is not competent to regulate 
all the objects of government over so large a territory, 
the power of the national executive would unduly grow, 
this would open the way for legislative corruption and 
render less effectual the voice of the people and their 
control on the legislature (PJM 14:138). Hamilton’s 
plan would eventually transform the executive office 
into one of “unlimited discretion,” in opposition “to 
the will and subversive of the authority of the peo- 
ple” (PJM 14.274). Ultimately, it might even produce 
a “universal silence,” leaving the national government 
to act independent of the will of the society and free to 
pursue a “self directed course” (PJM 14:138). 

Madison’s advocacy of the politics of public opinion 
was his sustained attempt to solve the problem of ma- 
jority opinion in a manner fully consistent with the form 
and spirit of popular government. The spirit of free gov- 
ernment cannot be attained by achieving the people’s 
consent and then disassociating them from the acts 
of government. The spirit of republicanism is present 
only when it is embodied in the minds and mores of 
the citizens and sustained by the activity of political 
participation and the commerce of ideas throughout 
land. The construction of public opinion involves a 
process of instructive dialogue and deliberation that 
permeates the whole society, from the influence of 
the literati and statesmen on the mores and views of the 
citizens, to the communication of ideas throughout the 
great body of the people, to the influence of the settled 
opinion of the community on the representatives in 
government. The process of forming public opinion 
is a time consuming and complex one, much like the 
process of establishing precedents in courts of law. Ma- 
jority opinion in a republican polity is constantly in 
the process of constructing itself within an intellectual, 
moral, and psychological milieu larger than itself. This 
architectonic influence over the minds and morals of 
the public in turn influences the decisions of govern- 
ment and the laws of the land, which further operate 
on the understanding and interest of the public. This is 
Madison’s solution to the difficult challenge he set him- 
self when preparing for the Federal Convention, i.e., 
how to achieve a “modification of the Sovereignty” 
(PJM 9:357). Public opinion is the sovereign authority 
in a genuine republic whose mild voice of reason is 
capable of transforming the will of a nation. It is no 
surprise, then, how often Madison himself put pen to 
paper in the public press or that he urged his fellow 
citizens, despite all artificial and circumstantial distinc- 
tions, to come together as one people under the mantle 
of the “Empire of reason” (PJM 14:139). 


CONCLUSION 


The disagreement between Madison and Hamilton that 
led to the formation of the first political parties in the 
United States cannot properly be understood as merely 
personal or partisan. It was a battle over the very 
character of republican government and the extent to 
which the people are capable of governing themselves. 
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Hamilton did not think Madison’s solution of the ex- 
tended republic and representation went far enough 
to prevent the problem of majority tyranny. Madison 
thought Hamilton’s measures substituted private in- 
terest for public good and undermined the sovereign 
authority of public opinion. Interestingly, scholars have 
generally attributed the vision of a modern commercial 
republic composed of diverse and rival economic inter- 
ests actuated by the untutored passion of acquisitive- 
ness to James Madison. But this was not, nor ever had 
been, Madison’s vision of republicanism. It is closer to 
Hamilton’s.’ In fact, Hamilton fits better the descrip- 
tion that has traditionally been reserved for Madison, 
while Madison was a more unhesitating democrat than 
is generally believed. Hamilton is the chief American 
theorist of the modern commercial republic; Madison, 
the philosophic architect of the politics of public par- 
ticipation and republican self-government in America. 

Madison and Hamilton did not differ about the need 
to filter the interests, passions, and opinions of the 
citizens or about the need to achieve a reasonable, 
impartial, and durable will in government, but they 
did very much disagree about who or what legitimately 
gives voice to this will and whether the process involves 
modifying the actual views of the citizens. Hamilton 
attempted to solve the problem of the predominance 
of partial interests, the contagion of passion, and the 
danger of demagoguery in the legislature by establish- 
ing a system of institutional counterbalances within 
the government of a diversified, commercial nation. 
He sought to achieve a reasonable and permanent will 
via an independent and energetic executive whose ad- 
ministration would advance the interest of the nation 
and inspire in the people an opinion of confidence and 
habits of obedience. By contrast, Madison’s solution 
was to call the representatives to stand before the bar 


9 For example, Martin Diamond (1972, 1977) attributes to Madison 
the theory that a large republic supplies the remedy for faction only 
if 1t 18 also a commeraal republic (54—55, 648). However, I would ar- 
gue that Diamond’s presentation of the commercial republic theory 
1s actually a much more apt interpretation of Hamulton’s political 
and economic thought According to Diamond’s interpretation, the 
historical battle between the haves and the have-nots was to be 
replaced with a new factional struggle based on the divermty of eco- 
nomic interests. This required magnifying the operation of interest 
(and taming or devitalizing passion and opinion), so that citizens 
would divide themselves on the basis of narrow and particularized 
economic interests, thereby allowing the society to evade the fa- 
tal kind of fachionalism caused by opimion and class interest in the 
past. Diamond further argued that the proponents of this theory 
rejected any attempt to refine and improve the citizens’ opinions of 
the advantageous and just. Instead, they accepted as “irredeemably 
dominant” the self-interested passions sown in human nature. In light 
of this, they sought to channel the powerful passions and interests of 
the society by way of shrewd imstitutional arrangements rather than 
engage in the futile attempt to form the character of the citenry 
While the commercial republic theory presented by Diamond cap- 
tures much of Hamilton’s thought, it does not correctly characterize 
Hamilton’s vision in one umportant respect. Hamilton’s theory of the 
commercial republic did not merely rest on a multiplicity of rival 
interests to effect the common good, nor did it advance the notion 
of a multiplicity of factions. At the New York Ratifying Convention 
Hamulton proclaimed that the objective was “to abohsh factions, and 
to unite all partes for the general welfare” (PAH V-85). Like Necker, 
Hamulton sought to achieve public confidence and unity of national 
sentiment via the effects of a good administration. 
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of public opinion. He sought to establish an equilib- 
rium of passions and interests in the society in order to 
reduce the likelihood of majority faction as well as to 
shape an environment conducive to the formation of a 
public will tempered and modified by the commerce of 
ideas. 

Hamilton relied on the people to pursue their own 
material advantage and to support a government that 
benefits them economically. He did not see the wis- 
dom in encouraging political hyperactivity among the 
citizenry, which only invites demagoguery and civil un- 
rest—as the French example too perfectly illustrated. 
For Madison, the citizens’ political duties were substan- 
tial and ongoing. They did not end at choosing the bet- 
ter sorts of men to represent them; their guardianship 
over public affairs was not an intermittent responsibil- 
ity. Both Hamilton and Madison relied significantly on 
an educated elite to accomplish their ends. However, in 
the one case it was a type of statesmanship that sought 
to inspire respect and confidence more than to teach. 
In the other case it was a kind of civic leadership that 
aspired to cultivate civic understanding, refine mores 
and manners, and educate the people for their indis- 
pensable role in a self-governing republic. 

At least by the time of the election of Jefferson to 
the presidency, Hamilton understood clearly that the 
attachment by leading Republicans to the theory of 
public opinion had had much to do with the rifts and 
party battles of the past decade. He also saw that their 
philosophy translated into a political strategy, and that 
that strategy was winning. The Federalists had lost po- 
litical ground by relying too much on the good effects 
of their administration, all the while the Republicans 
gained ground by appealing directly to the American 
people. Reluctantly, Hamilton reconciled himself to 
the fact that he and his fellow Federalists would also 
have to give much more attention to cultivating public 
opinion. However, he refused to do so in a way that he 
considered humiliating and unworthy of a republican 
statesman, though he did admit that it would be neces- 
sary to countenance some modes of action that “may 
be denominated irregular, such as in a sound & stable 
order of things ought not to exist” (PAH XXV:606). 
Accordingly, he proposed the establishment of a Chris- 
tian Society, whose object was to support the Christian 
religion and the Constitution and to collect a public 
force that could significantly influence the outcome of 
elections. 

Hamilton’s political ally and correspondent, James 
Bayard, cautioned Hamilton against such a measure. 
The type of organization that can accomplish the goal 
Hamilton had in mind must be grounded upon a 
stronger motive in man than reason, or even common 
interest, he argued. Be patient, Bayard counseled, and 
the Republicans will in a short while demonstrate to 
all the country the soundness of Federalist doctrines 
and the imbecility of their own. In free governments 
such as the United States, he continued, there must 
always be a degree of “agitation and vibration of opin- 
ion,” for it is “in the nature of things... impossible to 
fix public opinion” (PAH XXV°:613). Good men would 
do better to exert themselves against the evils of selfish 
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and ambitious demagogues and, otherwise, wait pa- 
tiently for the Republicans to self-destruct. Hamilton 
could only wish his friend were right, but he knew that 
he was not. The advent of the new politics of public 
opinion had forever changed the face and fabric of 
republican government. The unrivaled power of public 
opinion that Tocqueville observed decades later was 
already fast becoming a political reality in America. 
Hamilton continued to resist the new politics but knew 
that his brand of patriotism was of “the old school” 
and that the “disciples of the new creed” had won the 
battle to make public opinion queen of the world (PAH 
XXV:354). 
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Presidentialism and Accountability for the Economy 


in Comparative Perspective 


DAVID SAMUELS University of Minnesota 


o what extent do the institutions of presidentialism allow voters to hold governments accountable? 
l Powell and Whitten (1993) suggested that voter capacity to sanction is strong when “clarity of 
government responsibility” for outcomes is clear, and vice versa. I argue that clarity of responsi- 
bility functions differently under presidentialism and that presidentialism generates particular forms of 
accountability. In general, electoral sanctioning is weak in nonconcurrent elections, which do not occur 
under parliamentarism, but is stronger in concurrent elections. In concurrent executive elections the 
clarity of responsibility does not attenuate the economy’s impact on the vote. Yet in concurrent legislative 
elections both partisan and institutional variables diffuse responsibility for economic performance. Thus 
under many common institutional and partisan formats, voters sanction presidents to a greater degree 
than legislators for the same phenomenon. These findings elucidate the conditions under which we 
might observe accountability similar to what we find in some parliamentary systems or a more uniquely 
presidentialist “dual democratic legitimacies” of the kind Linz (1994) imagined. 


cials to accounts remains a central concern of 

political science. Most research has explored 
established democracies, which all happen to be par- 
liamentary systems, with the exception of the United 
States) Democracy’s spread around the world has 
increased the number of presidential systems, which 
generates an important new set of questions about 
the relationship between economics and elections, and 
thus about well democracy can work: To what extent 
do the institutions of presidentialism allow voters to 
hold the government accountable? To what extent do 
we see party or government accountability in sepa- 
ration of powers systems? That is, do voters reward 
or punish incumbent presidents and legislators, and to 
the same degree? Do particular partisan dynamics or 
institutional formats of presidentialism dim or enhance 
the prospects for accountability? 

Scholars have yet to provide answers to questions 
such as these, although substantial debate persists 
about the presidentialism’s relative merits. Some schol- 
ars disdain presidentialism and suggest that the sepa- 
ration of powers hinders accountability. For example, 
Linz (1990, 1994) argued that presidentialism’s “dual 
democratic legitimacies” confuses voters and that pres- 
idential autonomy inhibits “party government” ac- 
countability.! Others question Linz’s claim and suggest 
that presidentialism provides advantages for account- 


T= degree to which voters can hold elected offi- 
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ability by enhancing identifiability and allowing vot- 
ers to hold presidents and legislators accountable for 
different things, which may be normatively a good 
thing (Perrson, Roland, and Tabellini 1997; Samuels 
and Shugart 2003; Shugart and Carey 1992). 

I seek to discover the ways and extent to which 
we observe accountability for the economy under 
presidentialism. Hundreds of books and articles have 
explored the relationship between economics and elec- 
tions,” and scholars have begun to reveal the ways in 
which political institutions and party-system configu- 
rations mediate this connection.? Yet how accountabil- 
ity might work within a presidential system remains 
largely unexplored, both empirically and theoreti- 
cally.* Can presidential systems live up to their demo- 
cratic promise, or does something about the sep- 
aration of powers inhibit accountability? Exploring 
this question could provide substantial insight into 
ongoing debates about the advantages or disadvan- 
tages of different institutional formats for democratic 
performance. 


ACCOUNTABILITY AND CLARITY 
OF RESPONSIBILITY 


I define accountability as the electorate’s capacity 
to reward or sanction incumbent politicians (Manin, 
Przeworski, and Stokes 1999, 40).° This paper thus 


2 For a recent review of this literature see Lewis-Beck and Stegmeser 
2000. 

3 See, e.g., Anderson (1995, 2000) and Powell and Whitten (1993) 

4 Cheibub and Przeworski (1999) provide a partial exception, al- 
though they define accountability differently, focusing on incum- 
bents’ “survival m office.” 

51I do not include the second clause of Manin, Przeworski, and 
Stokes’s definition, “... so that those mcumbents who act in the best 
interest of citizens win reelection and those who do not lose them.” 
Scholars typically find that economics determimes about one-third of 
the vote (Dorussen and Palmer 2002, 4). Thus asking whether eco- 
nomic swings are sufficient to “kick the bums out” is asking too much. 
We expect a correlation between the economy and the vote, but a 
correlation between the economy and the removal of incumbents 
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follows what Susan Stokes calls the “normal economic 
voting” research program. In this approach, “voters 
use the past performance of the government to predict 
future performance and see the government as respon- 
sible for that performance” (Stokes 2001, 13). Schol- 
ars believe that voters’ capacity to reward or sanction 
elected officials declines when they cannot discern re- 
sponsibility for government performance. Powell and 
Whitten (1993) thus argued that voters are more likely 
to punish or reward incumbents for the state of the 
economy when “clarity of responsibility” for outcomes 
is high. Given the separation of powers under presiden- 
tialism, a pertinent question is therefore whether clar- 
ity of responsibility affects voters’ capacity to reward 
or sanction presidential governments, in both executive 
and legislative elections. 

The theory of retrospective voting (Fiorina 1981; 
Lewis-Beck 1988) suggests that electoral accountabil- 
ity occurs because voters retrospectively judge whether 
governments have acted in their best interests and then 
reward or sanction them appropriately. Yet several fac- 
tors might hinder voters’ ability to sanction elected 
officials. First, voters might not be able to punish in- 
cumbents if no viable alternative exists (Anderson 
2000; Manin et al. 1999). This problem is not limited 
to presidentialism, and in the analysis I include only 
competitive elections. Second, incumbent politicians 
might strategically conceal information about policy 
responsibility. This is also a problem everywhere, and 
we can reasonably assume that voters in presidential 
systems can potentially sanction governments, at least 
as much as they can anywhere else. In any case, it is 
unreasonable to assume that incumbents facing com- 
petitive elections conceal all information; instead, we 
should test to see whether partisan and/or institutional 
factors mediate the connection between economics and 
elections. 

Third, citizens will be able to sanction governments 
only if politicians desire (and are eligible for) reelec- 
tion or care about who succeeds them to office. Manin, 
Przeworski, and Stokes (1999, 48) suggest that limits on 
presidential reelection thus restrict voters’ capacity to 
hold governments accountable. Limits on reelection do 
impede voters from rewarding or punishing a particular 
individual incumbent president (or legislator), but they 
do not impede voters from sanctioning the incumbent’s 
party. In presidential systems, the vote for or against the 
party of a term-limited president or legislator is nec- 
essarily a judgment of both the individual incumbent 
and the incumbent’s party. Manin and coworkers’ sug- 
gestion assumes that incumbent presidents do not care 
about their legacy, have no interest in the continuity 
of their policies, and do not care about their party’s 
success once they have left office. Yet last-term presi- 
dents care about their place in history and, thus, care 
about their performance in office. Moreover, since their 
legacy is profoundly shaped by who succeeds them to 
office, they also care about who that person turns out to 


may not appear. Still, this does not mean that incumbents are not 
ve to voter sanction and do not try to change their policies 
to avoid additional future sanctions. 
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be. This suggests that (to some degree at the least) they 
ought to attempt to convince voters that they share 
a personal and political affinity with their successor- 
candidate. 

Voters can reward a successful president’s party by 
electing his or her party’s successor, or they can pun- 
ish the incumbent’s party by electing a rival party’s 
candidate. There is therefore no reason why voters 
could not hold political parties as a whole and pres- 
idential governments as a whole—even those popu- 
lated by term-limited politicians—accountable in a ret- 
rospective voting fashion. Thus while many problems 
inhere in the accountability relationship between cit- 
izens and governments, these problems are not lim- 
ited to presidential systems, nor should we simply 
assume that presidential governments are necessarily 
worse than other forms of government in terms of 
accountability. 

Aggregate economic voting studies typically assume 
that some voters consider national economic condi- 
tions before the election as part of their vote decision. 
Voters attribute reward or blame to the incumbent gov- 
ernment for those conditions, and as a result the vote 
shifts. Yet electoral accountability declines when voters 
cannot discern responsibility for government policy, 
and voters are more likely to punish or reward the 
incumbent government for the state of the economy 
if clarity of responsibility is high, and not if clarity of 
responsibility is low (Powell and Whitten 1993). 

Two related reasons explain this dynamic. The first 
highlights incumbents’ strategic actions. All incum- 
bent governments tend to lose votes (Nannestad and 
Paldam 2002). Powell and Whitten surmised that in- 
cumbents know this and thus prefer to diffuse politi- 
cal responsibility, for example, by blaming those with 
whom they might have shared control of government. 
A country’s institutional and/or partisan makeup can 
help politicians diffuse responsibility, or it can com- 
plicate such activity. The second factor emphasizes 
voters’ ability or willingness to gather information: 
As institutional or partisan complexity increases, vot- 
ers will have greater difficulty discerning actual re- 
sponsibility for government output, independently of 
whether politicians attempt to diffuse responsibility or 
not. Regardless of which factor weighs more heav- 
ily, lower clarity of responsibility should insulate in- 
cumbents from the factors that cause them to lose 
votes. 


PRESIDENTIALISM AND CLARITY 
OF RESPONSIBILITY 


Perhaps clarity of responsibility and thus voters’ capac- 
ity for sanctioning in presidential systems depends on 
factors that scholars have identified for parliamentary 
systems. Or presidentialism could affect how voters 
hold governments accountable in particular ways. For 


6 Powell and Whitten (1993) argued that incumbent party or coali- 
tion cohesion and the type of committee system are important for 
clarity of responsibility. Information on those variables is unavailable 
for most of the countnes m this study. 
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example, Manin, Przeworski, and Stokes (1999, 47) im- 
ply that presidentialism obscures government respon- 
sibility generally and that coalition and divided gov- 
ermments under presidentialism are particularly bad 
for accountability. In contrast, Persson, Roland, and 
Tabellini (1997) suggest that the separation of powers 
institutionalizes conflicts of interest between branches 
of government and thus encourages relatively greater 
information revelation than parliamentary systems, 
which should aid voters’ efforts to discern policy re- 
sponsibility. 

A few single-country studies support the notion that 
differences exist in terms of voters’ ability and/or will- 
ingness to sanction members of the same party who oc- 
cupy different branches of government, such as Lewis- 
Beck (1997) on France and Erickson (1989, 1990) on 
the United States, but no cross-national research has 
yet confirmed or refuted these findings.’ How do voters 
in presidential systems think about blame and reward? 
Do they focus exclusively and always on the president, 
regardless of who controls the legislature? Are leg- 
islators in some partisan or institutional environments 
more or less likely to feel the brunt of the voters’ wrath? 

Several factors might sharpen or blunt clarity of 
responsibility and thus voters’ propensity to reward 
or punish incumbents in presidential systems. The 
most important factor is unique to presidentialism 
and is institutional: the electoral cycle, or the concur- 
rence or nonconcurrence of elections. Others (see es- 
pecially Shugart 1995) have noted that the electoral 
cycle plays a key role in executive—legislative rela- 
tions. Here I suggest that it also mediates the ac- 
countability relationship between voters and elected 
officials. 

Concurrent elections under presidentialism most re- 
semble elections under parliamentarism. When exec- 
utive and legislative elections are held simultaneously, 
candidates for executive and legislative elections have 
strong incentives to coordinate election campaigns, in 
order to benefit from campaign economies of scale, 
coattail effects, and organizational advantages. That is, 
concurrent elections tend to encourage a team effort, 
whereby politicians employ their national party orga- 
nizations and national partisan messages to get out the 
vote. This associates copartisans from both branches 
of government in voters’ minds and encourages voters 
to hold them accountable for the same things. Conse- 
quently, in concurrent elections voters have relatively 
greater incentives to hold both branches coresponsible 
for the economy, all else equal. 

In contrast, when either executive or legislative 
elections are held nonconcurrently, national policy is- 
sues such as the economy should matter relatively 
less. For legislative elections, following the literature 
on U.S. congressional elections (e.g., Erickson 1990), 


7 Remmer (1991, 2001) is the only scholar who has explored the links 
between economics and elections in a cross section of presidential 
systems However, Remmer was not interested in the impact of po- 
litical mstitutions and explored economic voting in Latin America 

. Moreover, she did not explore the mpact of economics on 
legislative elections 
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nonconcurrence should attenuate the impact of na- 
tional factors on the vote because (all else equal) 
it focuses voters’ and candidates’ energies on local 
factors and candidates’ personal qualities, including 
but not limited to any potential incumbency advan- 
tages due to constituency service. In nonconcurrent 
legislative elections we also tend to observe relatively 
less national party organization involvement and rel- 
atively more influence of local or regional political 
messages. 

A similar logic should hold for executive elections. 
Nonconcurrence frees presidential candidates to cam- 
paign on entirely individualistic bases, meaning that the 
election result will depend relatively more on voters’ 
perceptions of the candidates’ personal qualities and 
relatively less on their partisan connections. That is, 
we have reason to believe “It might not be the econ- 
omy, stupid” in nonconcurrent presidential elections, 
because national policy factors ought to matter rela- 
tively less. 

Another institutional factor that might mediate clar- 
ity of responsibility, in legislative elections, is the elec- 
toral rule. Under presidentialism voters cast separate 
ballots for legislative and executive elections. Unlike 
in parliamentary systems, this offers the opportunity 
for substantially different dynamics to drive executive 
and legislative elections. The primary factor is whether 
the electoral system encourages or discourages local- 
ism and/or individualism, giving legislators the choice 
whether to run “with” or “against” their president 
and/or party (Carey and Shugart 1995; Shugart 1995). 
When the electoral rule for legislative elections encour- 
ages localism, the state of the economy should matter 
relatively less, all else equal. In contrast, when the rules 
nationalize legislative campaigns, the economy should 
matter more clearly and consistently. 

Two other variables that we also find in parliamen- 
tary systems merit discussion. The first is coalition gov- 
ernment. Coalitions tend to obscure any one party’s 
responsibility for government output (Anderson 1995, 
2000). As in parliamentary systems, governing in coali- 
tion might insulate incumbent presidents or legislative 
parties from the factors that might otherwise affect 
vote swings for or against them. A contrast of vote 
swings lends credence to this notion: Presidents who 
governed in coalition were relatively more insulated 
from losses, losing on average 4.68% (N = 21), com- 
pared to average losses of 6.31% (N = 54) for pres- 
idents who did not govern in a coalition. A similar 
result obtains for the legislative elections: Parties in 
coalition lost an average of 0.92% (N = 32), while 
parties that governed alone lost an average of 3.59% 
(N = 71). 

Second, minority government might insulate pres- 
idents and legislative parties from vote swings due to 
economic performance. In unicameral presidential sys- 
tems, minority government occurs when the president’s 
party or coalition does not control more than 50% of 
the seats in the single legislative chamber. In bicameral 
systems, minority government occurs when the pres- 
ident’s party or coalition does not control more than 
50% of the seats in both chambers. When parties divide 
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control over the branches of government, the incum- 
bent president can claim that all parties must share 
responsibility for poor performance. Several studies 
confirm that minority parliamentary governments suf- 
fer smaller losses than majority governments (e.g., 
Powell and Whitten 1993 and Strgm 1990). As with 
coalition governments, minority presidential govern- 
ments also appear to insulate incumbents from losses. 
Presidential parties running a candidate following a 
minority presidency lost on average 5.00% (N = 51), 
compared to average losses of 7.68% (N = 24) for 
candidates following a majority president. However, it 
remains to be seen whether the economy drives these 
differences. 

The factors explored thus far permit us to place 
presidential systems on a continuum. At one end elec- 
tions are nationalized and always concurrent, and at 
the other end legislative elections are localized and 
often not concurrent with the executive election. In 
the real world, Costa Rica might fall on one end of 
the continuum while Colombia fell at the other end, 
and the United States would fall somewhere in the 
middle. The question remains whether such variables 
affect clarity of responsibility for incumbents in either 
branch of government. 


CLARITY OF RESPONSIBILITY AND 
ECONOMIC VOTING: ANALYSIS 


I test the impact of clarity of responsibility and eco- 
nomic conditions on elections in 23 presidential democ- 
racies, defined as those in which both branches of 
government are directly elected for fixed terms and 
in which the head of government is not accountable 
to the legislature (Siaroff 2003).8 I employ aggregate 
economic and electoral data. I included a case if the 
country had had at least two consecutive democratic 
elections—and if political and economic data were 
available. This resulted in a sample of 75 executive 
elections and 103 legislative elections.” 

Scholars of economic voting typically employ one of 
the following three variables as the indicator of eco- 
nomic performance: GDP growth, inflation, or unem- 
ployment. I employ the percentage change in real per 
capita GDP growth (World Bank 2003).'° Following 
Pacek and Radcliff (1995), I use the change in GDP 


8 I include cases from Siaroff’s Category 2 as well as Category 6, 
which “functions essentially like” Category 2. As Elgie (1999, 11) 
notes, regime classification “varies from one writer to another,” but 
the only case on which I differ from Siaroff is Bolivia, which Siaroff 
classifies together with Switzerland. Despite a quirk in Bolivia’s 
constitution, it functions as a pure presidential system. See Inter- 
American Development Bank 2002; personal communication with 
John Carey and Octavio Amorim Neto. 

9 Appendix 1 contains a list of the countries and elections I explore 
differences across regime types, incorporating pure and semupresi- 
dential systems as well as parliamentary systems, in other work. 

10 Unemployment information 1s unavailable for most of the coun- 
tries in the sample. As for mflation, the expectations about the impact 
of mflation are far less clear than for that of GDP growth, supporting 
the use of the latter over the former, because it is unclear whether 
voters respond to the level or direction of inflation. 
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growth in year ¢—1 if the election was held in the 
first six months of the year, and the change in GDP 
growth in year ¢ if the election was held in the last 
six months of the year." I call this variable Economy. 
If the economy matters generally in presidential sys- 
tems, this variable should have a positive and significant 
coefficient. 

If clarity of responsibility diffuses or accentuates 
economic voting effects, we should observe variables 
in multiple regression analysis that substitute for, de- 
crease, or wipe out the impact of the economy on vote 
swings. The logic of the argument suggests that the ef- 
fect of economics on elections is interactive with clarity 
of responsibility variables (Powell and Whitten 1993). 
Thus if the economy matters in general, we expect a 
positive and significant coefficient on Economy (all 
else equal), and if another variable blurs the impact 
of economic fluctuations on the vote, we would expect 
a negative and significant coefficient on the interacted 
term Economy * OtherVariable when that variable is 
added to the equation. The impact of the economy 
on the vote would then be the sum of the coefficient 
on Economy plus the coefficient on the interacted 
variable. 

As per the discussion in the previous section, I ex- 
plore the impact of the following independent vari- 
ables: Concurrence, Electoral Rules (for legislative 
elections only), Coalitions, and Minority President, all 
operationalized as dummy variables. I enter each vari- 
able directly and in an interacted term (with Economy). 
The dependent variable in all regressions is the abso- 
lute percentage change in vote for the incumbent pres- 
ident’s party from the previous election. In the models 
for executive elections, the dependent variable is the 
vote change in the executive election, and in the models 
for legislative elections, the dependent variable is the 
vote change in the legislative election. In all models, I 
employ ordinary least squares regression with robust 
standard errors. 

In all regressions I also included two control vari- 
ables: the incumbent party’s proportion of the vote 
in the previous election (Previous Vote), which should 
have a negative impact because parties tend to lose 
votes after being in office; and a dummy variable 
indicating whether or not the incumbent president 
is running for reelection (Reelecton). This variable 
should return a positive coefficient because incumbents 
running for reelection have advantages in terms of 
name recognition and organization relative to other 


11 Quarterly figures are unavailable for most countries in the sam- 
ple Other operationalizations could provide more nuance, but data 
limitations restrict my ability to conduct such tests. For example, 
because the time series is relatively short for most of these countries, 
I cannot explore the impact of “shocks” relative to long-term trends 
(cf. Patmer and Whitten 1999). 

12 I do not employ models with country dummies because of the 
numerous cross sections relative to the overall sample size in the 
data set (with 23 cross sections I have an average of three executive 
elections and four legislative elections per country) I ran a series of 
regressions with country dummies, and although same of the dum- 
mies were significant, their inclusion did not substantially alter the 
results. 
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TABLE 1. 
Models 
Independent Varlable 
Previous Vote 


Executive Elections, Baseline 


Concurrent 


(177) 
1.1727** 
(.311) 
8.680** 
(2.762) 
12.906 
(8.682) 
R? .357 
N 61 
Prob >F .000 
Note: ™ p = .01, ** p = .001 (one-talled test). 


Economy 
Reelection 
Constant 


candidates. Consequently, they (or their party, in the 
case of legislative elections, due to potential coattail 
effects) ought to receive a higher vote percentage, all 
else equal. 

Given my hypothesis about the importance of the 
electoral cycle, I first separate out concurrent from 
nonconcurrent elections.” Table 1 provides results for 
a comparison of the effects of the economy on the 
vote in both concurrent and nonconcurrent executive 
elections. 

The first model tests for the impact of the Economy, 
Previous Vote, and Reelection in concurrent executive 
elections. The results conform to my expectations: The 
vote in the current period is correlated with the vote in 
the previous period, presidents running for reelection 
can expect a higher vote share than candidates running 
for their first term, and the state of the economy is 
strongly correlated with the vote for the incumbent 
party. These results are robust to the exclusion of any 
particular case. 

The results for nonconcurrent presidential elections 
also conform to my expectations. In contrast to the 
results for concurrent elections, neither the previous 
vote nor the state of the economy affects the vote swing 
for the incumbent party’s candidate. This result may 
sound foreign to scholars and residents of the United 
States, where presidential elections are always concur- 
rent with legislative elections, and I acknowledge that 
the results rest on relatively few cases. However, they 
are fairly robust: Note that the standard error on Econ- 
omy is three times the size of the coefficient. Future 
work should return to this issue as more data become 
available. 


3 Analysis of a correlation matrix for executive elections revealed 
that the Economy * Concurrence variable was correlated at .91 with 
the Economy variable (no other variables were correlated at higher 
than .79). Correlation at this level is problematic and makes the 
inclusion of both variables m one model unhelpful. I also attempted 
to test for the impact of concurrence using a Proximity variable, 
which measures the proportion of the presidential (legrslatrve) term 
that has elapsed relative to the legislative (presidential) term. Yet this 
variable, when interacted with the Economy variable, was similarly 
highly correlated with the Economy variable, at .92. 





Table 1 reveals that voters only sanction presidents 
and their parties’ candidate when executive and legisla- 
tive elections are concurrent. I suggested above that in 
nonconcurrent elections voters may focus on the candi- 
dates as individuals and not as members of a party run- 
ning candidates for a set of offices. As a result they may 
not connect either the performance of the incumbent 
president or the performance of the economy to their 
vote. For those who believe that executive elections 
should flow from the state of the national economy, the 
results for nonconcurrent elections are, at a minimum, 
discouraging, even if they are not conclusive. 

A key question remains about the potential im- 
pact of incumbent presidents who are running for 
reelection. Cheibub and Przeworski (1999) and Manin, 
Przeworski, and Stokes (1999) suggest that voters can- 
not engage in retrospective economic voting when they 
literally cannot hold the person who has been responsi- 
ble for managing the economy over the past few years 
(in theory at least) to accounts. Does incumbency en- 
hance accountability for the economy, and does the 
absence of an incumbent running for reelection weaken 
accountability? 

In 11 of the 61 concurrent elections an incumbent 
president ran for reelection. (No incumbents ran for re- 
election in nonconcurrent executive elections.) Given 
the results for concurrent elections in Table 1, we know 
that incumbency does matter on its own. In Table 2, 
I interact the Economy and Reelection variables to 
discover whether incumbency has both an independent 
and a joint effect. If incumbency drives the accountabil- 
ity relationship, this variable should return a positive 
and significant coefficient. In addition, the Economy 
variable should lose significance, if the hypothesis that 
the absence of an incumbent running for reelection 
attenuates accountability is correct. 

The results do not support the extension of Cheibub 
and Przeworski’s and Manin and coworkers’ argument. 
Incumbency does not make for a tighter accountability 
relationship, at least for the economy. Voters hold in- 
cumbent parties to accounts, even in executive elections 
and even if the incumbent is not running for reelection, 





TABLE 2. Executive Elections, 
Testing for Impact of Incumbents 


Independent Vanable Concurrent 







Previous Vote —.476** 
(.180) 

Economy 1.156*** 
(.333) 





Reelection 8.125** 








(2.968) 
Reelection + Economy 266 
(.848) 
Constant 12.870 
(8.803) 
R? .358 
N 61 
Prob >F 000 





Note: ** p = .01, *** p = .001 (one-talled test). 
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TABLE 3. Concurrent Executive Elections 








Independent Vanable Model 1 Model 2 

Previous Vote —.545** —.555** 
(.194) (.184) 

Economy 1.277""* 1.088** 
(251) (.350) 

Reelection 9.577** 8.753"* 
(3.080) (3.003) 


—3.555 — 











Mmortty President 






Min Pres « Economy —.168 — 
(.558) i 
Coalltion — —3.601 
(3.844) 
Coal x Economy — .227 
(.808) 
Constant 18.204" 17.382* 
(10.149) (9.310) 
R? .373 .367 
N 61 61 
Prob >F 000 000 














Note *p = .05, ** p = .01, *** p = .001 (one-tailed test). 


as long as elections are concurrent. Linking partisan 
ties across branches of government, rather than the 
presence or absence of an individual candidate, is rel- 
atively more important for overall accountability in 
presidential systems. 

The next question is whether these results change 
when we add any of the other clarity of responsibility 
variables. Table 3 presents several models for concur- 
rent executive elections, adding in each variable and its 
interaction term.!* For nonconcurrent elections, I have 
too few cases to conduct similar and reliable analysis— 
the results would be dependent on the influence of 
the existence of one case or another of (for example) 
minority coalition government. 

Model 1 tests for any change when the president’s 
party does not control a majority of the seats in the 
legislature, independently of whether the president 
ultimately obtains a majority through coalition gov- 
ernment. This specifically tests whether voters dis- 
tinguish situations when the president’s team does 
not unilaterally comprise a legislative majority rela- 
tive to all other situations. This situation arose in 41 
of the 61 concurrent cases. The null cases are thus 
those in which the president’s party controls more 
than 50% of the seats. This hypothesis suggests that 
shared responsibility for government output might 
dampen voters’ propensity to hold the incumbent pres- 
ident’s party’s candidate responsible for the state of 
the economy. Thus, the coefficient on the interacted 
variable ought to be negative, offsetting the positive 
expected coefficient on Economy. The results indicate 
that although Previous Vote, Economy, and Reelec- 


14 In this and the subsequent series of regressions, I could not run 
fully specified models because adding in the interacted vanables 
results in problematic degrees of mult:collinearity (among the inter- 
acted variables). 
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tion all continue to be strongly correlated with the 
vote, the relationship between the economy and the 
vote does not change for incumbent-party presidential 
candidates whose parties do not control a legislative 
majority. 

Model 2 tests for any change when the president 
governs in coalition. This hypothesis tests whether vot- 
ers distinguish situations when the president agrees 
to share power from situations where the president 
attempts to govern without other parties’ assistance, 
regardless of whether that coalition is majority or mi- 
nority. This situation arose in 15 of the 61 concurrent 
cases. The null cases here include both single-party mi- 
nority and single-party majority presidents. As with 
Minority President, the introduction of this variable 
does not alter the results. 

The results in Tables 1 through 3 reveal that in ex- 
ecutive elections the only variable affecting the clarity 
of responsibility and thus the relationship between the 
economy and the vote is whether the presidential and 
legislative elections are held concurrently. When this 
occurs, the state of the economy strongly shapes pres- 
idential election results: In each model, an increase 
of approximately 1% in per capita GDP prior to the 
election results in an increase of a bit more than 1% 
in the incumbent party’s vote. The introduction of par- 
tisan clarity of responsibility variables does not affect 
this relationship—that is, voter reward or sanction is 
not shaped by the presence of minority government or 
coalition government. 

It is perhaps paradoxical that nonconcurrent ex- 
ecutive elections do not clearly enhance clarity of 
responsibility for the economy. Nonconcurrent pres- 
idential elections certainly concentrate attention on 
the presidential campaign. Yet while we might sup- 
pose that nonconcurrent elections would concentrate 
voters’ goodwill or ire about the economy onto the 
presidential race, the results here suggest that “solo” 
presidential elections tend to eliminate the connection 
between the state of the economy and the vote. The 
findings here, although based on relatively few cases, 
tend to qualify the notion that voters always hold 
presidents to accounts for the state of the economy: 
“It’s the economy, stupid” only holds clearly when 
executive and legislative elections are held concur- 
rently. Nonconcurrent presidential elections may turn 
on something else—candidates’ personal characteris- 
tics or other policy outputs. This issue merits further 
exploration. 

I now turn to legislative elections. To parallel the 
analysis for executive elections, Table 4 displays the 


15 With the addition of more cases and/or countnes, I could test 
for differences both across and withm groups of countnes (ic, 
countries that only have concurrent elections, countries that have 
both concurrent and nonconcurrent elections, and countries that 
have only nonconcurrent elections), to further explore the impact 
of concurrence and incumbency. However, in this sample, only four 
countries have executive elections that are sometimes concurrent 
and sometimes nonconcurrent (N = 12 total, 6 nonconcurrent), and 
only two countries’ presidential elections are always nonconcurreant 
(N = 8). 
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TABLE 4. Leglslative Elections: Baseline 
Models 
Independent Varlable 
Previous Vote 


Concurrent 


Nonconcurrent 
—.186* 


baseline results for concurrent and nonconcurrent elec- 
tions. 

The results again support my argument. In concur- 
rent legislative elections both the economy and the 
party’s previous vote are significantly correlated with 
the vote in the current election. In contrast, Previous 
Vote matters less and Economy is not significant in 
nonconcurrent elections, although it is in the predicted 
direction.!6 This implies (but does not confirm, since I 
have no variable testing for “Local Conditions” in both 
models) that local conditions matter relatively more in 
nonconcurrent elections and that the holding of both 
executive and legislative elections simultaneously em- 
phasizes national policy matters in voters’ minds. 

As with executive elections, the next question is 
whether these results change when we add clarity of 
responsibility variables. I explore concurrent elections 
first, in Table 5. 

In Model 1, I test for any change when the president’s 
party does not control a majority of the seats (Minority 
President). This hypothesis is the same as in Table 3. 
This situation arose in 42 cases, and the null cases are 
thus those in which the president’s party controls more 
than 50% of the seats. The hypothesis suggests that 
shared responsibility for government output dampens 
voters’ propensity to hold the incumbent president’s 
party responsible for the state of the economy im leg- 
islative elections. Thus, the coefficient on the interacted 
variable ought to be negative, offsetting the expected 
positive coefficient on Economy. 

The results indicate that the Previous Vote, Econ- 
omy, and Reelection variables remain strongly corre- 
lated with the vote and that the interaction of Minority 
President with the Economy variable is in the predicted 
direction and is significant. To illustrate the finding, 
suppose that the economy increased by 1%. If the 
president’s party did not control a majority, then the 
president’s party would gain 1.211 — 0.562 = 0.649%, 


16 For concurrent elections, if we exclude the 1990 Dominican 
Republic election, the Reelection variable becomes statistically ng- 
nificant. No other case affects the results. For the nonconcurrent 
elections, if we exclude the 1994 Ecnador election, the coefficient on 
Previous Vote becomes significant. No other case alters the results. 








TABLE 5. Concurrent Legislative Elections 
Independent 








Vanable Model 1 Model 2 Model 3 
Previous Vote —.285** —.346""* —.295*** 
(.091) (.081) (.087) 
Economy 1.211*** .952""" .778"* 
(.199) (.189) (.258) 
Reelection 7.772 4.881 4.913 
(3.789) (3.609) 






Minority President 










Min Pres « 


Economy 
Coalition 







Coal « Economy 






Electoral Rules 







Rules » Economy 








Constant 





R? : : ‘ 
N 61 61 61 
Prob >F .000 .000 .008 
Note * p = .05, ** p = 01, *** p = .001 (one-talled test). 






about half of what it would gain if the president’s party 
controlled a majority. This supports the clarity of re- 
sponsibility hypothesis that voters reward or punish 
the president’s party relatively less when that party 
does not control a legislative majority.!” 

Model 2 tests for any effect in legislative elections 
when the president governs in coalition (Coalition), 
regardless of whether the coalition is majority or mi- 
nority. This situation arose in 18 elections. Similar 
to Model 1, this hypothesis suggests that shared re- 
sponsibility for government output dampens voters’ 
propensity to hold the incumbent president’s party 
responsible for the state of the economy. Thus, the 
coefficient on the interacted variable again ought to 
be negative, offsetting the positive expected coefficient 
on Economy. The result confirms the hypothesis. When 
the president governs in coalition, the economy still 
affects the vote, but if the economy increased by 1% 
and the president governed in coalition, the president’s 
party would only gain 0.952 — 0.807 = 0.147%, even 
less than if the president lacks a majority legislative 
contingent.!8 


17 This model excludes the 1995 election ın Argentma as an outlier. 
Exclusion of no other case affects the results. When this case is m- 
cluded, the Previous Vote and Economy variables remain significant, 
while the Reelection variable is not significant, as ıs the Minority 
President « Economy variable, although it 1 still in the predicted 
direction. Excluding no other case alters the results. 

18 This model excludes the 1999 Panamanian election as an outlier. 
When this case is included, the Previous Vote and Economy variables 
Temain significant, while the Reelection vanable is not significant, as 
1s the Coalition + Economy variable. The exclusion of one other case 
affects the results, in a similar way: the 1998 Venezuelan election. 
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Finally, Model 3 tests for the impact of the Electoral 
Rules. Electoral systems that encourage localism or 
personalistic campaigning may tend to obscure the clar- 
ity of responsibility for the state of the national econ- 
omy, while “nationalizing” electoral systems might 
maintain the clarity of responsibility for the economy. 
I created a dummy variable designed to test whether 
the most extremely “localizing” and/or “personalizing” 
electoral systems differ in a substantial way from other 
electoral systems. These include the systems of Brazil, 
pre-1973 Chile, Colombia, Cyprus, Peru, Taiwan, and 
Uruguay (Samuels and Shugart 2003).!9 For concurrent 
elections, only six cases took a value of one. As with 
the other variables, we expect the coefficient on the 
interacted variable to be negative. 

The results again reveal the robustness of the first 
two variables in the regression to the introduction of 
this variable. The interacted Rules x Economy variable 
is also significant, indicating that localizing electoral 
rules diffuse clarity of responsibility for the economy. 
However, I should acknowledge that this result mer- 
its additional exploration, because this model excludes 
one of the six “localizing” cases, the 1990 Peruvian elec- 
tion, as an outlier. When this case is included, there is no 
relationship between this variable and the dependent 
variable. However, the other variables in the regression 
do not change. In short, although this test suggests that 
localizing electoral rules affect clarity of responsibility 
in concurrent elections, the result is inconclusive. 

In sum, for concurrent legislative elections, I obtain 
robust results that the economy is related to voters’ 
propensity to sanction incumbents and that the party’s 
vote in the previous election is related to its vote in the 
current election. In addition, partisan variables such 
as coalition and minority government diffuse the re- 
lationship between the economy and the vote. These 
findings, combined with the findings in Tables 1 through 
3, suggest that in countries with a single-party majority 
legislative contingent and concurrent elections, voters 
reward or sanction executives and legislators for eco- 
nomic performance similarly. However, party-system 
fragmentation, minority government, and potentially 
the electoral rules can mediate this relationship and 
obscure the clarity of responsibility for economic out- 
comes in legislative elections. 

I now conduct similar tests for the nonconcurrent 
legislative elections. The results appear in Table 6. Re- 
call that I hypothesized that in general, the account- 
ability relationship between economics and elections 
should weaken in nonconcurrent legislative elections, 
because national political factors matter relatively less 
in nonconcurrent elections. All else equal, nonconcur- 
rent elections focus voters’ and candidates’ attention 
on local factors and candidates’ personal qualities and 
constituency service. Moreover, national party organi- 
zations play less of a role in nonconcurrent legislative 


19 My operationalization collapses Samuels and Shugart’s (2003) tri- 
partite coding, because I found that no difference exists between the 
cases they coded with a value of one and those they coded with two 
but that a difference exists between the cases they coded as zero and 
all the other cases. 
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TABLE 6. Nonconcurrent Legislative 
Elections 


Model 1 
—.178 
(.115) 
Economy .260 .603* 
(.329) (.277) 
Minonty President .096 — 
(2.386) 


Model 2 Model 3 
—.189* 


(.081) 


Previous Vote 


Min Pres « Economy 
Coalition 


Coal x Economy 


Electoral Rules 


Rules x Economy 





elections, whereas local or regional factors may weigh 
more. This does not mean that national factors are (or 
should be) irrelevant in nonconcurrent elections, it is 
only a statement of their relative importance. Specif- 
ically, the hypothesis suggests that the coefficients on 
all of the variables in the regressions should be weakly 
significant or not significant. For example, the perfor- 
mance of the national party in the previous election 
should matter less if the current election is noncon- 
current, weakening the coefficient on Previous Vote. 
Likewise, the Economy ought to matter less or not at 
all. Moreover, none of the interacted variables should 
be significant, because we don’t expect voters to take 
national partisan factors into consideration in noncon- 
current elections. 

Let me first comment on the results for the Pre- 
vious Vote variable. Recall that in concurrent elec- 
tions, this variable was always significant, and in the 
baseline model for nonconcurrent elections (Table 4), 
it remained significant, although the coefficient was 
much weaker than that for concurrent elections. Here, 
the Previous Vote variable reaches significance in two 
of three models and its coefficient remains smaller 
than for concurrent elections, a plausible finding given 
our intuition that nonconcurrent elections tend to 
concentrate campaigns and voters’ minds on local or 
candidate-based factors rather than national partisan 
factors. 

Next I explore the impact of the Economy variable. 
In concurrent elections this variable was robust to all 
alternative specifications. Here, Economy is significant 
in only one model, and it is always smaller in non- 
concurrent elections than in any of the models for 
concurrent elections. These results thus again conform 
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to the expectations generated by Table 4, which con- 
trasted concurrent and nonconcurrent legislative elec- 
tions and found that the relationship between the 
economy and the vote was significant in both types 
but both weaker and less robust in nonconcurrent 
elections. 

Finally, I examine the impact of the other clarity of 
responsibility variables in Table 6. Recall that I argued 
that nonconcurrence tends to reduce the impact of 
national party organization and campaign messages. 
Thus the influence of all national-level partisan factors 
ought to be relatively diminished in nonconcurrent 
elections. The results are clear: In contrast to the re- 
sults in Table 5, none of the interacted variables are 
significant in Table 6, indicating that no other clarity 
of responsibility variables mediate the impact of the 
national economy beyond the electoral cycle.” 

To sum up, the results for legislative elections in 
Tables 4—6 indicate that the economy matters in 
both concurrent and nonconcurrent legislative elec- 
tions, although it has a greater impact and matters 
more clearly and consistently in concurrent elections. 
Moreover, the results indicate that partisan and in- 
stitutional clarity of responsibility affects the rela- 
tionship between the economy and the vote in con- 
current elections but that the evidence for such a 
connection is much weaker in nonconcurrent leg- 
islative elections. In addition, when we compare the 
impact of the economy in executive and legisla- 
tive elections (compare against Tables 1 and 3, e.g.) 
we can infer that voters reward or sanction presidents 
more than legislators for economic performance.?! 

These results leave us with the following: In concur- 
rent executive elections, the economy affects the vote, 
regardless of the clarity of responsibility. Yet in noncon- 
current executive elections, the economy never mat- 
ters. In concurrent legislative elections, the economy 
affects the vote, but partisan and institutional variables 
mediate that connection. The economy also affects the 
vote in nonconcurrent legislative elections, but rela- 
tively less so than in concurrent legislative elections, 
yet neither partisan nor institutional variables mediate 
that link. These findings tend to support my argument 
about the particular way in which the electoral cycle 
affects clarity of responsibility and thus accountability 
in presidential systems. 


CONCLUSION 


This paper elucidates the conditions under which elec- 
toral sanctioning is more or less likely to occur under 


The negative and significant result on Electoral Rules indicates 
that candidates from the president's party obtain a higher percentage 
of the vote when the electoral niles “localize” competition. This 
occurs because localizing electoral rules allows candidates to insulate 
themselves from national trends and base their campaigns on the 
“personal vote” and the incumbency advantage 

Regression analysis that pooled all elections, however, did not 
statistically distinguish between the effect of the economy in execu- 
tive and that in legislative elections, although the sign of the dummy 
variable distinguishing executive from legislative elections was in the 
predicted direction. 


Vol. 98, No. 3 


presidentialism. The results confirm neither the most 
optimistic views, expressed most clearly (if only at 
an abstract level) by Persson, Roland, and Tabellini 
(1997), nor the most pessimistic, perhaps expressed 
by Linz (1990, 1994). In many ways the findings echo 
Shugart and Carey’s (1992) suggestion that the sep- 
aration of powers can engender different forms of 
representation and accountability than parliamentary 
systems. 

Clarity of responsibility and thus voters’ capacity to 
sanction elected officials work in particular ways under 
presidentialism. When elections are nonconcurrent, 
a situation that cannot occur under parliamentarism, 
sanctioning for the state of the economy is weak. In ex- 
ecutive elections this lack of a connection between the 
economy and the vote came somewhat as a surprise, but 
the relatively weak relationship in legislative elections 
followed theoretical expectations and research findings 
on U.S. elections. Because about one in five executive 
elections and two in five legislative elections around the 
world are nonconcurrent, these results support some 
observers’ concerns about overall accountability under 
presidentialism. 

In contrast, concurrence enhances voters’ capacity 
to reward or sanction government officials for the state 
of the economy in presidential systems. These results 
qualify Linz’s critique of presidentialism based on his 
notion of dual democratic legitimacies. Despite the 
formal separation of powers, institutions that promote 
close electoral linkage between the executive and 
legislative branches can generate “unified democratic 
legitimacy.” Ceteris paribus, when elections are con- 
current voters treat the incumbent executive and his or 
her legislative supporters as a team and judge them as 
such. 

However, the all-else-equal clause is essential, be- 
cause in concurrent elections legislative fragmentation 
and “localizing” electoral rules can affect the degree 
to which voters hold both branches accountable for 
the economy. This implies the following: Under single- 
party majority government with concurrent elections, 
voters in presidential and parliamentary systems will 
not differ much in their capacity to sanction or reward 
governments. However, even under concurrent elec- 
tions, when we see multiparty coalitions (for exam- 
ple), clarity of responsibility and thus voters’ ability to 
sanction incumbents function differently under pres- 
identialism and parliamentarism. Under parliamen- 
tarism, as Anderson (1995, 2000), Powell and Whitten 
(1993), and others have argued, partisan and institu- 
tional variables tend to diffuse clarity of responsibility. 
Under presidentialism, the same phenomenon occurs 
in concurrent legislative elections but not in executive 
elections. This separates the degree to which voters 
sanction executives and legislators and means that 
executives in many presidential systems are subject to 
relatively greater sanctioning or rewards than legisla- 
tors, a situation that cannot by definition arise under 
parliamentarism. 

Presidentialism generates distinct forms of account- 
ability. The electoral cycle most clearly affects this dy- 
namic, but electoral rules and party system features 
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also mediate voters’ ability to sanction incumbents. 
Under a limited set of conditions presidentialism can 
approximate parliamentarism, and we would expect 
to see a similar form of government accountability 
in both types of systems. But the collective, “respon- 
sible parties,” policy-oriented sense of accountability 
that is rooted in normative political theory can oc- 
cur under presidentialism only under these limited 
conditions. When such conditions are not met, and 
when the electoral and party systems separate the 
bases for selecting members of each branch, we have 
a more clearly distinct “presidentialist” dynamic. For 
example, even when elections are concurrent, vot- 
ers are often likely to sanction incumbents in both 
branches of government to different degrees, depend- 
ing on institutional and party-system factors. And 
when elections are nonconcurrent, national factors 
matter relatively little in both executive and legislative 
elections. 

For legislative elections at least, Shugart and Carey 
(1992) suggested that nonconcurrence and/or localiz- 
ing electoral rules can be a normatively good thing 
because there is nothing prima facie wrong with the 
idea of giving legislators an opportunity to campaign 
on their constituency service and voters an opportu- 
nity to base their vote on local rather than national 
concerns. In this way the institutional “separation of 
purpose” under presidentialism (Samuels and Shugart 
2003) can enhance representation and accountability 


APPENDIX 1: CASES 


Country 
Argentina 
Benin 
Bolivia 
Brazi! 
Chlle 
Colombla 
Costa Rica 


1995L, 1999L 


Cyprus 
Dominican Republic 
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in ways that parliamentary elections cannot. Thus al- 
though the state of the national economy matters rela- 
tively little in nonconcurrent legislative elections, and 
national clarity of responsibility variables do not af- 
fect this relationship, local variables may systemati- 
cally affect the vote and thus promote accountability 
(for example, whether voters can recall the incum- 
bent’s name or the extent to which they credit 
the incumbent with providing constituency service). 
The degree to which presidentialism allows both na- 
tional and local accountability simultaneously merits 
exploration. 

Yet if one disputes Shugart and Carey’s suggestion 
that separating the bases for electing the executive and 
legislative branches can be a normatively good thing 
for democracy because it permits different forms of ac- 
countability, and instead holds to the Linzian view that 
both branches should generally be held accountable 
for the same things, then the results here reinforce the 
conclusion that the core problem of for government ac- 
countability under presidentialism is the potential lack 
of electoral linkage between the president and his or 
her legislative contingent. Party-system and other insti- 
tutional factors shape the strength of this linkage, so 
the findings in this paper thus reinforce the critical 
role that institutions and party-system configurations 
can play in shaping democratic performance. Future 
research ought to explore these differences in more 
detail. 





Elections* 


1983PL, 1985L, 1987L, 1989PL, 1991L, 1993L, 1995PL, 1997L, 1999PL 


1985PL, 1989PL, 1993PL, 1997PL 

1989P, 1990L, 1994PL, 1998PL 

1958P, 1961L, 1964P, 1965L, 1969L, 1970P, 1973L, 1989PL, 1993PL, 1997L, 1999P 
1974PL, 1978PL, 1982PL, 1986PL, 1990PL, 1991L 1994PL, 1998PL 

1962PL, 1966PL, 1970PL, 1974PL, 1978PL, 1982PL, 1986PL, 1990PL, 1994PL, 1998PL 
1981L, 1985L, 1991L, 1996L, 2001L 

1986PL, 1990PL, 1994PL, 1996P, 1998L 

1979L, 1984PL, 1986L, 1988PL, 1990L, 1992PL, 1994L, 1996PL, 1998L, 2000P 


1962L, 1964PL, 1966L, 1968PL, 1970L, 1972PL, 1974L, 1976PL, 1978L, 1980PL, 1982L, 


1984PL, 1986L, 1988PL, 1990L, 1982PL, 1994L, 1996PL, 1988L, 2000PL 


Ecuador 

El Salvador 1989PL, 1991L, 1994PL, 1997L, 1999P, 2000L 
Guatemala 1991PL, 1995PL, 1989PL 

Honduras 1981PL, 1985PL, 1989PL, 1993PL, 1997PL 

Korea 1987P, 1988L, 1982PL, 1996L, 1997P, 2000L 
Malawi 1994PL, 1999PL 

Mexico 1994PL, 1997L, 2000PL 

Nicaragua 1991PL, 1996PL 

Panama 1994PL, 1999PL 

Paraguay 1989PL, 1993PL, 1988PL 

Peru 1985PL, 1890PL 

United States 

Uruguay 1966PL, 1971PL, 1984PL, 1989PL, 1994PL, 1999PL 
Venezuela 1963PL, 1968PL, 1973PL, 1978PL, 1983PL, 1988P 


1993PL, 1998PL, 2000PL 


Note: More cages are listed here than are included In the regressions. None of the first elections in any time senes are included because 
there Is no “t— 1" information on the “Previous Vote” vanable for the first. Thus the total number of cases Included m the regressions equals 
the total number of cases listed here minus the first cases in every country’s genes. 


*P, presidential; L, legrstatrve. 
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APPENDIX 2: ELECTORAL DATA SOURCES 


For Argentina, Bolivia, Brazil, Chile, Costa Rica, the Domini- 
can Republic, Ecuador, El Salvador, Guatemala, Honduras, 
Mexico, Nicaragua, Panama, Paraguay, Peru, Uruguay, and 
Venezuela, the sources were as follows. 


© Inter-American Development Bank. 2002. Democracies 
in Development. Washington, DC: IDB. 

e Political Database of the Americas. 2003. “Statisti- 
cal Data: Electoral Resulta.” Various dates. http://www. 
georgetown.edu/pdba/. Retrieved November 2003. 

© Magar, Eric, and Kevin J. Middlebrook. 2000. “Statisti- 
cal Appendix: National Election Results, 19803 and 1990s, 
for Argentina, Brazil, Chile, Colombia, El Salvador, Peru, 
and Venezuela.” In Conservative Parties, the Right, and 
Democracy in Latin America, ed. Kevin Middlebrook. 
Baltimore: Johns Hopkins University Press. 


For particular countries, I also used the following. 


© Benin: personal correspondence, Professor Shaheen 
Mozaffar, Bridgewater State University. 

e Bolivia: Gamarra, Eduardo, and James Malloy. 1995. “The 
Patrimonial Dynamics of Party Politics in Bolivia.” In 
Building Democratic Institutions: Party Systems in Latin 
America, ed. Scott Mainwaring and Timothy R. Scully, 
Stanford, CA: Stanford University Press. 

© Brazil: Jairo Nicolau. 2003. “Dados Eleitorais do Brasil.” 
www.iuperj.br/deb/port/. Retrieved November 2003. 

e Chile: (a) Chilean government Web site, www.elecciones. 
gov.cl. Retrieved November 2003. (b) Cruz-Coke, 
Bernardo. 1984. Historia Electoral de Chile, 1925—1973. 
Santiago: Editorial Judicial de Chile. 

© Costa Rica: personal correspondence, Michelle Taylor- 
Robinson, Texas A&M University. 

© Cyprus: Nordsieck, Wolfram. 2003. “Parties and Elec- 
tions in Europe.” www.parties-and-elections.de/indexe. 
html. Retrieved November 2003. 

© Ecuador: Mejfa-Acosta, Andrés. 2000. “Weak Coali- 
tions and Policy Making in the Ecuadorian Congress 
(1979—-1996).” Presented at the 2000 meeting of the Latin 
American Studies Association, Chicago. 

© Korea: (a) Keesings Contemporary Archives (various). 
(b) Foreign Broadcast Information Service. 1992 “Daily 
Report, East Asia.” March 26, 1992, pp. 22—23. (c) Moriss, 
Peter. 1996. “Electoral Politics in South Korea.” Electoral 
Studies 15 (December): 550—62. (d) Kang, W. T., and H. 
Jaung. 1999, “The 1997 Election in Korea.” Electoral Stud- 
les 18 (December): 599—608. 

e Malawi: (a) Elections Around the World. 2003. 
www.clectionworld.org. Retrieved November 2003. 
(b) Nohlen, Dieter, Michael Krennerich, and Bernhard 
Thibaut, ed. 1999. Elections in Africa: A Data Handbook. 
Oxford: Oxford University Press. 

e Panama: personal correspondence, Carlos Guevara- 
Mann, University of Notre Dame. 

© Peru: Tuesta Soldevilla, Fernando. 2001. Perú Político en 
Cifras: 1821—2001. Lima: Fundación Friedrich Ebert. 

e Uruguay: No author. 2000. Elecciones 1999/2000. 
Montevideo: Ediciones de la Banda Oriental. 

e United States: Ornstein, Norman, et al. Vital Statistics on 
Congress, 2001—2002. Washington, DC: American Enter- 
prise Institute. 

e Venezuela: Kornblith, Miriam, and Daniel H. Levine. 
1995. “Venezuela: The Life and Times of the Party Sys- 
tem.” In Building Democratic Institutions: Party Systems 
in Latin America, ed. Scott Mainwaring and Timothy R. 
Scully. Stanford, CA: Stanford University Press. 


REFERENCES 


Anderson, Christopher. 1995. “The Dynamics of Pubhe Support 
for Coalition Governments.” Comparative Political Studies 28 
(Summer): 350-83. 

Anderson, Christopher. 2000. “Economic Voting and Political Con- 
text: A Comparative Perspective.” Electoral Studies 19 (June): 
151-70. 

Carey, John M., and Matthew S. Shugart. 1995. “Incentives to Cul- 
tivate a Personal Vote: A Rank-Ordering of Electoral Formu- 
las” Electoral Studies 14 (December): 417-39. 

Cheibub, José Antonio, and Adam Przeworski. 1999. “Democ- 
racy, Elections, and Accountabihty for Economic Outcomes.” 
In Democracy, Accountabihty, and Representation, ed. Adam 
Przeworski, Susan Stokes, and Bernard Manin. New York: 
Cambridge Univernty Press, 222—50. 

Dorussen, Han, and Harvey Palmer. 2002. “The Context of Eco- 
nomic Voting: An Introduction.” In Economic Voting, ed. Han 
Dorussen and Micheell Taylor. London and New York: Routledge, 
1-14. 

Elgie, Robert. 1999. “The Politics of Semi-presidentialism.” In Semi- 
Presidentialism in Europe, ed. Robert Elgie. Oxford: Oxford Uni- 
versity Press, 1—21. 

Enkson, Robert S. 1989. “Economic Conditions and the Presidential 
Vote.” American Political Science Review 83 (June): 567-73. 

Erikson, Robert S. 1990. “Economic Conditions and the Congres- 
sional Vote: A Review of the Macrolevel Evidence.” American 
Journal of Political Science 34 (May): 373-99 

Fiorina, Morris, 1981. Retrospective Voting in American National 
Elections. New Haven, CT. Yale University Press. 

Inter-American Development Bank. 2002. Democracies in Develop- 
ment. W DC: IDB. 

Lewis-Beck, Michael. 1988. Economics and Elections: The Major 
Western Democracies. Ann Arbor: University of Michigan Press. 

Lewis-Beck, Michael. 1997. “Who’s the Chef? Economic Voting 
Under a Dual Executive.” European Journal of Political Research 
31 (April): 315—25. 

Lewis-Beck, Michael, and Mary Stegmeier. 2000. “Economic De- 
termimants of Electoral Outcomes.” Annual Review of Political 
Science 3: 183-219. 

Lijphart, Arend. 1992. “Introduction.” In versus Pres- 
idennal Government, ed Arend Lijphart. Oxford. Oxford Univer- 
sity Press, 1—30. 

Linz, Juan. 1990. “The Perils of Presidentiahsm.” Journal of Democ- 
racy 1 (Winter): 51-69 

Linz, Juan J. 1994. “Presidential versus Parliamentary Democracy: 
Does It Make a Difference?” In The Failure of Presidential Dernoc- 
racy, Volume 2. The Case of Latin America, ed. Juan Linz and 
Arturo Valenzuela. Baltimore: Johns Hopkins University Press, 
3-90. 

Manin, Bernard, Adam Przeworski, and Susan Stokes. 1999 “Elec- 
tions and Representation.” In Democracy, Accountability, and 
Representation, ed. Adam Przeworski, Susan C. Stokes, and 
Bernard Manm. New York: Cambridge University Press, 29—54. 

Nannestad, Peter, and Martin Paldam. 2002. “The Cost of Ruling.” In 
Economic Voting, ed. Han Dorussen and Michaell Taylor. London 
and New York Routledge, 17—44. 

O'Donnell, Guillermo. 1994. “Delegative Democracy.” Journal of 
Democracy 5 (January): 55—69. 

Pacek, Alexander, and Benjamin Radcliff. 1995. “The Political Econ- 
omy of Competitive Elections in the Deve World.” Ameri- 
can Journal of Political Sclence 39 (August): 745—59. 

Palmer, Harvey D., and Guy D. Whitten. 1999. “The Electoral Impact 
of Unexpected Inflation and Economic Growth.” British Journal 
of Political Science 29 (October): 623-39. 

Persson, Torsten, Gérard Roland, and Guido Tabellin: 1997. “Sepa- 
ration of Powers and Political Accountability.” Quarterly Journal 
of Economics 112 (November): 1163-203. 

Powell, G. Bmgham, and Guy D. Whitten. 1993. “A Cross-National 
Analysis of Economic Voting: Taking Account of the Political 
Context.” Amencan Journal of Political Saence 37 (May): 391— 
414. 

Remmer, Karen L. 1991 “The Political Impact of Economic Crisis 
in Latin America in the 1980s.” American Political Science Review 
85 (September). 777—800 


435 


Presidentialism and Accountability for the Economy 


August 2004 





Remmer, Karen L. 2001. “Elections and Economics in Contem- 
porary Latin America.” Presented at the meeting of the Latin 
American Studies Association, Chicago. 

Samuels, David, and Matthew Shugart. 2003 “Presidentalism, 
Elections, and Representation.” Journal of Theoretical Pol:tics 
15 (January): 33—60. 

Shugart, Matthew S. 1995. “The Electoral Cycle and Institutional 
Sources of Divided Presidential Government.” American Political 
Science Review 89 (June): 327—43. 

Shugart, Matthew S., and John M. Carey. 1992. Presidents and Assem- 
blies: Constitutional Design and Electoral Dynamıcs. New York: 
Cambridge. 

Shugart, Matthew S., and Scott Mainwaring. 1997 “Presidentialism 
and Democracy in Latin America: Rethinking the Terms of the 


436 


Debate.” In Presidentialism and Democracy m Latin America, ed. 
Scott Mainwanng and Matthew Shugart. New York: Cambridge 


tary 
tmclon.” European Journal of Political Research 42: 287— 
312. 

Stokes, Susan. 2001. “Introduction.” In Public Support for Market 
Reforms m New Democracies, ed. Susan C. Stokes, New York: 
Cambridge University Press, 1-34. 

Strøm, Kaare. 1990. Minority Government and Majority Rule. New 
York: Cambridge. 

World Bank. 2003. “World Development Indicators” [CD-ROM]. 
Washington, DC: World Bank. 


American Political Science Review 


VoL 98, No.3 August 2004 





It's Not Whether You Win or Lose, but How You Play the Game: 
Self-Interest, Social Justice, and Mass Attitudes toward 


Market Transition 


RAYMOND M. DUCH University of Houston 
HARVEY D. PALMER University of Mississippi 


explore systematic differences in economic reasoning and what might account for them, we 
i investigate how sociocultural conditions affect transitions to market economies in the West African 
country of Benin. We probe the importance of several factors: basic economic norms, utility max- 
imization behavior, individual-level personal capital, and individual-level social capital. The evidence, 
based on experiments embedded in an opinion survey, indicates that Beninese citizens widely share 
commitments to the basic foundations of economic interaction, e.g., property rights. The nature of social 
capital varies across cultural and political contexts and accounts for cross-contextual variation in the 
costs associated with cooperative behavior and in utility maximization behavior. 


the transition from a highly state-dominated 

nonmarket economy with nondemocratic po- 
litical institutions to a more market-oriented econ- 
omy with democratic political institutions. Since the 
late 1980s, a large number of countries, including the 
postcommunist regimes of East and Central Europe as 
well as many African countries, fall into this category. 
The current conventional policy prognoses for these 
developing countries is the adoption of policy and in- 
stitutional reforms designed to promote efficiency— 
for example, “micro-lending” initiatives designed to 
promote indigenous business formation or more secure 
property rights aimed at expanding the base of invest- 
ment capital (de Soto 2000). Increasingly, development 
economists recognize that economic development poli- 
cies will only work if they incorporate incentives that 
promote efficient economic choices, such as invest- 
ing in the future rather than increasing consumption 
(Easterly 2002). 

Many of these initiatives build upon straightforward 
conclusions from classical micro-economic theory. To 
the extent that the basic assumptions of classical micro- 
economic theory hold in developing countries, many 
of these policy prognoses make eminent sense. Some 
have argued, though, that the behavioral assumptions 
of these classical models do not hold in developing 


D eveloping countries are frequently faced with 
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countries with sociopolitical contexts that differ widely 
from those in advanced industrial countries.! We focus 
on two of these challenges to the classical assumptions 
in the literature. First, a large body of experimental 
evidence demonstrates that deviations from micro- 
economic behavioral assumptions vary considerably 
across cultural contexts. Second, an extensive litera- 
ture suggests that citizens in these transition countries 
retain considerable cultural and ideological “baggage” 
that inhibits their ability to respond appropriately (i.e., 
in the classical micro-economic sense) to institutional 
and economic cues. 

Are there universal regularities to economic reason- 
ing and behavior? Or are the reasoning and behav- 
ioral assumptions of micro-economic theory subject to 
significant cross-cultural variations? We propose a con- 
ceptualization of economic reasoning that highlights its 
multidimensionality and suggests that certain aspects 
of economic reasoning are invariant across contexts 
while others vary across contexts. First, we distinguish 
between norms associated with the basic rules that gov- 
ern market interactions and those that are associated 
with economic choices. We dismiss the notion that so- 
cioeconomic context or individual cognitive capabil- 
ities affect normative commitment to the basic rules 
governing market interactions. We contend that there 
is no systematic variation in commitment to these basic 
norms. With respect to economic choices, though, we 
expect variation across contexts in the extent to which 
choices reflect the utility maximization assumptions 
of micro-economic theory. First, we posit that basic 
economic reasoning associated with market choices is 
not predicated on sophisticated cognitive abilities, nor 
is it predicated on demanding informational and psy- 
chological prerequisites. We do expect, though, that 
the nature of “social capital” varies across cultural and 


1 Palmer (1989, 320), for example, argues that “...a major reason 
for the growing disparities between the First and Third World is to be 
found in the persistence of traditional attitudes and behavior patterns 
m many regions of the Third World.” Bourgin (1984) suggests that 
the religious and cultural values in many regions of the Third World 
stress passivity rather than achievement. Other examples include 
Greif (1994), Harrison (1992), and Schejtman (1984). 
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political contexts and hence the costs associated with 
more or less cooperative behavior can vary significantly 
generating, for example, cross-contextual variation in 
utility maximization behavior. 

The unique contribution of our article to this debate 
is that we test these propositions using a mix of tradi- 
tional survey research methods augmented with exper- 
iments that are imbedded within surveys administered 
to representative samples of national populations. 
Second, our analysis focuses on survey experiments 
conducted in a country, Benin, where the behavioral as- 
sumptions of classical micro-economic theory are least 
likely to hold, at least according to critics of such mod- 
els? Benu is a particularly impoverished nation that 
adopted a democratic system of government during 
the past decade (per capita income in Benin hovered 
around $1000 during the late 1980s and early 1990s 
when the democratic transition occurred). Moreover, 
Benin’s transition to democracy supplanted a non- 
democratic regime that implemented highly collectivist 
and antimarket institutions. By selecting this particular 
case, we believe our analysis is strongly biased toward 
producing results consistent with the (null) hypothe- 
sis that nascent market economies exhibit underdevel- 
oped market reasoning. 


Does Any of This Matter? Our larger concern in this 
article is improving our understanding of the micro- 
foundations of implementing market reforms in con- 
texts with underdeveloped market mechanisms. The 
literature on these transitions is replete with rather 
broad assertions about the appropriateness and feasi- 
bility of implementing market reforms in contexts that 
lack a well-established tradition of classic market insti- 
tutions. Some scholars, for example, argue that classic 
notions of property rights can easily be implemented 
in widely varying national contexts (e.g., de Soto 2000), 
while others question the cultural “receptivity” to such 
initiatives in many developing nations (Harrison and 
Huntington 2000). Similarly, some scholars suggest that 
the transition to market economics from a nonmarket 
system of the Soviet variety is extremely problematic 
because individuals adapt slowly to classic market rea- 
soning, which is fundamentally at odds with their tra- 
ditional communitarian norms (Moore 1966). These 
are important issues because they are the basis for 
extremely expensive international policy initiatives de- 
signed to promote economic development. 

Yet surprisingly there is little effort to leverage em- 
pirical work on the microfoundations of economic be- 
havior in order to better understand these broader 
theoretical and policy issues. The hypotheses and em- 
pirical tests reported below explicitly address this la- 
cuna. At the very broadest level, these hypotheses 
test whether in fact there is any systematic deviation 
from the fundamental assumptions of classic macro- 
economic theory. Moreover, this test is conducted in 
a context in which such deviations are most likely to 





2 For two excellent analyses of the Benin democratic transition, see 
Banégas (1998) and Magnusson (1996) 
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occur, i.e., a poor developing country with a nascent 
market economy. 

This article’s principal theoretical contention is that 
economic reasoning is multidimensional, some dimen- 
sions are more or less subject to deviations from classic 
economic assumptions, and the pattern of deviations 
across dimensions has important policy implications. 
Our theory helps explain why economic behavior de- 
viates from classic micro-economic assumptions. The 
empirical results have important policy implications 
because they indicate whether personal capital is the 
principal factor in explaining out-of-equilibrium be- 
havior. To the extent that this is the case, nothing short 
of dramatically improving cognitive and information 
capabilities in transitional societies will promote eco- 
nomic choices consistent with micro-economic theory. 
Alternatively, if social capital is primarily responsible 
for out-of-equilibrium decisions, policymakers should 
design policies that anticipate existing contextual in- 
centives or modify contextual circumstances to better 
reflect classic micro-economic theory. 


THEORY 


Our primary focus in this article is exploring whether 
cross-cultural or cross-contextual differences in eco- 
nomic reasoning help us understand the success or 
failure of countries in economic transitions. A variety 
of factors is hypothesized to determine the success of 
economic transitions including institutional attributes 
(McKinnon 1993), geography (Diamond 1997), invest- 
ment in human capital (Romer 1990), and a country’s 
endowment in productive factors such as technological 
innovation (Solow 1957) natural resources and popu- 
lation. Recent advances in our understanding of the 
rational choice assumptions underpinning economic 
theory suggest that systematic variations in economic 
reasoning might be an appropriate candidate for ad- 
dition to this list of explanatory factors. First, many 
scholars concede that the domains in which economic 
choices are made vary and that in some contexts ratio- 
nal choice assumptions, such as utility maximization, 
are more or less approximated (e.g., Chrystal and Peel 
1986 and Kahneman, Knetsch, and Thaler 1986). It cer- 
tainly seems plausible that contexts exist where, for a 
variety of socioeconomic and historical reasons, classic 
economic reasoning might be “underdeveloped” (or 
“overdeveloped”). Second, a number of recent empir- 
ical findings suggest that there are systematic contex- 
tual or cultural differences in economic reasoning (e.g., 
Heinrich 2000, Henrich et al. 2001, and Prasnikar and 
Roth 1992). Again, it would not seem unreasonable to 
link these systematic differences in economic reasoning 
with the success or failure of economic transitions. 
Unfortunately, much of the literature focuses on 
demonstrating the existence of systematic differences 
in economic reasoning. Yet, we believe the more inter- 
esting and relevant issue is explaining these systematic 
differences in economic reasoning and drawing infer- 
ences as to whether they help account for why some 
societies have experienced more successful transitions 
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to a market economy.’ In this section, we briefly review 
a number of existing theoretical arguments along with 
our theoretical perspective on the factors that might 
explain cross-national differences in cognitive reason- 
ing as it relates to economic decision making. 


Economic Reasoning? 


The notion that cultural values play an important 
role in the successful transition to a successful market 
economy is widely accepted by social scientists (e.g., 
Harrison and Huntington 2000, Inglehart 1997, and 
Weber 1958). An example is North’s (1981) argument 
that norms governing economic interactions can in- 
hibit the development of efficient market economies. 
Typically, these claims have been rather broad and im- 
precise, which we believe undermines their credibility 
and impedes empirical testing. Our approach provides 
more precise insights into these linkages by conceptu- 
alizing economic reasoning as multidimensional with 
some dimensions having little cross-national or cross- 
cultural variation, while others are more sensitive to 
different socioeconomic contexts. 

What is it about economic reasoning that might be 
more or less consistent with efficient economic decision 
making? Individuals have a repertoire of expectations 
and norms related to the market economy that make up 
the various behavioral assumptions of micro-economic 
theory. Within this broad repertoire we distinguish 
amongst two different types of reasoning: rules and 
choices. The obvious political analogy is the distinc- 
tion between the legitimacy individuals accord regime 
institutions and their preferences over politicians con- 
tending for political power (Easton 1963). 


Basic Economic Norms. We begin with the funda- 
mental rules governing economic exchange. Market 
economies are unlikely to function without wide ac- 
ceptance of certain basic norms or rules that govern 
individual] transactions—guarantees of individual liber- 
ties, of property rights, and of personal security. Olson 
(1993), for example, argues that the protection of indi- 
vidual rights to property and contract enforcement is 
a sufficient condition for the emergence of democracy 
as well as for economic development and growth. We 
contend that these basic economic norms are widely 
shared by individuals across different cultures and lev- 
els of socio-economic status (see Milgrom, North, and 
Weingast 1990 and Putnam 1993). The notion of the 
right to, and the preservation of, private property is 
not cognitively demanding, nor do we expect that it 
requires extensive socialization (e.g., Ellickson 1991 
and Ostrom 1990). It is, we would argue, virtually a 
primal instinct for the human species. In fact, there is an 


3 Note that traditional economic models of market transition gener- 
ally assume that the basic assumptions of clasmcal micro-economic 
theory hold for citrzens in developing countries just as they do for 
citizens m advanced industrial countries. Hence, research on the 
process of economic liberahzation does not include the development 
of market reasoning as an mitial step in the transition process (e g., 
McKinnon 1993). 


impressive body of evidence from the fields of ethnol- 
ogy and sociobiology suggesting that acquisitiveness is 
universal across the human species.* Evidence of this 
includes the historical evolution of property rights in 
widely different cultural contexts and the failure of 
efforts by various regimes to eliminate property rights 
(Pipes 1999; Putnam 1993). And as de Soto (2000) ar- 
gues, cultural factors have not prevented the evolution 
of more effective property rights in developing coun- 
tries. Hence, we expect that respect for basic rights, 
such as property rights, will be uniformly high across all 
groups in society, regardless of education, information 
levels, socioeconomic status, and cultural context. 


Economic Choices. Micto-economic theory also 
posits a set of assumptions regarding the choices in- 
dividuals make in market contexts. One critical as- 
sumption is that individuals will maximize utility when 
faced with choices among different alternatives. We 
treat these assumptions differently than the fundamen- 
tal rules described above. These are behaviors that we 
expect will vary by socio-economic context. Moreover, 
experimental findings indicate considerable variation 
in the extent to which individuals approximate the 
utility maximization assumptions of micro-economic 
theory. Examples include experiments designed to in- 
vestigate bargaining in ultimatum games (e.g., Heinrich 
2000 and Prasnikar and Roth 1992). Similarly, exper- 
imental evidence points to significant individual-level 
variations in the extent to which subjects engage in co- 
operative behavior (Eckel and Wilson 2003; E. Glaeser 
et al. 2000). And there is increasing evidence that the 
experimental results are conditioned by cultural and 
institutional contexts (e.g., Roth et al. 1991). 

How might variations in utility maximization im- 
pact transition economies? Evidence that individuals 
in transition contexts adopt very different strategic 
behavior than individuals in more established market 
economies suggests that the public’s acquisition of mar- 
ket reasoning might represent a barrier to successful 
transition. Strong aversions to particular equilibrium 
strategies (such as the zero offer in the ultimatum 
game) may reflect cognitive reasoning that undermines 
certain liberal market reforms. 

Our primary goal in this article is to establish, 
through the application of survey research techniques 
combined with experimental methods, whether in fact 
there exists systematic individual-level variation in 
utility maximization. Having established this, the sec- 
ond goal is to account for the sources of systematic 
individual-level variation in economic reasoning. There 
are two fundamentally different explanations for this 
variation in utility maximization: one involving per- 
sonal abilities (or capital) and the other focusing on 
social capital.> Which of these explanations proves cor- 
rect has important implications for our understanding 
of the process of market transition. 


4 For an extensive review of the evidence supporting the notion that 
“acquisitiveness is universal among humans as well as animals,” see 


Pipes 1999. 
5 We use the distinction between personal and social capital pro- 


posed by Becker (1996). 
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Personal Capital 


On the one hand, variations in utility maximization be- 
havior might be linked to personal capital, specifically 
cognitive abilities and levels of information. Coun- 
tries with poorly educated and uninformed populations 
(and there are many; see Duch and Whitten 2003) 
would be unfertile soil for market reforms that build 
on classical micro-economic assumptions. If variation 
in economic choice is uniquely explained by education, 
then educating the population would be the most ef- 
fective strategy for promoting a successful transition to 
a market economy. We seriously doubt, however, that 
greed or self-interest requires any sophisticated rea- 
soning capabilities. And some of the limited efforts to 
empirically model deviations from utility maximization 
behavior lend support to our position (Henrich et al. 
2001). We propose three measures of personal capital: 
cognitive abilities, modernization, and information. 


Cognitive Abilities. Many argue that investment in 
human capital, or personal capital, to use our nomen- 
clature, positively impacts economic development (e.g., 
Barro 1991, Romer 1990, and Duch and Whitten 
2003). A reasonable expectation is that education 
contributes to economic development because of its 
impact on economic reasoning. Lower levels of ed- 
ucation might undermine classic economic reasoning 
and hence constitute, in the aggregate, a barrier to 
economic transition or development. Yet we see no 
inherent cognitive challenges associated with utility 
maximization. Hence, we do not expect education to 
explain individual-level variation in utility maximiza- 
tion behavior. 


Modernization. There are characteristics of modern, 
as opposed to traditional, society that may contribute 
to the sophistication of individual-level economic rea- 
soning. Individuals in “modern” societies have greater 
access to more diverse information sources. Urbaniza- 
tion, another feature of modernization, is frequently 
identified as one of the catalysts for the development 
of successful market economies, primarily because it 
accelerates the diffusion of knowledge and improve- 
ments in human capital (Becker, Glaeser, and Murphy 
1999; Glaeser 1999; Lucas 1988). One might conclude 
that urbanization increases the exposure of average 
citizens to market mechanisms and thereby increases 
their level of sophistication in economic reasoning. We 
question whether exposure to urban or modern influ- 
ences will have any bearing on economic reasoning per 
se. Lending support to this perspective is experimental 
research that questions whether economic reasoning 
is underdeveloped in nonmodern settings. Heinrich 
(2000), for example, compares the ultimatum game 
results for the Machiguenga tribe with those of a Los 
Angeles control group and finds that the Machiguenga 
much more closely approximate the predicted equilib- 
rium outcome. 


6 But see Easterly (2002) for a skeptical assessment of the evidence. 
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Information. To the extent that engaging in maximiz- 
ing behavior incurs information-gathering costs, one 
might expect high levels of media usage (information 
consumption) to reduce these information-gathering 
costs and hence produce behavior more consistent with 
the classical model. We contend that these information 
gathering costs are not particularly high and hence me- 
dia exposure should have little bearing on economic 
reasoning. 

We believe that the above three factors (as compo- 
nents of personal capital) have only marginal impacts 
on market reasoning and strategic economic decision- 
making. Education, exposure to “modern” information 
sources, and the acquisition cost of information about 
the economy are not expected to be significantly corre- 
lated with the sophistication of individuals’ economic 
reasoning. Rather, individuals have inherent economic 
instincts that evolve relatively quickly into cognitive 
market skills without extensive training or socializa- 
tion (i.e., education, exposure to “modernity,” or fre- 
quent media usage). In short, we reject the notion that 
personal capital has any significant bearing on utility 
maximization behavior. 


Soclal Capttal 


On the other hand, we expect that there are contexts 
in which rational choice behavioral assumptions, such 
as utility maximization, are likely to be at odds with 
theoretical expectations. Based on Becker (1996), we 
adopt the concept of social capital to account for con- 
textual incentives to deviate from utility maximization 
behavior. Social capital shapes the extent to which the 
economic choices of average citizens reflect norms 
governing equity and reciprocity. Individuals through 
interactions and socialization develop different expec- 
tations as to how equitable gestures are likely to be 
reciprocated (Fehr and Gachter 2000). To use Becker’s 
(1996, 13) formulation, individuals incorporate the 
costs associated with utility maximization behavior into 
their social capital. Efforts to formalize these con- 
textual influences have demonstrated that the eco- 
nomic environment, which consists of varying degrees 
of heterogeneity of preferences for equitable versus 
inequitable outcomes, can significantly affect equilib- 
rium outcomes in a number of cooperation games (Fehr 
and Schmidt 1999). We expect to be able to identify 
individual-level traits (attributable to contextual fea- 
tures such as culture, politics, and the socioeconomic 
environment) that distinguish more aggressive from 
less aggressive utility maximization. 

Our expectation is that individuals adjust their eco- 
nomic behavior in response to contextual factors that 
make some choices more or less costly than others. So, 
for example, many years of interacting in a community 
that punishes aggressive utility maximization is likely to 
have some long-term impact on this facet of economic 
behavior. Three factors related to social capital are 
likely to condition economic behavior: socialization to 
market, government and authority, and personal eco- 
nomic circumstances. 
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Socialization to Market. There is a variety of his- 
torical and contemporary examples of social contexts 
that reward equitable and fair choices and are antag- 
onistic toward individual choices that maximize utility. 
The postcommunist nations are examples of contexts 
in which at least official norms were antagonistic to 
individual maximization behavior. And many of the 
developing African nations, of which Benin is an ex- 
ample, have been governed by a similar set of offi- 
cial norms. As North (1981) points out these informal 
norms governing economic interaction change slowly 
in response to societal change. Individuals in countries 
that only recently adopted a market economy might 
be slow to discard command-economy reasoning that 
emphasizes social goals, in favor of market reasoning 
that focuses on personal welfare. If contextual factors 
influence the “cost” of utility maximization, we should 
see considerable out-of-equilibrium individual choices 
in these nascent market economies. 

We speculate that the effects on market reasoning of 
the cultural and ideological “baggage” of non-market- 
oriented regimes are reinforced by two primary factors. 
First, sociological theories (e.g., Smelser and Swedberg 
1994) assert that individuals with more extensive so- 
cial networks are more likely to engage in cooperative 
behavior and hence are less likely to behave in a strict 
utility maximization fashion. Summarizing this litera- 
ture, Frye (2000) notes that dense social networks are 
hypothesized to increase the cost of having a reputa- 
tion for excessively self-interested behavior and, also, 
to increase the opportunities to punish selfish behavior. 

Second, game theoretic approaches suggest that het- 
erogeneity of group membership results in higher lev- 
els of noncooperative behavior (Ostrom 1990; Snidal 
1994). More heterogeneous groups have more diverse 
interests, which hinders cooperative agreements. Be- 
cause the expectations and norms of heterogeneous 
members can be very different and conflicting, trans- 
actions are much more likely to reflect utility maxi- 
mization rather than fairness and cooperation. Hence, 
individuals in larger and more heterogeneous com- 
munities are more likely to behave according to the 
utility maximization assumptions of classic models of 
economic behavior. 

Benin offers a particularly interesting venue for as- 
sessing the extent to which market behavior varies ac- 
cording to the heterogeneity of ethnic communities. 
There are diverse tribal groups in Benin, and for our 
purposes, the fact that they vary in size and hetero- 
geneity allows us to test the heterogeneity and cooper- 
ation argument developed above. The Fon tribe is by 
far the largest and most heterogeneous tribal group, 
accounting for roughly one-third of the respondents 
in the survey. All of the other tribal communities in 
Benin are considerably smaller in comparison with 
no other group commanding more than 10% of the 
sample. The Fon tribe has historically been the domi- 
nant tribe in Benin and exhibits a very heterogeneous 
membership in terms of socioeconomic characteristics 
and geographic dispersion. We expect that cooperative 
impulses will be more instinctive in the smaller, mi- 
nority, and homogeneous tribal communities in Benin. 


Members of these homogeneous tribes, because of 
their commonality of interests, are much more likely to 
have well-developed cooperative norms. By contrast, 
members of the larger and more heterogeneous Fon 
tribe will have much less developed cooperative norms. 
Because of the greater diversity of group interests, util- 
ity maximization will guide common transactions and 
economic decisions for members of the Fon tribe much 
more than for members of other minority tribes. 

Finally, socialization also covers the personal expe- 
riences that individuals have with market mechanisms. 
To what extent are individuals familiar with market 
processes and have actually engaged in market activi- 
ties, such as saving, borrowing, investing, and property 
ownership? Henrich et al. (2001) conduct ultimatum 
games in 15 different tribal societies and find signif- 
icant group differences in terms of the modal offers. 
The principal explanation for these tribal differences 
is the degree to which people in these different groups 
relied on market exchange in their daily life. 


Government and Authority. Political institutions and 
government actions can provide the negative or pos- 
itive incentives that lead individuals to deviate from 
utility maximization. Trust represents a critical precon- 
dition for individuals to make choices that promote co- 
operative goals as opposed to maximizing personal util- 
ity (Glaeser et al. 2000). Trust in government is likely 
to ensure individuals that cooperative behavior will be 
enforced and shirking will be punished. Paradoxically, 
rising levels of trust in government should lead to out- 
of-equilibrium economic choices. Similarly highly def- 
erential attitudes toward governmental authority are 
also likely to result in out-of-equilibrium responses. 
High levels of trust in government have important pos- 
itive implications for the success of democratic polities 
(Rabushka 1974; Weingast 1997). In the context of the 
transition from authoritarian and nonmarket regimes, 
residual attitudes of deference and trust in government 
may have different implications. As Olson (1993) and 
others have pointed out, the successful transition to 
democratic market economies builds on self-interested 
behavior that checks the possible exercise of central- 
ized authority, making power sharing a more preferred 
outcome.’ Transition environments that reinforce atti- 
tudes of trust and deference in government may gen- 
erate levels of out-of-equilibrium behavior that are in- 
consistent with aggressive protection of, for example, 
private property, individual liberties, and contracts. 


Economic Circumstances. The economy is a context 
that can reassure individuals that the consequences of 
cooperative actions, even if not reciprocated, will have 
a marginal impact on their well-being. So, for exam- 
ple, we see levels of charity rise when the economy 
is doing well and individuals are less price-conscious 
in their consuming behavior. By contrast, the economy 


7 In the context of Olson’s (1993) argument, trust in government 
m a nascent democracy undermines the necessary condition for the 
transition from authoritarian to democratic rule: the existence of a 
political stalemate that forces leaders to consider power-sharing as a 
more viable option than trying to consolidate power. 
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can also reinforce utility maximization behavior. When 
the economy is performing poorly individuals are much 
more attentive to making within-equilibrium economic 
choices. The consequences of failing to maximize one’s 
economic opportunities are quite different if one lives 
on the edge of economic survival as opposed to being 
relatively wealthy. The costs of cooperation are higher 
when individuals are facing difficult economic condi- 
tions. We believe that affluence tends to moderate util- 
ity maximization, while poverty should enhance this 
economic calculus. 


A Synopsis of Our Hypotheses. Our empirical efforts 
focus on testing four hypotheses that follow from the 
above discussion: 


1. Support for basic economic norms is pervasive 
across sociopolitical contexts—there is no system- 
atic variation in individual-level support for these 
basic norms. 

2. Individual economic choices deviate in a system- 
atic fashion from micro-economic assumptions re- 
garding utility maximization. 

3. Deviations from utility maximization are not sys- 
tematically linked to individual-level differences 
in personal capital. 

4, Deviations from utility maximization are system- 
atically related to individual-level differences in 
social capital. 


DATA AND METHOD 


The survey results reported in this article are based 
on a survey conducted in the West African nation of 
Benin in May 2000. The sample consisted of a stratified 
national random sample of the Benin population. The 
Appendix contains a detailed discussion of the survey 
design and the coding of the variables included in our 
statistical analysis. 


Dependent Variables 


The focus of our statistical analysis is explaining 
individual-level variation in market reasoning. We dis- 
tinguish between market reasoning associated with 
rules versus choices. Rather than using attitudinal ques- 
tions to indirectly measure market reasoning, we em- 
ploy two quasi-experiments imbedded in the survey 
that are designed to measure these two different di- 
mensions of market reasoning. The first survey experi- 
ment characterizes economic choices by simulating the 
conditions of a dictatorship game. In this experiment, 
the interviewer tells the respondent that after the sur- 
vey is completed, two of the persons interviewed will 
be chosen randomly to share a 15,000 CFA prize (two 
months’ average salary in Benin). How that prize is 
divided will be determined by the first person chosen. 
The interviewer then asks the respondent to choose 
how he or she wants to divide the prize in the event 
of being chosen first to share the prize. Responses 
that are more consistent with the equilibrium response 
of keeping all of the prize (except a trivial amount) 
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represent more sophisticated market reasoning. See 
the Appendix for the specific wording of the dictator- 
ship game experiment. 

The second survey experiment characterizes support 
for basic rules of the market economy. It employs a 
vignette about the expropriation of a person’s farm- 
land by the regional government (see the Appendix 
for specific wording). The circumstances of the vignette 
are manipulated in three ways to create eight unique 
scenarios: (1) the occupant’s legal (de jure) claim to 
the land, i.e., whether or not the occupant has le- 
gal title to the land; (2) the merits of the occupant’s 
(de facto) claim to the land, i.e., whether or not the 
occupant improved the productivity of the land; and 
(3) the perceived motives of the regional government, 
ie., whether the land would be used to build a new 
headquarters for the region’s ruling party or a water 
tower that would provide the village with drinking 
water. Respondents were randomly assigned to the 
different scenarios or treatments. Negative reactions to 
the government’s decision to expropriate the farmland 
represent greater support for property rights and more 
sophisticated market reasoning. 


Explanatory Variables 


The statistical analysis presented below includes two 
sets of explanatory variables that correspond to the 
personal and social capital concepts introduced in the 
previous section. As pointed out earlier, our mea- 
sure of personal capital incorporates three dimensions: 
(1) cognitive capabilities, which are measured by for- 
mal education (Education); (2) modernization, which 
is measured by urban residency (Urban Resident); and 
(3) information consumption, which is measured by 
attention to the media (Media Usage). 

Social capital has three dimensions: socialization to 
the market, government and authority, and personal 
economic circumstances. These three dimensions cap- 
ture environmental factors that raise or lower the costs 
of engaging in utility maximization behavior. Social- 
ization to the market is measured by three variables: 
the density of Social Group Membership, the subjec- 
tive Importance of Ethnic Identity, and Experience with 
Market Mechanisms Density of group membership 
is hypothesized to increase the social costs of engag- 
ing in aggressive utility maximization. Similarly, tra- 
ditional ethnic or tribal norms are likely to either 
raise or lower the social costs associated with utility 
maximization behavior. We expect ties to the domi- 
nant heterogeneous Fon ethnic group to exaggerate 





8 Expenence with Market Mechanisms is only a proxy for actual 
experience with market mechanisms. For this reason, some readers 
might doubt the validity of this measure. The survey data, however, 
indicate that the proxy 1s valid. Respondents with the highest values 
for Expenence with Market Mechanisms (>2) were more likely to 
have experience with borrowing (14.6% versus 10.0% for those with 
values <2) and to own more than one household item (34.8% versus 
29.2%). Similarly, they also displayed greater willingness to use their 
savings to start a business (712% versus 67.4% responded “very 
hkely” or “somewhat likely” to do so) and to borrow money to start 
a business (58.1% versus 45.2%). 
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utility maximization, while ties to the smaller more 
homogeneous ethnic groups will moderate utility max- 
imization. And finally, experience or familiarity with 
market mechanisms should raise the likelihood of util- 
ity Maximization. 

A second dimension of social capital concerns gov- 
ernment and authority. Out-of-equilibrium results in 
the dictatorship game are predicted for individuals who 
are more likely to be assured that cooperative behavior 
will be effectively enforced. These are individuals who 
have a high Trust in Government and a high Deference 
to Authority. 

The third dimension of social capital is the economic 
environment. We expect that the economic situation 
confronting individuals “motivate” the application of 
market reasoning in the survey experiments. For in- 
dividuals facing less secure financial situations—lower 
incomes (Income), greater concern about employment 
status (Unemployment Concern), less personal prop- 
erty (Personal Property Ownership)—the costs of out- 
of-equilibrium choices are likely higher than for those 
facing more secure economic prospects. 

Finally, the analysis controls for contextual factors 
associated with region. We speculate that regional dif- 
ferences exist in local financial and social institutions 
(both formal and informal) that influence market rea- 
soning. While a full investigation of these regional 
differences is beyond the scope of this article, con- 
trolling for these potential effects seems warranted.’ 
Additionally, we control for gender differences. Studies 
of advanced industrial democracies indicate that gen- 
der differences exist in social and political capital (e.g., 
Burns, Schlozman, and Verba 1997), so it is natural to 
expect to find gender differences in market reasoning 
among citizens of a developing country with a nascent 
democracy. See Table A1 in the Appendix for descrip- 
tive statistics for all variables used in our analysis. 


RESULTS 


Economic Cholces: The Dictatorship Game 


Table 1 reports the distribution of responses to the 
dictatorship survey experiment. Table 1 indicates that 
there is considerable variation in how respondents 
chose to divide the 15,000 CFA prize. Furthermore, 
only 7.8% chose to divide the prize in a way consistent 
with the equilibrium strategy in a dictatorship game, 
while 45.6% chose an even, near-even, or disadvanta- 
geous split of the prize. The typical experimental result 
for dictatorship games is that the modal offer is the 
equilibrium outcome, i.e., 0 to the other player (e.g., 
see Forsythe et al. 1994). The individual-level variation 
in Table 1 highlights the possibility of underdeveloped 
economic reasoning in developing countries. 


9 Specifically, we expect the southern, coastal regions of Oueme, 
Mono, and Atlantique (baselne category) to be more market ori- 
ented than the central and northern regions of Zou, Atacora, and 
Borgou. This division of the regions reflects differences in their 
economies, where the former group has a greater concentration in 
service and light mdustry, while the latter has a greater concentration 


m agriculture. 
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TABLE 1. Responses to Dictatorship Survey 
Experiment 
Sample Percentage of 


Survey Responses Frequency Sample 


15,000 or 14,950 for me 114 7.8 
13,000 for me 101 6.9 
12,000 or 11,500 for me 47 3.1 


10,500 for me 131 8.9 
10,000 for me 112 7.8 
9,500 for me 68 4.6 
9,000 or 8,500 for me 111 7.5 
8,000 or 7,800 for me 111 7.5 
7,600 for me 165 11.2 
7,500 for me (even split) 453 30.8 
7,000 or 5,000 for me 53 3.6 


TABLE 2. Ordered Probit Model of Response 
to Dictatorship Experiment 





Varlable Coefficient Error 
Education —.005 .007 
Urban Resident .05 .07 
Medla Usage —.012 .017 
Soclal Group Membership —.036** .014 
Importance of Ethnic Identity —.118*** : 
Fon —.30 17 
Fon * Importance of Ethnic .183""" : 


Deference to Authority 

Trust in Government 

Income 

Unemployment Concern 
Personal Property Ownership 
Female 


Regional dummy variables 
Atacora 
Borgou 
Mono 
Oueme 
Zou 
Constant 


Hı 
He 
H3 
Ls 
Ls 

LR statistic of model 

significance 

% predicted correctly 32.8 

% reduction In error 2.9 
Note: Dependent vanable is a seven-category measure of re- 
sponses to the dictatorship survey expenment The yu coef- 
ficients are estimates of the threshold parameters of the or- 
dered probit mode! with uo standardized to zero N = 1,469. 
“p< .001, = p <.01,*p < 05 (two-tailed test). 


174.1*** 








Yet the crucial question of what explains this 
individual-level variation remains unanswered. The or- 
dered probit analysis reported in Table 2 attempts to 
address this very question (for a description of ordered 
probit models, see Greene 2000, 875—77). It models 
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responses to the dictatorship survey experiment as a 
function of personal and social capital.’° In order to 
simplify the presentation of the ordered probit model, 
we collapsed the 11 categories in Table 1 into seven 
categories. This was done by combining adjacent cate- 
gories with theoretically similar divisions of the prize 
money (see the Appendix for the specific coding of the 
collapsed measure). 

As discussed above, some critics of micro-economic 
theories of market transition question whether mass 
publics in developing countries have the requisite per- 
sonal capital for market reasoning. This theoretical per- 
spective implies that citizens in transition economies 
who engage in market reasoning must be unusual in 
terms of their cognitive abilities and knowledge of mar- 
ket processes. Consistent with our expectations, Table 2 
casts doubt on the validity of this theoretical perspec- 
tive by demonstrating the statistical insignificance of 
all of the personal capital measures: Education, Urban 
Resident, and Media Usage. Considered together, 
these three variables prove to be jointly insignificant, 
at even the 0.10 level, with a likelihood-ratio (LR) test 
statistic of 1.60. 

On the other hand, we expect social capital to be 
significant in explaining individual-level variation in 
economic behavior, specifically utility maximization. 
Variables measuring the first dimension of social cap- 
ital—socialization to the market—are mostly statisti- 
cally significant.!? Respondents who had more dense 
social networks were less likely to apply market reason- 
ing in the dictatorship game. Similarly, the subjective 
importance of ethnic identity strongly distinguished 
among respondents in terms of their market reasoning. 
For those respondents who belonged to ethnic groups 
other than Fon, their likelihood of placing personal 
gain above social justice decreased with the importance 
of their ethnic identity. In contrast, among Fon respon- 
dents, stronger ethnic identity increased the likelihood 
of engaging in market reasoning. This difference con- 
firms the argument that larger heterogeneous commu- 
nities, such as the Fon, are more likely to engage in 
myopic utility maximization while members of smaller, 
more homogeneous tribal communities are more likely 
to have cooperative norms. 


10 In an auxilary analysis (available from the authors upon request), 
we estimated the model in Table 2 with control variables for age, 
religiosity, societal ethics, support for government ownership and 
regulation, and support for social welfare policies. All of these control 
variables were insignificant at the 0.10 level except societal ethics. The 
only notable change in the parameter estimates for the measures of 
personal and social capital was that the coefficient for Personal Prop- 
erty Ownership was about 20% larger in magnitude and significant 
at the 0.05 level 
11 In an auxiHary analyms (available from the authors upon request), 
we estimated the model in Table 2 using the full set of categories 
in Table 1. The results of this analysis were essentially the same as 
those presented in Table 2. 
12 In an auxiliary analyms (available from the authors upon request), 
we replaced Media Usage with a measure of the respondent’s level 
of economic information (based on interviewer evaluation). This 
variable also proved to be statistically wnsignificant in the ordered 
probit model of response to the dictatorship experiment. 

The five variables capturing socialization to the market are jointly 
significant at the 0.0001 level, with a LR test statistic of 35.67. 
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The final component of the “socialization to market” 
dimension of social capital—experience with market 
mechanisms—influenced market reasoning in the ex- 
pected direction but did not prove statistically signifi- 
cant at the 0.05 level. In light of the variable’s statistical 
insignificance, our suspicion is that exposure to the so- 
phisticated market mechanisms measured by this vari- 
able does not significantly enhance market reasoning — 
partially because it is relatively rare in this context. It 
is likely the case that less demanding forms of market 
exposure, such as weekly barter marchés, are perva- 
sive in Benin. Hence, our results suggest that these 
less demanding experiences provide enough familiar- 
ity with the market to allow citizens to develop market 
reasoning. In sum, the dictatorship game provides ev- 
idence supporting the argument that there are aspects 
of socialization to the market (or nonmarket) economy 
that reduce or lower the social costs associated with 
engaging in utility maximization behavior. 

The empirical results for the second dimension of 
social capital concerning government and authority 
also confirm our hypotheses. Individuals with greater 
trust in government and those with higher deference to 
authority are significantly more likely, as theorized, to 
have out-of-equilibrium responses to the dictatorship 
game. These estimated effects proved to be individually 
significant at the 0.01 level and jointly significant at the 
0.001 level (with a LR test statistic of 18.40). 

Finally, Table 2 provides some evidence that per- 
sonal economic circumstances, as the third dimen- 
sion of social capital, influence market reasoning in 
a manner consistent with our theoretical perspective.'4 
As expected, respondents with less personal property 
and more serious unemployment concerns were more 
likely to engage in market reasoning in the dictatorship 
game. While the effect of Unemployment Concern is 
strongly significant at the 0.001 level, Personal Prop- 
erty Ownership is only marginally significant at the 0.10 
level. 

Because ordered probit estimates nonlinear rela- 
tionships between the explanatory variables and the 
choice probabilities, we calculated the predicted prob- 
abilities in Table 3 to illustrate the estimated effects of 
the three dimensions of social capital on the probability 
of “superior” market reasoning (i.e., choosing to kee 
at least 11,500 CFA or 76.7% of the prize money). 


14 The ordered probit model controls for gender differences based 
on the speculation that gender differences in social capital might 
exist independent of the measures of social capital included in our 
analysis Gender differences, though, do not prove mgnificant m our 
analyms. Even though gender does not have a direct impact on market 
reasoning, we did find some evidence of gender differences in social 
capital, with women having less dense social networks and being less 
deferent to authonty. The t-statistics for the difference of means tests 
are 4.61 for density of social network and 2.09 and 2 79 for the two 
items used to construct the deference to authonty measure (see the 
Appendix) These statistics are significant at the 0.0001, 0.05, and 
0.01 levels, respectively. 

15 The three vanables representing personal economic circum- 
stances are jointly significant at the 0.001 level, with a LR test statistic 
of 18.43. 

16 Since we hypothesized, and the empirical results confirm, that 
personal capital has no mpact on utility maximraton behavior, we 
have not illustrated this factor’s impact in Table 3. 
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TABLE 3. Predicted Probabilities of Supertor Market Reasoning In Dictatorship 
riment 
Low Market Soclalizaton High Market Sociallzation 


Low High Low High 
Govemment Government Government Government 
Trust Trust Trust Trust 


High Unemployment Concern 181 137 249 185 
(.018) 
.091 


(.020) (.019) 
179 136 
018 015 


(.021) 
Low Unemployment Concem 125 
.01 .013 
Note. Table reports predicted probabiltttes for respondents keeping 11,500 or more for themsetves in the 
Dictatorship survey experment (i.e., top two categories of the variable modeled In Table 2 and.top three 


; skewed 
dard deviation below the mean and the high value Is defined as 1.5 standard deviations above the mean. 
These probabiites were calculated using mean values for the other variables. The values in parentheses are 
standard errors for the predicted probabilities. They were calculated using the delta method (see Greene 2000, 
824—25). Detalls on our application of this method and the corresponding LIMDEP code are available from the 
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authors upon request. 


Table 3 reports the predicted probabilities of superior 
market reasoning for high and low values of Trust in 
Government (as a measure of the government and au- 
thority dimension), Unemployment Concern (as a mea- 
sure of the economic circumstances dimension), and 
Social Group Membership and Experience with Market 
Mechanisms (as measures of the market socialization 
dimension). As shown in Table 3, respondents with low 
trust in government, high unemployment concern, and 
bigh market socialization were 15.8 percentage points 
more likely to display superior market reasoning than 
were their counterparts with high trust in government, 
low unemployment concern, and low market social- 
ization. The largest effects in Table 3 are for economic 
circumstances where a two-standard deviation increase 
in Unemployment Concern produces 4.6 to 7.0 percent- 
age point increases in the predicted probability of su- 
perior market reasoning. Yet even the relatively small 
effects of a two-standard deviation decrease in govern- 
ment trust are notable, producing 3.4 to 5.4 percentage 
point increases in the likelihood of superior market 
reasoning. 


Basic Economic Rules: The Property 
Rights Vignette 


An experimental vignette was included in the survey 
in order to assess the manner in which respondents in 
a very deprived transition context respond to formal 
and informal norms regarding property rights.” Our 
expectation is that norms, both formal and informal, re- 
garding property rights are relatively robust regardless 
of the nature of one’s personal or social capital. Hence, 
we predict that respondents will recognize the collec- 
tive gains associated with respecting property rights 
and respond to the experimental stimuli accordingly. 


17 Our implementation of this experimental vignette is modeled on 
Gibson and Gouws 1999. 





In addition, we do not expect any systematic variation 
in these “appropriate” responses attributable to the 
personal and social capital characteristics of the re- 
spondents. The experiment consists of three treatments 
that are summarized in Table 4. 

Survey respondents were randomly assigned to one 
of the eight treatments resulting from the scenario ma- 
nipulation. The major actor in the vignette is Michel, 
a purported property owner. All respondents were 
told that the Prefet of the department (region) had 
expropriated Michel’s property. Each respondent was 
offered a version of the property rights story that had 
three components reflecting the different treatments. 
Each version described Michel as occupying a piece 
of property—in some cases he had valid title to the 
property in others he did not. A sentence described 
the extent to which Michel had improved the land dur- 
ing the period he occupied it. A concluding sentence 
described how the state (regional/department govern- 
ment) planned to use the land—to construct either a 
water tower that would serve the interests of the com- 
munity or a new headquarters for the Prefet’s political 


party.'8 


18 The questionnaire included a number of checks on whether the 
experimental manipulations had the intended effects on the respon- 
dents. First, interviewers were asked to assess the extent to which 
respondents understood the vignette. Interviewers reported that 
98% of the respondents had at least a good cursory (“passable- 
ment”) understanding of the vignette, while 70% were reported as 
having a very good understanding of the vignette. Second, several 
checks were included to assess consistency of interpretation of the 
scenano. Respondents were asked whether they agreed with the 
Prefet’s decision to expropriate Michel’s property and then whether 
they agreed with the Prefet’s proposed use of the land. We would 
expect a strong, although not perfect, correlation between these two 
responses, which is the case with a correlation of 0.60. The survey 
also checked the experimental manipulation related to Michel's use 
of his land by asking respondents to evaluate the degree to which 
Michel could be characterized as hardworking versus lazy. The mean 
evaluation of Michel's work ethic was significantly more positive at 
the 0.0001 level for the “value added to land” scenario (t-statistic 


445 


Self-Interest, Social Justice, and Market Transistion 





August 2004 


TABLE 4. Structure of the Vignette’s Experimental Manipulations 
Manipulation 


Legal Clalm to 
Property 


Versions 


A. 
title. 

B. Holds valid tte to land. 

Individual Merit 


Knowlngly purchased fraudulent 


A. Over five years, occupant has let 
farmland fall Into disuse. 





Hypothesis 
Expropriation unacceptable 
when valld tite is held. 





Expropriation unacceptable when 
occupant “merits” tte to land. 


B. Over five years, occupant has 
significantly improved productivity 


of farmland. 


Communal Merit 
water treatment plant. 


B. Expropriation for partisan 
tical i 


The statements in the Hypothesis column in Table 4 
indicate the impact we expect the treatments to pro- 
duce. The first hypothesized effect indicates the extent 
to which respondents respond positively to formally 
recognized property rights. We expect respondents to 
be more critical of the expropriation in those cases in 
which Michel has legal title to the property. Secondly, 
reflecting informal notions of property rights based on 
evidence of personal contributions to property value, 
we expect that opposition to expropriation will be 
higher for the treatment in which Michel improves the 
value of the land. Finally, we predict that the legitimacy 
of the state’s expropriation authority should vary with 
the extent to which the use of the land is designated for 
the public good. 

Table 5 illustrates the results of the vignette ex- 
periment. These results indicate that the experimental 
treatments produced exactly the effects expected. Note 
that a high score indicates opposition to the state expro- 
priation. Respondents exposed to the legitimate title 
vignette had uniformly higher mean scores. In addi- 
tion, respondents who were told that Michel improved 
the value of the land had higher mean scores. Finally, 
opposition to the Prefet’s expropriation of the land was 
greater in the cases in which the land was used for a 
new headquarters for the Prefet’s political party. 

Extending the analysis in Table 5, we estimated or- 
dered probit models of responses to four questions 
asked after the vignette that delve into the extent and 
nature of the respondent’s opposition to the Prefet’s ex- 
propriation of Michel’s land (see the Appendix for de- 
tails on question wording and variable coding). These 
ordered probit models included dummy variables for 
the three treatments in the vignette. In addition to 
the “disagreement with Prefet’s decision” question 


for difference of means test, 47.50), mdicating that respondents were 
in fact appropriately influenced by this experimental manipulation. 
Finally, respondents were also asked to what extent they believed 
the Prefet was motivated by concerns for his political career (as 
opposed to civic considerations). The mean evaluation of the Prefet’s 
motivation was significantly more negative at the 0.0001 level for the 
“party headquarters” version of the vignette (t-statistic, 36.14), which 
again indicates that the experimental manipulation had the intended 
effect on respondents. 
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A. Expropriation for constructing 





Communal goals trump 
Individual merit and legal 


property rights. 





TABLE 5. Effects of Scenario Characteristics 
on Reaction to Property Rights Vignette 
Question. Do you agree with the Prefet’s decision to 
expropriate Michel's property? 


Michel had... 
Title No ttie 











Scenano Version 

Dunng tenure on land he... 
made no effort to improve 
the land. 





2.28 1.21 
(07) (.06) 
N= 360 N= 356 


Improved the land. 1.78 


Use of expropriated land 
To construct a water storage 
tower for the community 


To bulld a new party 
headquarters 


Improvement and land use 

No effort to Improve land 
and use to construct a water 
tower for the community 

No effort to Improve land and 
use to bulld party 
headquarters 

Improved the land and use 
to bulld party headquarters 


Improved the land and use 
to construct a water tower 
for the community N = 194 
Note. Mean responses for treatment groups. Higher scores ın- 
dicate greater opposition to Prefet’s expropriation of Michel's 
land. Values in parentheses are standard errors. 








(analyzed in Table 5), we modeled responses to ques- 
tions about the amount of Michel’s reimbursement, 
whether Michel has the right to keep the land, and 
the severity of the Prefet’s violation of the law. Rather 
than presenting the ordered probit coefficients, Table 6 
presents predicted probabilities derived with those 
coefficients to illustrate the estimated effects of the 
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TABLE 6. Estimated Effects of Scenario Characteristics on Probabilities (Pr) of Negative Reactions 


to Pro Rights Vignette 
Scenarlo Characteristics 


Improved 

Title? Land? Land Use 
No No Water tower 
No Yes Water tower 
No No Party HQ 
No Yes Party HQ 

No Water tower 

Yes 

No 

Yes 


Reimbursement: 
Pr (Land’s Value or 
or Twice Its Value) 

.364 (.025) 
.493 (.025) 
362 (.024) 
.491 (.024) 
.779 (.021) 
.864 (.016) 
.TTT (.019) 
.863 (.014) 


Disagreement with Amount of Michel's 
Prefet’s Decision: 
Pr (Disagree or 
Disagree Strongly) 


Michel Has Right to Severity of Prefet’s 
Keep Land: Violaton of Law: 
Pr (Absolutely) Pr (Violated a Lot 
.067 (.008) 
.098 (.011) 
.183 (.017) 
244 (.020) 
408 (.024) 
491 (.025) 
.643 (.026) 
.718 (.023) 


support for protecting personal property nghts. Baseline probability is the proportion of sample respondents who gave 
Values m parentheses are the standard errors for the predicted probabiites. As In Table 3, we calclucated theee standard errors using 


the delta method. 


vignette treatments.!? For instance, the fourth column 
in Table 6 presents the predicted probabilities of dis- 
agreeing with the Prefet’s action—considered here as a 
response supporting property rights—for the eight pos- 
sible combinations of the three treatments. The base- 
line probability at the bottom of this column (44.8%) 
represents a naive estimate of opposition to the Prefet’s 
decision and is simply the proportion of the sample 
giving a “disagree” or “disagree strongly” response. 
Respondents who were read the most pro-property 
rights scenario (Michel has title, he has improved the 
land, and the land will be used for a party head- 
quarters) had an 83.5% probability of opposing the 
Prefet’s decision—almost twice the naive probability 
prediction. 

Table 6 demonstrates that each of the treatments had 
significant effects in the hypothesized direction. The 
legality of Michel’s claim produced the strongest effects 
followed by the communal merit of (or Prefet’s motiva- 
tions for) expropriating the land. This pattern is telling 
since it indicates that individuals in a transition soci- 
ety responded more strongly to the treatment about 
formal property rights than to the treatments about 
informal (“squatter”) property rights and about soci- 
etal interests. There is no indication here that citizens 
of an impoverished society with a recent authoritar- 
ian past have conceptions of property rights that differ 
from those we would expect to find in developed mar- 
ket economies. While this lack of cultural differences 
is consistent with our theoretical argument, it conflicts 


19 The ordered probit coefficients are available from the authors 
upon request. All of the treatment coefficients except two were 
statistically significant at the 0.001 level. The two exceptions were 
the Michel Improved Land coefficient in the Severity of Prefet’s vio- 
lation of law model, which was only nignificant at the 0.01 level, and 
the Land Used for Party HQ coefficient in the Amount of Michel’s 
reimbursement model, which was msignificant at even the 0.10 level. 
Overall, the treatment effects had considerable predictive power re- 
ducing the prediction error by 14.1% for Disagreement with Prefet’s 
decision, by 17.7% for Amount of Michel’s rembursement, by 30.4% 
for Michel has right to keep land, and by 23.8% for Severity of Prefet’s 
violation of law. 





with the perspective that respect for property rights is 
not a basic, instinctual norm and must develop over 
time. 

Another interesting finding is that communal merit 
played no role in explaining attitudes toward the 
amount that Michel should be reimbursed. This find- 
ing indicates that respondents’ attitudes toward reim- 
bursement were not conditioned on communal inter- 
ests (i.e., whether the state intended to use the land 
for the public good). While use of the land for the 
public good increased the likelihood that respondents 
would perceive the Prefet’s action as justified, it had no 
bearing on their attitudes about what constituted fair 
compensation for the loss of Michel’s land. In other 
words, this finding suggests that Benin citizens believe 
that communal interests can justify infringement on 
property rights by the state but does not invalidate 
those rights and thereby reduce the state’s obligation 
to compensate Michel for the loss of his property.” 
Again, this response to the treatment is consistent with 
our theoretical argument but conflicts with the perspec- 
tive that citizens’ conception of property rights should 
vary across cultural contexts. 

The results in Table 6 represent extremely com- 
pelling support for our contention that regardless of the 
sociopolitical context individuals respond favorably to 
both formal institutional guarantees for property rights 
and to more informal norms regarding property rights. 
Support for property rights resonates in a very strong 
fashion for the average citizen even in one of the most 


20 In an auniliary analysis (available from the authors upon request), 
we further investigated this issue by including an interaction be- 
tween Social Group Membership and exposure to the Land Used 
for Party HQ scenano. This interaction only proved significant in 
the Amount of Michel’s reimbursement model, m which it had a 
negative effect. In other words, this auxiliary analysis revealed that 
support for reimbursing Michel for infringement on his property 
rights to build a party headquarters was conditioned on the density 
of the respondent’s social network Respondents with denser social 
networks, and presumably stronger communitarian values, were less 
likely to support a greater reimbursement for Michel when his land 
was expropriated for a political purpose. 
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TABLE 7. Least Squares Model of Summary 
Reactlon to Property Rights Vignette 


Varlable 

Michel Held Title 

Michel Improved Land 

Land Used for Party HQ 
Education 

Urban Resident 

Media Usage 

Social Group Membership 
Importance of Ethnic Identity 
Fon 


Fon *« Importance of Ethnic 
Identity 

Expertence with Market 
Mechanisms 

Deference to Authortty 

Trust In Government 

Income 

Unemployment Concern 

Personal Property Ownership 

Female 

Regional dummy variables 


F-statistic of model 
significance 
Note. Dependent 
to the property nghts vignette with higher values representing 
greater support for property rights (see the Appendix). N = 
1,382 ™* p < 001, * p < .05 (two-tailed test). 





economically deprived regions of the world and a coun- 
try with a long historical tradition of antimarket politics 
and policies. We now move to exploring whether these 
positive attitudes toward property rights are confined 
to particular segments of this transition society. Of par- 
ticular interest is testing our argument that support for 
basic economic rules, in this case property rights, is 
not conditioned by personal or social capital. Hence, if 
our theoretical perspective is correct, only the exper- 
imental treatments should generate differences in op- 
position to the government’s expropriation of Michel’s 
land. In other words, our theoretical expectation is that 
the addition of variables measuring personal and social 
capital will have no statistically significant impact on 
respondents’ reaction to the stimuli. 

Table 7 takes this additional step in analyzing re- 
sponses to the vignette by modeling a summary mea- 
sure of support for property rights (see Appendix for 
definition of this variable). In addition to the treatment 
variables, this analysis includes the personal and social 
capital explanatory variables employed in the analysis 
of the dictatorship survey experiment. Consistent with 
the dictatorship results, and with our theoretical argu- 
ment, there is no evidence that individual character- 
istics associated with personal capital play any role in 
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shaping utility maximization behavior. Also consistent 
with our argument is the poor performance of the so- 
cial capital variables—only Social Group Membership 
proves significant at the 5% level.” To the extent that 
respect for property rights is a basic norm, opposition 
to the state infringing on those rights should not vary 
with personal and social capital.” 


CONCLUSION 


This article explores whether there are systematic dif- 
ferences in economic reasoning and what might ac- 
count for them. By identifying explanations for sys- 
tematic variation in economic reasoning, our analysis 
provides some insights into the dynamics of successful 
(or unsuccessful) transition to a market economy. It 
is, after all, reasonable to question the applicability of 
classic models of economic reasoning in contexts with 
cultural and historical settings that widely diverge from 
those of advanced industrial countries. The literature 
on market transitions is replete with rather broad as- 
sertions about the appropriateness and the feasibility 
of implementing market reforms in contexts with an 
underdeveloped tradition of classic market institutions. 
But what precise aspects of economic reasoning are 
likely to be relevant in these transition contexts? Fol- 
lowing the lead of Bates (1981), we attempt to integrate 
rigorous theoretical perspectives from microeconomics 
and rational choice with area studies perspectives on 
the norms and cultural values of traditional societies. 
We argue for a multidimensional conceptualization of 
economic reasoning and propose an important distinc- 
tion between economic rules (or norms) and economic 
choices. Economic rules constitute basic, instinctual 
values that govern market exchange, such as prop- 
erty rights, individual freedom, and individual secu- 
rity. A second critical facet of economic reasoning re- 
lates to the nature of economic choices—such as utility 


21 We also investigated the effects of personal and social caprtal varı- 
ables by adding them to the ordered probit models used to derive the 
predicted probabilities presented m Table 6. This aunliary analysis 
(available from the authors upon request) was largely consistent with 
the results in Table 7. The only exception was the effect of Trust in 
Government, which proved to have significant negative effects on 
the likelihood of questioning the legitimacy of the Prefet’s decision 
to expropriate Michel’s land but a significant positive effect on the 
likelihood of supporting a larger reimbursement for Michel This 
pattern of effects suggests that respondents in Benin with high levels 
of government trust are more deferent to the government when 
judging the infringement on Michel’s property rights but temper that 
deference with stronger support for Michel being fairly compensated 
for the loss of his property. 

2 The marginally significant negative effect of Experience with Mar- 
ket Mechanisms on the summary measure of support for property 
nights in Table 7 is counterintuitive since one would expect market 
experience to stmulate support for property rights. Yet a developing 
economy like Benin’s is more likely to have observed the influence 
of patronage, especially in the regulation of business. This mterpre- 
tation is borne out by our auxiliary analysis of the four items that 
compose the summary measure of support for property rights. Mar- 
ket experience might increase support for property nghts but that 
experience also exposes the respondent to cases where the abuse of 
property nghts was greater than the Prefet’s expropriation of land 
for public use. 
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maximization—in market contexts. Both, we would ar- 
gue, are critical elements of properly functioning mar- 
kets. 

Why might economic reasoning systematically dif- 
fer in one context versus another? Building on Becker 
(1996) and others, we argue that systematic variations 
in economic reasoning can be a function of either per- 
sonal or social capital. Personal capital represents in- 
dividual characteristics such as cognitive capabilities 
and information levels. Social capital represents social 
contextual factors that can either increase or decrease 
the costs associated with classic economic reasoning. 
The evidence presented here, based on experiments 
embedded in a public opinion survey in Benin, clearly 
indicates that commitments to the basic foundations of 
economic interaction—in this case property rights— 
are widely shared by citizens, even those in a very 
nascent market economy such as Benin’s. Moreover, 
personal capital—such as education or information lev- 
els—is not a precondition for commitment to these ba- 
sic rules, Similarly, the nature of one’s social capital has 
no significant bearing on commitment to these rules. 
This finding suggests that commitment to private prop- 
erty—one of the critical foundations on which market 
economics are built—is in some sense a universal trait. 

The results of the experimental dictatorship game 
embedded in our survey suggest, on the other hand, 
that there is systematic variation in the nature of eco-* 
nomic choices that individuals make. As we expected, 
utility maximization behavior is not systematically re- 
lated to personal capital. Greed and self-interest do not 
require particularly sophisticated cognitive reasoning 
or high levels of information. On the other hand, also 
consistent with our theoretical argument, social capi- 
tal does affect the extent to which individuals utility 
maximize. Social pressures and expectations can either 
increase or decrease the likelihood that individuals 
maximize personal utility. Hence, to the extent that we 
can observe variation in utility maximization—or, we 
would speculate, other economic behavior consistent 
with rational choice assumptions underlying economic 
reasoning—tt is likely the result of variation in contex- 
tual incentives or social 

We believe that a more precise understanding of 
systematic variation In economic reasoning is an im- 
portant foundation for developing policies aimed at 
promoting transitions to successful market economies. 
Economists recognize that successful efforts to pro- 
mote economic development are those that get the 
individual-level incentives right. Policies need to be 
designed to reward growth-compatible behavior on the 
part of citizens. Developmental economists are guided 
in crafting these policies by the rich theoretical assump- 
tions of classical micro-economic theory. Our results 
suggest that for the most part the norms and choices 
that individuals make in widely varying socio-economic 
contexts are likely to conform to classical assumptions. 
In particular, the basic economic norms underlying 
market behavior seem to be widely embraced with 
little evidence of any meaningful systematic variation 
across individuals. Hence, policy makers can assume 
that adopting the institutional infrastructure of market 
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economies—property rights, individual freedoms, en- 
forcement of contracts, etc.—will be widely endorsed 
by citizens and will likely elicit behaviors consistent 
with micro-economic theory. Additionally, develop- 
ment policies often assume citizens are likely to make 
choices that reflect the utility maximization maxims 
of micro-economic theory. Here, policymakers need to 
recognize that there are no inherent informational or 
educational barriers to behaving in such a fashion— 
the potential for self-interested behavior is pervasive 
within the population. Yet our results certainly indi- 
cate that utility maximizing behavior can be tempered 
by social rewards and costs. Hence, in order to get 
incentives right, policymakers need to understand the 
impact of social capital and how the balance of social 
rewards and costs in any particular society is likely to 
affect the incentives to engage in utility maximizing 
behavior. 


APPENDIX 


This appendix discusses the sampling procedures for the 
Benin survey and variable coding and presents the word- 
ing of the two survey experiments—Dictatorship Game and 
Property Rights Vignette. First, we discuss the coding of the 
dependent variables analyzed in Tables 2, 6, and 7. Second, we 
report the coding of the explanatory variables used in these 
analyses. Finally, we present the wording of the two survey 
experiments. 


Sampling Procedures for Benin Survey 


The Benin survey, conducted in May 2000, is based on a 
stratified random sample of the entire country. The eight 
sampling strata consist of the six departments, which are the 
regional administrative districts that divide up the country, 
and the two major urban areas (Cotonou and Porto-Novo). 
The sampling frame included all geographic areas within a 
50-km range of towns with more than 10,000 inhabitants. The 
interviews consisted of a questionnaire approximately one 
hour in length that was administered face-to-face in French 
and in four of the major tribal languages. A total of 1,513 
interviews were completed—this represented a response rate 
of 98% (there was a total of 22 refusals). A detailed sampling 
document and copies of the English and French versions of 
the questionnaire are available at www.uh.edu/democracy. 


Coding of Dependent Varlables 


Market Reasoning—Responses to Dictatorship Game. 
This seven-category variable measures closeness to the 
equihbrium decision of keeping the entire prize except a 
trivial amount. It is coded as follows (first amount is for 
respondent): 0—7,000 CFA for me and 8,000 CFA for other 
person or 5,000/10,000 split (volunteered); 1—7,500/7,500 
split (volunteered); 2—7,600/7,400 split; 3—7,800/7,200 split, 
8,000/7,000 split (volunteered), 8,500/6,500 split, or 9,000/ 
6,000 split (volunteered); 4—9,500/5,500 split, 10,000/5,000 
split (volunteered), or 10,500/4,500 split; 5—11,500/3,500 
split, 12,000/3,000 split (volunteered), or 13,000/2,000 split; 
6—14,950/50 split or 15,000/0 split (volunteered). 


Market Reasoning—Responses to Property Rights 
Vignette. Reactions to the vignette were analyzed 


separately and as part of a summary measure of support for 
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TABLE At. 
Variable 
7-Category Dictatorship Response 
Disagreement with Prefet’s decision 
Amount of Michel's reimbursement 
Michel has right to keep land 
Seventy of Prefet’s violation of law 
Summary Property Rights Response 
Michel Held Title 
Michel Improved Land 
Land Used for Party HQ 
Education 
Urban Resident 
Medla Usage 
Social Group Membership 
Importance of Ethnic Identity 
Fon 
Fon x Importance of Ethnic Identity 
Experience with Market Mechanisms 
Deference to Authority 
Trust In Govemment 
Income 
Unemployment Concam 
Personal Property Ownership 
Female 
Reglonal dummy vanables 

Atacora 

Borgou 

Mono 

Oueme 

Zou 





personal property rights. Respondents were asked four ques- 
tions designed to capture the extent to which they support 


property rights: 


“Do you agree with the decision of the Prefet to expropri- 
ate Michel’s property?” Responses were coded from 0 
for “agree strongly” to 4 for “disagree strongly.” 

“Do you think that the Prefet should compensate Michel?” 
Respondents who answered “yes” were then asked: 
“To what extent should the Prefet reimburse Michel?” 
Responses to these questions were coded as fol- 
lows’ 0—Prefet should not compensate Michel; 1—one- 
quarter of the land’s value; 2—one-half of the land’s 
value; 3—three-quarters of the land’s value; 4—full 
value of the land; 5—twice the land’s value. 

“Do you think that Michel has the right to keep his farm- 
land?” Responses were coded from 0 for “absolutely 
not” to 3 for “absolutely.” 

“To what extent do you think that the Prefet has violated 
Benin law regarding the protection of private prop- 
erty?” Responses were coded from 0 for “did not violate 
at all” to 3 for “violated a lot.” 


To the extent that these items tap a latent “support for pri- 
vate property” attitude, it would be reasonable to model 
them as a single construct. Principal component factor anal- 
ysis of the four items identifies a single significant compo- 
nent (eigenvalue of 2.50 and 62.4% variance explained). 
Accordingly, we constructed a summary measure of sup- 
port for property rights by standardizing (i.e, subtract- 
ing the mean and dividing by the standard deviation) the 
four variables and then adding them together. The Cron- 
bach a coefficient for this measure is 0.793. Respondents 
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Mean SD Min. Max. 
2.81 1.72 0 6 
1.98 1.38 0 4 
2.61 1.98 0 5 
1.52 1.28 0 3 
1.44 1.27 0 3 
.04 3.14 —5.07 5.06 
.498 0 1 
519 0 1 
501 0 1 
5.32 5.51 0 16.5 
.396 0 1 
01 2.27 —3.37 5.91 
1.27 192 0 14 
3.15 1.18 0 4 
301 0 1 
84 1.48 0 4 
—.004 1.81 —3.82 4.18 
—.003 1.56 —6.89 2.54 
—.01 2.05 —3.30 6.78 
4.69 5.14 0 45 
2.16 1.55 0 4 
95 1.24 0 4 
347 0 1 
134 0 1 
.167 0 1 
.137 0 1 
.181 0 1 
165 0 1 





who did not answer all four questions were excluded 
from the analysis of the summary measure presented in 
Table 7. 


Coding of Explanatory Variables% 


Michel Held Title—Dummy variable denoting scenarios stat- 
ing that Michel has legal title to farmland and that title is 
properly registered with the authorities. 

Michel Improved Land—Dummy variable denoting sce- 
narios stating that Michel worked to improve the productivity 
of the farmland. 

Land Used for Party HQ—Dummy variable denoting sce- 
narios stating that the Prefet ordered the expropriation of 
Michel’s land in order to build a new headquarters for the 
ruling political party in the region. 

Education—Respondent’s education level measured in 
years. 

Urban Resident—Dummy variable denoting residence in 
city. 

Media Usage—Constructed from responses to questions 
about how frequently the respondent watches news programs 
on television, listens to news on the radio, and reads the 
news in the daily paper. These responses were coded from 


B Tn an auxiliary analysis (available from the authors upon request), 
we reestimated the model in Table 7 with an augmented sample that 
included respondents who failed to answer all four questions about 
the vignette. For these respondents, we coded missing values to the 
sample means before creating the summary measure. This auriliary 
analysis produced results very similar to those presented m Table 7, 
which did not alter any of our theoretical inferences. 
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0 for “never” to 4 for “every day.” The resulting three vari- 
ables were standardized and then added together to construct 
Media Usage. The Cronbach a coefficient for this measure is 
0.626. 

Social Group Membership—Constructed from responses 
to questions about membership in 14 types of organizations. 
Constructed a variable for each organization coded from 0 for 
nonmember to 2 for active member. Social Group Member- 
ship is the sum of these separate organization variables, such 
that higher values represent a more extensive social network. 
The Cronbach a coefficient for this measure 1s 0.436. 

Importance of Ethnic Identity—Coded response to ques- 
tion on importance of ethnic group identity ranging from 0 
for “not very important” to 4 for “very important.” 

Fon—Dummy vanable denoting identification with Fon 
ethnic group. 

Deference to Authority—Constructed from responses to 
questions on how strongly the respondent agrees or disagrees 
(coded on a five-point scale) with the following two state- 
ments: “People shouldn’t accept everything the authorities 
say without questioning it” and “It is very good that people 
today have greater freedom to protest against things they do 
not like.” These variables were standardized and then added 
together, with higher values denoting greater deference to 
authority (i.¢., more agreement with the first statement and 
less agreement with the second statement). The y coefficient 
for these two variables has an asymptotic t-statistic of 6.83. 

Government Trust—Constructed from coded responses to 
the following three questions: 


“Generally speaking, would you say that this country is run 
by a few big interests looking out for themselves or that 
it ıs run for the benefit of all the people?” (Responses 
coded from 0 to 2) 

“What do you think about how the people working for 
the government use the money of taxpayers? Do they 
waste a lot of the money, some of the money, little of 
the money, or none of the money?” (Responses coded 
from 0 to 3) 

“In your opinion, does the government include a lot of dis- 
honest people, some dishonest people, or no dishonest 
people at all?” (Responses coded from 0 to 2) 


These variables were standardized and then added together, 
with higher values denoting greater trust of government. The 
Cronbach «a coefficient for this measure is 0.426. 

Experience with Market Mechanisms—Constructed from 
coded responses to the following two questions: 


“How difficult would you say it is for someone like yourself 
to start a successful small business?” (Responses coded 
from 0 for “very difficult” to 4 for “very easy”) 

“How difficult would it be for you to start a small busi- 
ness and have it grow into a very large business?” 
(Responses coded from 0 for “very difficult” to 4 for 
“very easy”) 


These variables were standardized and then added together, 
with higher values denoting greater experience with market 
mechanisms. The y coefficient for these two variables has an 
asymptotic t-statistic of 32 45. 

Income—Monthly household income measured in 10,000 
CFA. 

Unemployment Concern—Constructed from questions on 
employment status and concern about becoming unem- 
ployed during the next 12 months. Variable ranges from 0 
for “not at all worried” to 4 for currently unemployed. 

Personal Property Ownership—Five-category measure of 
personal property ownership constructed from questions 


about 13 household items (e.g., iron, television, stove). Vari- 
able is coded 0 for respondents who do not own any of the 
13 items, 1 for those who own one item, 2 for those who own 
two or three items, 3 for those who own four to six items, and 
4 for those who own seven or more items. 

Regional Dummy Variables—Dummy variables denoting 
region of residence. Dummy variables were constructed for 
each of the regions except Atlantique, which serves as the 
baseline for comparison. 

Female—Dummy variable denoting female respondents. 


Wording of Dictatorship Game Survey 
Experiment 


As you know people across the country are being interviewed 
for this survey. After all of the interviews are completed, two 
of the people interviewed will share a prize of 15,000 CFA. 
Here 1s how the winners of the money will be decided. Each 
person interviewed has been given an envelope just like the 
one I handed to you. The envelope contains your name and 
address. After we have finished all of our interviewing, we 
will put all of the envelopes in a large urn. One envelope 
will be selected. If your envelope is selected, you will be paid 
whatever amount of the 15,000 CFA you decided to give to 
yourself. Then another person’s name will be selected from 
the urn. This second person will receive whatever amount 
remains of the 15,000 CFA prize. Your decision will deter- 
mine how much money you might receive so please take the 
decision seriously. 

In the event that your name 1s chosen first, we want you 
to decide right now how much of the 15,000 CFA prze you 
want to give to yourself. Here is a list of amounts. Circle the 
amount of the 15,000 CFA you want to give to yourself. The 
rest of the money will be given to the other person selected 
from the urn. After you have circled the amount place the 
paper in the envelope and seal it. If your name 1s selected 
first then we will open your envelope and you will receive 
the amount circled: 1) 14,950 for me, 50 for other person; 
2) 13,000 for me, 2,000 for other person; 3) 11,500 for me, 
3,500 for other person; 4) 10,500 for me, 4,500 for other per- 
son; 5) 9,500 for me, 5,500 for other person; 6) 8,500 for me, 
6,500 for other person; 7) 7,800 for me, 7,200 for other person; 
8) 7,600 for me, 7,400 for other person; 9) 7,000 for me, 8,000 
for other person. 


Wording of Different Scenarios for the 
Property Rights Vignette 


Three parts of the vignette were manipulated to produce 
eight different scenarios. These parts dealt with whether 
Michel had legal title to the land (title), whether Michel 
had added value to the land (merit), and the intended use’ 
of the land by the prefet government (land use). There are 
two versions of each part of the vignette. Each version for 
each part is presented below. 


Title. Positive version is: Michel has legal title to the farm- 
land he occupies and the title is properly registered with the 
authonties. Negative version is: Michel is not the legal heir of 
the farmland he occupies although he did purchase the land 
along with purchasing false titles to the property. 


Merit. Positive version is: For a number of years Michel 
worked to improve the productivity of the farmland. Nega- 
tive version is: For a number of years Michel has done nothing 
to improve the productivity of the farmland. 
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Land Use. Positive version is: The Prefet administration 
ordered the expropriation of Michel’s land in order to build 
a large water tower that would provide the village with 
drinking water. Negative version is: The Prefet administra- 
tion ordered the expropriation of Michel’s land in order to 
build a new headquarters for the ruling political party in the 
department. 
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ights are so beneficial, then why are they so rare? More precisely, what factors promote secure 
property rights? Do rightholders view private or state agents as a greater threat to property? Do 

they value bureaucratic commitment or discretion? I use evidence from two original surveys of company 
managers in Russia to assess the institutional, social, and political determinants of secure property rights. 
Most managers said that state arbitration courts did not work badly in disputes with other businesses, but 
few expected these courts to protect their rights in disputes with state officials. More importantly, managers 
who expressed confidence that state arbitration courts could constrain state officials invested at higher 
rates, even controlling for the perceived effectiveness of state institutions. Thus, tightening constraints on 
state agents can increase the security of property and bolster state capacity. These results generate insights 
into debates on the role of state in the economy, the origins of secure property rights, the nature of state 


Ps dispute that secure property rights are critical to economic development. But if secure property 
ri 


capacity, the importance of informal institutions, and the process of legal reform. 


ecure property rights are commonly seen as 

critical to economic development. North (1973) 

famously attributed the “Rise of the Western 
World” to the relative security of property rights in 
northwestern Europe and recent studies have linked 
secure property rights to high rates of economic 
growth, investment, and per capita wealth (Barro 1997; 
Johnson, McMillan, and Woodruff 2002; Kaufmann, 
Kray, and Zoido-Laboton 1999). However, if secure 
property rights are so beneficial, then why are they 
so rare? More precisely, what factors promote secure 
property rights? Some scholars emphasize that soci- 
etal factors, such as broad levels of social trust or civic 
participation, are conducive to secure property rights. 
Others focus on the partisanship of political elites. Most 
commonly, scholars attribute the security of property 
rights to various features of the state. 

Legal scholars, historians, economists, and political 
scientists have made important contributions to this 
debate, but many issues remain unresolved. This is un- 
fortunate because analyses of the security of property 
rights raise core theoretical issues. First, they call atten- 
tion to the role of the state in the economy. In the spirit 
of Hobbes, one longstanding theory identifies private 
competitors in the market as the main threat to prop- 
erty rights and looks to state institutions as a remedy. To 
strengthen property rights, policymakers should give 
courts, police forces, and state bureaucracies sufficient 
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resources and discretion to protect property against 
trespass by private agents. 

An equally longstanding theory locates the threat 
to private property within the state itself. In the tra- 
dition of Locke, Weingast (1993, 287) cautions: “The 
fundamental economic dilemma of a political system 
is this: a government that is strong enough to protect 
property and enforce contracts is also strong enough 
to confiscate the wealth of its citizens.” State agents, 
with their great advantage in the use of force, are 
uniquely positioned to threaten property rights. Thus, 
the key to protecting property is placing constraints on 
state agents. To strengthen property rights, policymak- 
ers should increase the independence of courts from 
other state officials and empower groups within civil 
society as a bulwark against arbitrary behavior by state 
agents. These arguments have long been conducted 
on a philosophical plane, but scant evidence exists 
to determine which of these threats to property are 
more severe and which proposed solutions are more 
effective. 

Second, many argue that effective institutions— 
broadly understood—are related to the security of 
property rights, but we have little knowledge about 
which specific institutions are especially important 
(Acemoglu, Johnson, and Robinson 2001). Given 
scarce resources should governments seek to create 
more effective courts, police forces, or state bureau- 
cracies? Or would their efforts be better spent on 
strengthening informal institutions, such as dense social 
networks that promote trust and prevent opportunism? 
Few studies have sought to identify the relative im- 
portance of different institutions for the security of 
property rights. 

Similarly, our knowledge of how institutions affect 
the security of property is limited. Do courts, for exam- 
ple, increase the security of property rights primarily 
by protecting rightholders against threats from state 
bodies or from other private agents? Courts may be rel- 
atively effective in disputes between private agents, but 
relatively ineffective in disputes with state officials. The 
theoretical conclusions and policy prescriptions that 
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follow from these two perspectives are quite different. 
Advising countries to “get the institutions right” is of 
little help if we do not know which institutions are 
important and how they work. 

Third, these debates inform the study of property 
rights themselves. Few studies have tried to distin- 
guish precisely which aspects of property rights are 
especially important. Are property rights that are free 
from trespass sufficient to promote investment? Or 
must rightholders believe that their property rights are 
also credible, that is, unlikely to be subject to arbitrary 
change over time? The former view suggests that schol- 
ars and policymakers should focus on building state 
capacity to prevent trespass, whereas the latter implies 
a focus on constraining state power to reduce arbitrary 
changes in property. 

These questions have important consequences for 
social science theory, but they also have signal im- 
plications for policy. Over the last decade, interna- 
tional financial institutions, individual countries, and 
private donors have poured vast sums into Asia, Latin 
America, and the postcommunist world to promote 
stronger legal institutions. One observer recently noted 
that a “rule of law revival” is under way among in- 
ternational policymakers (Carothers 1998). Yet few 
studies have tried to identify the institutional, social, 
and economic determinants of secure property rights 
at the individual level. This essay uses evidence from 
two original surveys of business elites in Russia in 1998 
and 2000 to begin to address these issues. 


PROPERTY RIGHTS 


Property rights are typically treated as a bundle of 
rights that include the power to consume, obtain in- 
come from, and alienate assets, such as land, labor, or 
capital (Barzel 1989, 2; Riker and Weimer 1993). In- 
dividuals, groups, or the state may exercise rights over 
these assets as is typically the case with private, com- 
mon, and state property, respectively. Property rights 
vary along many dimensions, but three have received 
special attention: the clarity of allocation, the ease of 
alienability, and the security from trespass. In recent 
years, scholars have also suggested that secure property 
rights must be credible, that is, unlikely to be sùbject to 
arbitrary changes over time. 

These features of property rights influence eco- 
nomic performance by determining the extent to which 
rightholders are rewarded for their efforts. If righthold- 
ers view their assets as subject to competing claims by 
others, difficult to sell, vulnerable to theft, or lacking 
in credibility, then the costs of exercising these rights 
increase. Where these costs exceed the expected return, 
they will have little incentive to engage in productive 
economic behavior, and economic performance will 
suffer.! 


1 For tractability’s sake, neoclasmcal economics 1s built on assump- 
tions of complete private property nghts and perfect competition. A 
central task of the new institutionalism 1s to relax these assumptions 
and explore how institutions affect the ease with which rightholders 
can exercise claims on ther assets in a variety of settings 
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Rightholders may employ private agencies to in- 
crease the security of property rights, but in complex 
economies the state can generally protect property 
rights more efficiently than other organizations due to 
its economies of scope and scale (North and Thomas 
1973). Among their tasks, state agencies, such as the 
police, regulators, and the procuracy, enable the state to 
make property rights free from trespass, particularly by 
private agents. Courts can increase the security of prop- 
erty rights by resolving disputes among private agents, 
but also by placing constraints on state officials. Thus, 
property rights lie at the intersection of economics and 
politics. These aspects of property rights are fairly well 
understood, but the factors most important for making 
property rights secure are not. Most arguments fall into 
one of four categories. 


Private Threats and State Institutions 


One line of argument emphasizes private threats 
against property rights and looks primarily to the 
courts, police, and bureaucracies to prevent trespass. 
Because administering justice against private threats is 
a public good, effective state institutions foster secure 
property rights. This view gives pride of place to police 
and regulatory bureaucracies and focuses on the role of 
courts against private predators, such as unscrupulous 
competitors and criminals. Broadly put, rightholders 
who perceive various state institutions to be effective 
in preventing trespass should view their property rights 
as secure. 

This argument rings true in the postcommunist 
world, where tales of private threats to property by 
organized criminal groups are commonplace and many 
attribute weak property rights in the region to in- 
effective state institutions (Frye 2002; Holmes 1996). 
Roland (2002, xix) observes, “If anything, the experi- 
ence of transition shows that policies of liberalization, 
stabilization and privatization that are not grounded in 
adequate institutions may not deliver successful out- 
comes.” Observers of Russia have little trouble point- 
ing to ineffective state institutions that range from 
poorly trained bureaucrats making policy in unheated 
buildings to outgunned policemen in Russian-made 
Ladas chasing criminals in German-made Mercedes.” 
Graham (2002, 39) notes, “The real drama of the first 
decade of the new Russia was the fragmentation, de- 
generation, and erosion of state power.” 

A related argument emphasizes the corrosive effect 
of corruption on property rights. Corruption of public 
bureaucracies raises the costs of conducting business, 
increases uncertainty, and distorts investment (Shleifer 
and Vishny 1998). These effects are readily apparent in 
the postcommunist world where corruption has taken 
a heavy toll (Hellman, Jones, and Kaufmann 2000; 
Johnson, McMillan, and Woodruff 2002). 

These arguments locate the primary threat to prop- 
erty in private agents and trace the security of property 
to state institutions. Thus, secure property rights should 


2 In recent years, police in Russia have become better equipped 
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be associated with effective and uncorrupted courts, 
police, and other public agencies. 


State Threats, Credible Commitment, and 
Discretion 


The credible commitment argument stresses that the 
arbitrary use of state power by government officials 
may attenuate property rights. Again, examples from 
Russia abound.” Even if state officials are well trained, 
well equipped, and uncorrupt, where politicians and 
bureaucrats can disregard legal decisions, property 
rights are likely to be vulnerable. This “grabbing hand” 
view of business—state relations implies that regard- 
less of managers’ views of the capacity of legal in- 
stitutions in general, they may still fear violations of 
their property rights by state agents (Frye and Shliefer 
1997). 


Central to this argument is the theoretical problem of 
credible commitment. Property rights are often weak 
because economic activities involve time-inconsistent 
exchanges between state and private agents. More 
precisely, laws and policies often promise benefits in 
the future for changes in behavior today. For example, 
to encourage investment, a government may pass a 
law promising tax benefits for 5 years for firms that 
invest. After a firm invests, however, it is vulnerable 
to ex post violations of its property rights by state 
agents. If courts do not constrain state agents, the 
latter may impose confiscatory taxes regardless of le- 
gal rules to the contrary. Anticipating this possibility, 
rightholders will view their property as vulnerable and 
be reluctant to invest in the first place (Diermeier 
et al. 1997). 

Independent courts that raise the costs to the gov- 
ernment of violating the law ex post may mitigate 
this problem. North and Weingast (1989) argue that 
during England’s “Glorious Revolution,” Parliament 
replaced the notoriously corrupt Star Chamber with a 
far more independent court that could credibly punish 
both Parliament and King for violating the security of 
private property. This institution made it more difficult 
for the state to expropriate wealth. This constraint on 
state power, in turn, increased the security of private 
capital and encouraged investment, including loans 
to the government. By significantly raising the costs 
of ignoring inconvenient judicial decisions, the gov- 
ernment “tied its hands” and strengthened property 
rights (Root 1989). The problem of credible commit- 
ment underpins the irony that state agents with few 


3 Simachey (2003, 5) reports data from a 2002 survey of 300 bumness 
managers in three cites in Russia “The most common violations 
of property rights came from the local and regional government 
Among large enterprises (more than 500 employees) every second 
firm experienced a property rights violation by the regional govern- 
ment.” Most recently, the arrest of Mikhail Khodorkovsky, Russia's 
richest man, and several of his associates for a variety of alleged 
economic crimes has raised concerns over the selective enforcement 
of laws and the arbitrary use of state power. 

4 Campos and Root (1994), North and Weingast (1989), Root (1989), 
and Weimer (1997) 


constraints on their behavior may attenuate property 
rights precisely because rightholders understand that 
their property rights depend on the discretion of state 
agents. State agents who develop means to commit to 
comply with judicial decisions may foster more secure 
property rights. 

Such commitment, however, comes at the price of a 
loss of discretion. And some argue that in a highly un- 
certain transition environment there may be reasons to 
value discretion. Facing the unprecedented challenge 
of transferring property rights from state to private 
actors across the economy in a short period, bureau- 
crats and politicians may benefit from the ability to 
act quickly as new and unanticipated policy challenges 
arise. In this sprit, Holmes (1995, 75) notes, “One of the 
main priorities in Eastern Europe today is to preserve 
the government’s capacity to re-adjust to changing cir- 
cumstances. Politicians must therefore be able to rene- 
gotiate the rules while they are playing the game....” 
Institutions that tightly circumscribe state agents may 
limit the flexibility required to protect property in a 
transition setting. Though not neglecting the role of 
courts, this view emphasizes the value of enabling insti- 
tutions that provide state officials with the discretion to 
make policy and resolve disputes. This “helping-hand” 
view of relations between business and the state is par- 
ticularly favored by advocates of the “Chinese model” 
for transition economies and suggests that constraints 
on state power should, on balance, weaken property 


rights (Stiglitz 1999). 


Socletal lanations: Soclal Trust and 
Civic Participation 


A third group of arguments focuses on societal factors, 
such as social trust. Arrow (1974, 357) notes, “Vir- 
tually every commercial transaction has within itself 
an element of trust....It can plausibly be argued that 
much of the economic backwardness in the world can 
be explained by a lack of mutual confidence.” Dense 
social ties and widespread norms of reciprocity reduce 
the transaction costs of exercising property rights. Ex- 
pecting business partners to bear reputational costs for 
violating property rights, rightholders in high trust so- 
cieties can invest with confidence (Greif 1994). This 
view is common in the postcommunist world where 
some attribute variations in the strength of property 
rights to social ties developed under communist rule. 
For example, Russia’s experience of 70 years of com- 
munist rule with little room for trust-building social 
organizations compares poorly with Poland’s 40 years 
of communist rule that allowed for the Solidarity la- 
bor movement and a politically active Catholic Church 
(Hoff and Stiglitz 2002). 

Other observers identify civic participation as an 
important societal factor influencing the strength of 
property rights (Raiser 1999). Members of encom- 
passing social groups may have more opportunities to 
lobby state officials and may therefore be more likely 
to invest. Moreover, having overcome the collective 
action problem necessary to create the organization, 
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members may be better placed to punish state of- 
ficials for violations of property rights through co- 
ordinated political action (Putnam 1994; Weingast 
1997). Facing powerful groups of rightholders, state 
officials may prefer to respect property rights rather 
than subvert them. In sum, societal explanations sug- 
gest that strong property rights should be associ- 
ated with broad-based trust among business part- 
ners and high rates of civic participation in social 
organizations. 


Elke Partisanship 


A fourth argument ties the security of property 
to the partisanship of political elites Many have 
noted that political leaders committed to market- 
oriented reforms can increase the security of property 
(Przeworski 1991). Where economic agents believe 
that state officials are weakly committed to building 
a market economy based on private property, they will 
likely view their property rights as insecure. State of- 
ficials often expend great effort to convince private 
agents that they will not deviate from policies that 
inflict costs in the short run but will improve the econ- 
omy and strengthen property rights in the long run 
(Diermeier et al. 1997). 

Such arguments have particular relevance in tran- 
sition economies. Because constituencies in support 
of private property are often weak or absent early 
in the transformation, elites may play a lead role in 
defending the nascent private sector. In addition, fluid 
constitutional frameworks may allow the ideological 
commitments of leaders to assume greater prominence. 
The elite partisanship theory suggests that righthold- 
ers who believe that political elites are committed to 
building an economy rooted in private property may 
have greater confidence in the security of their property 
rights. 


THE SURVEY 


Postcommunist Russia offers an excellent opportunity 
to assess these theories. Because developed economies 
typically have strong legal institutions, elites committed 
to a market economy, relatively high levels of social 
trust, and fairly secure property rights, it is often diffi- 
cult to disentangle relationships among these variables. 
In contrast, because the creation of state institutions, 
social relationships, and the security of property rights 
have proceeded at different paces in postcommunist 
Russia, this case offers greater possibilities for under- 
standing the process of the creation of secure property 
rights. 

Using surveys of business elites to study property 
rights offers several advantages over existing literature. 
Many works on credible commitment and property 
rights rely on historical case studies. These studies 


5 See footnote 4 This essay differs from existmg work in other re- 


spects. In contrast to other surveys of business elites, it analyzes 
how perceptions of pohtcal elites influence property rights. Whereas 
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have made valuable contributions, but the microfoun- 
dations of the credible commitment logic often receive 
little empirical attention. Moreover, it is difficult to 
capture the perceptions of a large number of partici- 
pants in historical studies. Here I am able to assess the 
individual-level determinants of secure property rights 
by using an original survey of company managers in 
500 firms in Russia conducted in October-November 
2000. Researchers from VTsIOM, the All Russian 
Central for Study of Public Opinion, interviewed 
high-level business managers in Moscow, Nizhnii 
Novgorod, Novgorod, Smolensk, Tula, Ufa, Voronezh, 
and Ekaterinburg. The survey included firms in rich 
and poor, urban and rural, and left and right gov- 
erned regions. Firms were selected using a stratified 
random sampling technique. I obtained data from 
the state statistical agency (Goskomstat) on the num- 
ber of employees and the types of firms in each re- 
gion and then stratified the sample by size and type 
of firm. Researchers then selected firms at random 
from within these strata using Goskomstat data and 
regional business directories. Each firm within each 
stratum had an equal possibility of being included in 
the survey. Given the fluidity of Russia’s transition 
economy, it is difficult to determine how closely the 
sample reflects the true population at any given mo- 
ment.® However, the distribution of firms in the sam- 
ple across sectors, size, and profit rates roughly mir- 
rors a national sample.’ The sampling frame included 
firms from Goskomstat categories of 10 economic sec- 
tors and excluded firms in the agricultural, commu- 
nal services, health, social services, educational, and 
cultural sectors. 

Due to costs, I included only firms in the capital city 
in these regions, This problem is less severe than it 
appears at first glance because the majority of firms 
are located in the capital city in each of these regions. 
Interviewers spoke face to face with the chief executive 
officer, the chief financial officer, or the chief manager 
of at least 60 firms in each region. The response rate 
for the sample as a whole was 56%.8 

Most questions in the survey were innocuous and 
generated few incentives to dissemble, but a few may 
have been regarded as sensitive.” I tested all of the 


Frye and Shleifer (1997) evaluate the effects of the organization of 
regulation on property nghts and present comparisons of means in a 
smal] sample, this essay highlights the distinction between the state's 
ability to protect property in disputes with private and state agents, 
offers multivariate tests, and uses different data 

6 Goskomstat reporting on small firms are particularly inadequate 
(Xakovley 2001). 

In comparison to the national population of firms, industrial firms 

are slightly overrepresented (42% versus 54%) and retail firms are 
slightly underrepresented (27% versus 15%) in this sample Firms in 
the eight other sectors are included at roughly the same rate as the 
national population. 
8 Given the length of these elite-level interviews, this response rate 
is reasonable. The response rate for the American National Election 
Study was 60% in 2000 In four regions the response rate was over 
70% and in three other regions it was above the mean. In Smolensk, 
the response rate was 44%. Dropping observations from Smolensk 
produces no substantive change in the results. 

Some questions were used ın previous surveys in the postcommu- 
nist world. 
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questions in a pilot survey that included debriefings 
with some respondents. I also received comments 
from representatives of business organizations and 
from entrepreneurs known to the researchers. J asked 
several potentially sensitive questions using crude or- 
dered or binomial categories to alleviate some of these 
concerns. Previous research and pilot surveys sug- 
gested that managers are reluctant to reveal rates of 
profitability given the potential public and private costs 
of disclosing precise financial information. For exam- 
ple, rather than asking firms to identify how much profit 
they made last year, I asked whether they made a 
profit, took a loss, or broke even. Similarly, to mea- 
sure investment J asked managers whether they had 
built a new building or renovated their place of busi- 
ness rather than how much they reinvested in their 
firm. These investment measures can be verified by re- 
searchers and do not require the revelation of detailed 
financial data. What is lost in precision, I hope to gain 
in validity. 

Just over half the managers (55%) ran heavy or 
light industrial firms; 20% ran retail trade companies; 
and 25% headed firms in the construction, transport, 
or communications sectors. The average firm had 840 
workers; the smallest and largest firms had 4 and 53,000 
employees. Almost two thirds (65%) of the firms were 
formerly state-owned but had undergone privatization. 
Twenty percent were created after 1989 as new private 
firms; 15% remained state owned. Sixty-eight percent 
of firms claimed to have made a profit in 1999, 20% 
came out even, and 12% claimed to have lost money. 
Ten percent of the firms said that they had no serious 
competitors. Two percent of the firms were members of 
formal financial industrial groups, but just under a third 
were members of some type of production association. 
Thirty percent of firms were members of a business 
association, such as the Russian Union of Industrialists 
and Entrepreneurs. 


THE SECURITY OF PROPERTY RIGHTS 


To measure the security of property rights, I use three 
indicators of investment.!° I asked managers whether 
they had constructed a new building in the last two 
years, had extended trade credits to suppliers or buy- 
ers in the last two years, or were currently planning to 
make a significant new capital investment in the coming 
year. Twenty percent of managers had constructed a 
new building, 40% of managers had extended supplier 
or customer credits, and 31% of managers planned to 
make a new capital investment in the next year. These 
types of activities are good proxies for secure prop- 
erty rights because each requires significant immediate 
costs with only the promise of future revenue. More- 
over, given the diversity of firms in the sample, it is 
difficult to find a common measure of the security of 
property rights suitable for all firms. 


10 The responses are from business elites rather than the population 
at large. Like most firm surveys, responses may suffer from survivor 
bias. This is likely more problematic for samples that include only 
small firms. 
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CREDIBLE COMMITMENT AND STATE 
INSTITUTIONS 


The survey included a variety of questions to assess 
perceptions of the performance of state arbitration 
courts (GosArbitrazh) in Russia. Reformed in the early 
1990s from a Soviet era predecessor, these courts hear 
economic disputes between private legal entities and 
between private legal entities and state agents.!! Typ- 
ical cases involve taxes, debt collection, bankruptcy, 
and privatization. The state arbitration courts are or- 
ganized largely along regional lines with 82 courts of 
first instance located in Russia’s 89 regions. Ten higher- 
level arbitration courts hear appeals, as does a Supreme 
Arbitration Court. State arbitration courts heard over 
400,000 cases in 2000 and employ about 2700 judges 
who until recently were appointed for life (Solomon 
2002). The Ministry of Justice at the federal level pro- 
vides salaries and budgets for these courts, but local 
and regional governments often provide additional in- 
formal monies to supplement frequent shortfalls in fi- 
nancing. Judges are appointed by the executive branch 
after they have been screened and approved by a cor- 
porate organization of judges. 

Respondents were asked how frequently they turned 
to state arbitration courts to resolve business disputes. 
Seventy percent of firms had experienced some vio- 
lation of property rights over the last two years that 
they considered sufficiently grave to merit taking to 
court. Managers were far less likely to take such a 
dispute to court if the conflict was with the local or 
regional government rather than with another private 
firm.” Forty-four percent of firms that had a dispute 
with the local or regional government over the last two 
years turned to state arbitration courts to resolve at 
least one conflict. In contrast, 66% of managers who 
experienced a property rights violation by a business 
partner in the last two years took at least one dispute to 
court, indicating a greater willingness to use courts in 
cases involving private rather than state entities. Thus, 
it is fairly common for managers to use the courts 
in disputes with state and private entities, but they 
are more likely to take the latter to court than the 
former.” 

These responses reflect only the experience of man- 
agers who had a conflict in the last two years. To gain 
responses from all managers, interviewers asked a se- 
ries of hypothetical questions that probe managers’ 
expectations that courts could protect their rights in 


11 Courts of general jurisdiction hear cases involving physical rather 
than legal persons Observers have been more critical of these courts 


than the state arbitration courts. State arbitration courts were re- 
formed somewhat in December 2001. See Solomon 2002. Regional 
legislatures are now excluded from the appomtment process. 

12 Win rates of citizens against state officials are roughly 80%, which 
suggests the possibility of selection bias in the cases that citizens 
bring to court (Solomon 2002). Thus, court use 1s a poor proxy for 
court effechveness. Measuring all respondents’ perception of court 
effectiveness minimrzes this selection process. 

13 Hendley (2002) finds that businesses are suing the state at increas- 
ing rates Because the data presented here capture perceptions of 
arbitration courts at one point and do not measures change over 
time, the two findmgs may be entirely consistent. 


457 


Credible Commitment and Property Rights 


TABLE 1. Courts and the State 








August 2004 


No It Is Hard 
(%) to Say (%) 
20 


Courts can defend interests against local/regional 41 


government 
Courts can defend Interests against 
business partner 


State can ensure compllance If decision goes 


against the local/regional government 
State can ensure compliance If decision 


goes against a business partner 





18 
47 


67 13 


Questions 
"In the case of an economic dispute with the focal or regional government do you belleve 
that the state arbitration courts could protect your legal Interest?” 
“In the case of an economic dispute with a business partner do you believe that the state 
arbitration courts could protect your legal interest?” 
"Do you think that a decision of the arbitration court In your favor will be enforced ff it goes 


against the focal or regional govemment” 


"Do you think that a decision of the arbitration court In your favor will be enforced ff It goes 


agalnst a business partner?” 





cases involving state and private entities. Managers 
said that state arbitration courts were much less ef- 
fective in protecting their rights when the state was a 
party to the case. Interviewers asked managers: “In 
the case of an economic dispute with the local or 
regional government do you believe that the courts 
could protect your legal interests?” They then asked, 
“In the case of an economic dispute with a business 
partner do you believe that the courts could protect 
your legal interests?” Similarly, they asked whether the 
state could enforce decisions to protect their property 
Tights in disputes with state officials and with private 
entities. 

Table 1 reveals that 76% of managers expected state 
arbitration courts to protect their interests in cases 
involving other private businesses, but only 39% of 
managers said that courts could protect their interests 
in cases involving the local or regional government. 
Moreover, managers said that local and regional gov- 
ernments were much less likely to abide by judicial 
decisions than were private businesses. 

Managers gave relatively high marks to state arbitra- 
tion courts in comparison to other state institutions. 
When asked to rate the performance of a variety of 
state institutions on a scale of one to five, state arbi- 
tration courts received a score of 3.1, a higher rating 
than most other bureaucracies, including the police, the 
courts of general jurisdiction, and the bailiffs, which 
received scores of 2.9, 2.7, and 2.7, respectively. 

The results were similar when managers were asked 
the extent to which they associated effectiveness, hon- 
esty/lack of corruption, professionalism, accessibility, 
and objectivity with the state arbitration courts. Using 
an unweighted additive index that averages responses 
to these questions on a scale of one to five, managers 
rated the performance of the state arbitration courts 
3.2. Managers do not believe that state arbitration 
courts per se work badly. Instead, they view arbitration 
courts as somewhat more effective than other state 
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institutions, as modestly effective in conflicts with pri- 
vate actors, but relatively ineffective in cases involving 
the local or regional government. 


QUANTITATIVE ANALYSIS 


To assess the determinants of the security of property 
rights, I begin by estimating the following equation: 


Investment 
= Bi + foCredibleCommitment 

+ B3ArbitrationCourt 
+ BaPresidentMarket + Bs Trust 
+ BsLackofCredit + pı Corruption 
+ BgCompetition (1) 
+ Bo BusinessOrganization + Bi 9Police 
+ B11 Profit + &f12ManagerControls 
+ D613FirmControls 
+ UB14SectorControls + e. 


The dependent variable, Investment, is one of the 
three indicators of the security of property rights de- 
scribed above. It measures whether firm managers con- 
structed a new building, extended credit to a customer 
or supplier, or planned a new investment. Because 
the dependent variable is dichotomous, I use a probit 
model. I employ robust standards errors and clustering 
on each city to account for heteroskedasticity in the 
data. 


14 Ericson (1997), Hendley, Murrell, and Ryterman (2000, 88), and 
Pistor (1996) also provide fairly positive views of the performance 
of state arbitration courts. 
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The independent variables include CredibleCommit- 
ment, which equals one for managers who expected 
state arbitration courts to protect their property rights 
in a dispute with the regional or city government 
and zero otherwise. This question is particularly 
appropriate because it captures the possibility that a 
manager who has never had a conflict with the local or 
regional government may be deterred from investing 
due to the threat of state officials unconstrained by the 
judicial system. 

To assess arguments linking state institutions to the 
security of property rights, the model includes two 
variables. ArbitrationCourt and Police measure the re- 
spondent’s evaluation of the effectiveness of state ar- 
bitration courts and the police, respectively, on a scale 
of one to five. It is appropriate to include Arbitra- 
tionCourt because CredibleCommitment may be en- 
dogenous to the respondents’ perceptions of the courts 
in general.!® This variable controls for the possibil- 
ity that respondents’ perceptions of the ability of the 
courts to constrain state officials merely reflects their 
expectations about the overall performance of state 
arbitration courts. As a measure of bureaucratic effec- 
tiveness, I add Corruption, which measures the extent 
to which managers perceived corruption in the state 
bureaucracy to be a problem on a scale of one to five. 

To test the impact of societal factors on property 
rights, I add Trust, which is a four-point variable mea- 
suring the extent to which respondents believed that 
they could trust other businesspeople in their region. 
The survey asked, “In general can one trust other 
businesspeople in your region to fulfill their contrac- 
tual obligations in dealings with other businesspeo- 
ple?” About two-thirds (70%) of the managers an- 
swered yes or more or less yes. The survey also gen- 
erated a measure for civic participation using a dummy 
variable for membership in a business association as 
a proxy. 

I examined managers’ perceptions of the executive’s 
commitment to private property to evaluate the elite 
partisanship argument. The survey asked, “What type 
of economic system is most appropriate for Russia: 
One based on state ownership of property and con- 
trolled prices or one based on private property and free 
prices? Please place yourself on a five-point scale where 
one equals a command economy and controlled prices 
and five equals an economy based on private property 
and free prices.” On average, managers placed them- 
selves at 3.5 on the five-point scale. Interviewers then 
asked the managers how they expected President Putin 


15 The zero category combines respondents who answered “no” with 
those who answered “it 1s hard to say.” Dropping the “it is hard to say” 
responses from the analyses does not change the substantive results 
save for Model 2, in which CredibleCommument retains its sign but 
not its significance. Placing the “it is hard to say” responses in an 
intermediate category produces mmular results. Without clustering, 
the results are simular save for Model 2 in which the coefflaent on 
CredibleCommitment is innguificant. 

16 CredibleCommitment and ArburationCourt are correlated at 
.16 (p < .01), which suggests that viewed the effectiveness 
of courts as largely distinct from whether they could use the courts 
against state agents. All analyses use STATA 6.0. 
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would answer the question: Managers said that Putin 
was a cautious supporter of the market and placed 
him at 3.1. PresidentMarket captures respondents’ per- 
ceptions of President Putin’s commitment to a market 
economy. 

Variables related to the manager, the firm, and mar- 
ket conditions may influence decisions to invest. I in- 
clude variables to account for the age and education 
of the manager. I add dummy variables for small firms 
(<150 employees) and medium-sized firms (>150 and 
<500 employees). I include a dummy variable that 
equals one for private firms and zero for state-owned 
firms. Because profitable firms may invest at higher 
tates, I include a dummy variable for firms that claim 
to have made a profit in the year 2000.17 

Variables related to market conditions may influ- 
ence investment. I include Competition and LackOf- 
Credit, which measure the extent to which competition 
and a lack of credit were obstacles for their business 
on a scale of one to five. The sector and city in 
which a firm is located may shape investment pat- 
terns. In particular, firms in different sectors and cities 
may face different opportunities to invest and differ- 
ent levels of demand for their product. The model 
includes controls for 10 economic sectors; fuel, ma- 
chine tools, metals and chemicals, light industry, con- 
struction, transportation, communications, retail trade, 
and finance. The excluded category is food process- 
ing. The model also includes dummy variables for 
each city. Doing so guards against the possibility that 
hard to measure omitted variables specific to any lo- 
cation or sector drive the results. I present the models 
with dummy variables for sector and for sector and 
city. 
Model 1 in Table 2 examines the impact of these 
variables on the probability that a firm has constructed 
a new building in the last two years. The results reveal 
strong support for the credible commitment argument. 
As indicated by the positive and significant coefficient 
on CredibleCommitment, managers who expect the 
courts to protect their property rights in a dispute with 
the local or regional government were more likely to 
have invested in a new building than were other firms. 
Managers who had a favorable perception of the per- 
formance of the court system were also more likely to 
invest.!® Members of business organizations invested at 
higher rates, whereas private and small firms invested 
at lower rates than other firms. 


17 Tormmg a profit in the last year is an imperfect measure of the 
financial performance of a firm, but given the unwillingness of re- 
spondents to provide more detailed information on their financial 
conditions of their firm, it is perhaps the best that can be achieved. 
18 This result is sensitive to the coding of the perceived performance 
of the court system. Elsewhere m the survey managers were asked 
to rate the performance of a variety of poHtical institutions, inchud- 
ing arbitration courts, on a scale of one to five. When this measure 
of the performance of arbitration courts is added to Models 1-6, 
it 18 insignificant and does not affect the sign or sgnificance of 
other coefficients of interest. Using a variable that measures whether 
respondents expected courts to protect thei nghts m disputes with a 
private agent rather than ArbitrationCourt in Models 1—6 produces 
very similar results. 
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TABLE 2. The Determinants of Secure Property Rights 
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TABLE 3. Marginal Effects of Credible Commitment on Investment 


For Managers Who Expect That They... 


Cannot Take Can Take 
Government Qovemment 
to Court to Court 


Probability That Firms. .. 


Construct new building 35 45 
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Extend credit to suppliers 
or customers 

Plan new Investment 

Note. This table presents the 


59 
46 55 


probabilities that a manager wil undertake various 


types of mvestment Probabilities are calculated based on results from Model 1 in Table 2 


when continuous 
vanables are hetd at their modal value. 


Other variables provided little leverage.!? Proxies 
for social trust, corruption among state bureaucrats, 
competition from other firms, and a lack of credit had 
little impact on investment patterns in this specification 
of the model. 

Model 2 in Table 2 explores the probability that a 
firm extended customer or supplier credit in the last 
two years. As before, the coefficient on CredibleCom- 
mitment is significant and positive, indicating that man- 
agers who believed that courts could constrain state 
officials were more likely to extend credit to suppliers 
and buyers than were other managers. This finding sug- 
gests that commitment concerns influence firm behav- 
ior even in matters not directly involving state agents. 
Members of business organizations extended credit at 
higher rates than did nonmembers. In this model, com- 
petition, firm size, and social trust promoted the exten- 
sion of credit to other firms, whereas other variables 
did not have a significant impact on credit patterns. 

Model 3 assesses the probability that firms are plan- 
ning to make a significant new capital investment in the 
coming year.” Again, the variables CredibleCommit- 
ment and AnibtrationCourt have a significant impact on 
the probability of new investment. 

In addition, the stronger were managers’ perceptions 
that President Putin was committed to a market econ- 
omy, the more likely the firm planned to make a capital 
investment.” This is the cleanest test of the elite par- 
tisanship argument. In previous models, interviewers 
asked managers whether they had invested in the last 
two years, a period in which both Boris Yeltsin and 
Vladimir Putin served as president. This model, how- 
ever, explores how current perceptions of President 
Putin’s commitment to private property influenced fu- 
ture levels of investment. This suggests that perceptions 


3 Controlling for other factors, firms in Voronezh invest at higher 
rates than firms in Nizhnif Novgorod, Moscow, Ekaterinburg, 
Novgorod, Smolensk, and Tula. 
2 By examining plans for future mvestments, I hope to reduce the 
possibility for reverse causality (Besley 1995). 

Evaluations of the commitment of the respondent’s governor or 
mayor to a market economy based on private property had little 
effect on the security of property nghts. 





variables are held at their mean and dummy mdependent 


of executive partisanship are important for the security 
of property rights. 

Profitable firms were also more likely to be planning 
to make a new investment than were other firms. Mem- 
bers of business organizations also planned to invest at 
higher rates than did nonmembers. Other variables had 
little impact. 

As indicated in Table 3, the substantive impact 
of CredibleCommitment is large. Predicted probabil- 
ities for each model with continuous variables set at 
their means suggests that for a profitable midsized 
private firm that is a member of a business organi- 
zation, having confidence that the courts can protect 
your rights in a dispute with the local government 
raises the probability of investing in a new building 
from .35 to .45; extending credit, from .59 to .64; and 
planning a new capital investment, from .46 to .55. 
In sum, the perceived capacity of courts to discipline 
state agents was consistently associated with stronger 
property rights. 


INITIAL ROBUSTNESS CHECKS 


To assess robustness, I recoded the dependent vari- 
able Investment as a dummy variable that equals one 
for any firm that built a new building, or extended 
credit, or planned a new investment, and zero oth- 
erwise. Sixty-four percent of firms made at least one 
of these three forms of investment. I also created an 
index of investment that ranges from zero to three, 
depending on the number of investment activities that 
each firm undertook. Thirty-six percent of firms en- 
gaged in none of these investment activities, 40% 
engaged in one, 17% engaged in two, and 7% en- 
gaged in three. Using the independent variables from 
Model 1, I estimated a probit model on the dummy 
variable measure of investment and an ordered pro- 
bit model on the index of investment. The results 
are essentially unchanged as indicated by the coeffi- 
cients on CredibleCommitment in Models 4 and 5 in 
Table 2. 

I split the sample by whether respondents had 
experienced a property rights violation. I also split 
the sample by whether respondents had experi- 
enced an economic conflict with the local or regional 
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government. I then reestimated Model 5 using each 
subsample. In all four analyses, the coefficient on 
the variable for CredibleCommitment remained signif- 
icant. Thus, even managers who have not experienced 
a violation of property rights by state officials are de- 
terred from investing if they believe that courts cannot 
compel state agents to comply with judicial decisions.” 

I explored the possibility that CredibleCommitment 
is a proxy for political power. J added a dummy variable 
that measured whether a manager believed that his firm 
could influence legislation on matters of importance to 
the firm. I also interacted CredibleCommitment with 
other potential proxies for political power, such as size 
and profitability. Adding these variables did not change 
the substantive results. 

Finally, there may be some concern that the vari- 
ables for corruption and the effectiveness of arbi- 
tration courts may be endogenous to the perceived 
ability to take the government to court. I reestimated 
the ordered probit model without Corruption and 
without ArbitrationCourt, respectively, and the results 
were largely unchanged. In both cases the coefficient 
on CredibleCommitment retains its sign and signifi- 
cance. 

I also conducted the analysis including only the vari- 
able CredibleCommitment and the dummy variables 
for city and sector. The coefficients reported in Model 
6 suggest that doing so produces little change in the 
results.” These findings are robust to specifications that 
include both sectoral and city dummy variables, 
that present different specifications of the model, 
and that use different codings of the dependent vari- 
able. 


AN ADDITIONAL ROBUSTNESS CHECK: 
SURVEY DATA FROM 1998 


These results appear robust but may depend on the 
survey instrument. They also may be a reflection of the 
Putin era. To assess these possibilities, I analyze data 
from a 1998 survey of small business managers in three 
cities in Russia: Ulyanovsk, Smolensk, and Moscow. 
This survey asked similar questions, but focused only 
on small businesses. Researchers interviewed 190 man- 
agers of retail firms, such as groceries, pharmacies, 
and auto parts stores that had between four and 50 
employees.” 


Z Unless noted in the text, these results are not reported, to save 
spaco, but are available from the author 

Alternatrve models that include measures of the perceived per- 
formance of the regional bureaucracy, the bailiffs, and the procuracy 
on a one-to-five scale do not affect the substantive results on the 
variables of interest but, in some cases, reduce the sre of the sample. 
24 MASMI, a decade-old Moscow-based polling agency, conducted 
the survey. Response rates ranged from 55% to 75% depending on 
the city Each firm had a physical storefront. Shops were chosen 
at random from business directories. Shops operating ın the infor- 
mal economy are likely to be registered and to advertise in such 
directories (Yakovlev 2001, 37—39). The survey was conducted face- 
to-face m Russian in the fall of 1998 after a sharp financial crisis. 
Most questions ask about activities over the last two years and focus 
on behavioral measures that may be less strongly colored by recent 
events. 
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Interviewers asked small business managers whether 
they expected the courts to protect their property rights 
if they were involved in different types of disputes. 
Firm managers again exhibited greater confidence that 
the courts could protect their property rights vis-a-vis 
other private actors than vis-a-vis local government 
agents. Fifty-five percent of managers expected the 
courts to protect their property rights in a dispute 
with a business partner, but only 35% of managers 
expected the state arbitration courts to protect their 
property rights in a dispute with the local govern- 
ment. CredibleCommitment equals one for managers 
who expect the courts to protect their property rights 
in a dispute with the local government and zero 
otherwise. 

The measure for investment in this analysis is 
whether firm managers have renovated their place of 
business. The term in Russian is kapitalniii remont and 
indicates a significant investment, such as replacing 
pipes, resurfacing the floor, or installing new equip- 
ment. Sixty-three percent of managers conducted a 
renovation. Renovation equals one for firms that reno- 
vated and zero otherwise. 

As above, the statistical analysis introduces controls 
for the age of the manager, the perceived effectiveness 
of the courts on a scale of one to five; the sector in which 
the firm is located (food sector = 1; others = 0); the 
size of the firm as measured by the number of employ- 
ees; and the extent to which corruption, competition, 
and the performance of the police were perceived to 
be obstacles to their business on a scale of one to 10. 
The instrument in this small business survey was less 
extensive than in the survey described above. As a re- 
sult, the specifications of the statistical models differ 
somewhat. 

Evidence from Model 7 in Table 4 is consistent with 
preceding analyses. The coefficient on CredibleCom- 
mitment is positive and significant. Moreover, its sub- 
stantive impact is large. A simulation using coefficients 
from Model 7 that takes continuous variables at their 
means and dummy variables at their modal values 
suggests that the probability of renovation for a man- 
ager who does not expect the courts to protect his 
property rights vis-à-vis local officials is .46. The cor- 
responding figure for a manager who expects the 
courts to protect their rights vis-d-vis local officials 
is .55. 

The coefficient on a variable that captures the 
respondent’s assessments of the performance of 
courts more generally, ArbitrationCourt, is statistically 
insignificant.Thus, the direct impact of the perfor- 
mance of courts on the security of property rights is 
sensitive to differences in samples and types of invest- 
ment. 

Retail firms with more employees were more likely 
to renovate than were other firms. In contrast to the 
preceding analysis, managers who perceived corrup- 
tion to be a greater problem were significantly less 
likely to renovate their business than were other man- 
agers. This difference may be due to the inclusion 
of only small businesses in this analysis. Corruption 
is generally thought to be more problematic for small 
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TABLE 4. The Security of Property 
Rights: Evidence from a Survey of 
Small Businesses 


Model 7 

55" 
(27) 
Arbitration Court .03 
(-20) 

—.10" 
(.04) 
03 


Credible Commitment 


Corruption (as obstacle) 
Competition (as obstacle) 
Age 

Number of Employees 
Type (food = 1) 

Private 


(.04) 


Polica(as obstacle) 


In parentheses. Raw data are taken from a survey of 
190 small business owners m three Russian cites 
In 1998. Dependent variable is Renovation, a dichoto- 
mous vanabie that equals one for firms that renovated 
thelr place of business and zero otherwise. 





businesses in Russia (Frye 2002: Hellman, Jones, and 
Kaufmann 2000). Firms that faced greater competition 
or had little confidence in the police were no more 
likely to renovate than were other managers. 

These results indicate that rightholders’ concerns 
about their capacity to take the government to court 
hindered secure property rights during both the Yeltsin 
and the Putin administrations, which suggests that the 
problem of credible commitment has deep structural 
roots in this case. 


CONCLUSION 


At the time the 2000 survey was conducted, then 
US. Treasury Secretary, Lawrence Summers noted, 
“No challenge in Russia today seems to me greater 
than the establishment of property rights and contract 
enforcement” (Reuters, October 7, 2000). But Russia is 
not unique. A prominent review recently noted, “Vir- 
tually no transition country succeeded in rapidly de- 
veloping a legal system and institutions that would be 
highly conducive to the preservation of private prop- 
erty and the functioning of a market economy... . This 
lack of a market-oriented legal structure appears to 
have been the Achilles’ heel of the first dozen years 
of transition” (Svejnar 2002, 7). Property rights are 
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a critical issue in the postcommunist world and be- 
yond. Indeed, when discussing economic development 
more generally, North (1993, 14) noted, “Commitment 
is not the whole solution to the problems we con- 
front. But throughout history (and in the present ail- 
ing economies) it is overwhelmingly the most pressing 
issue.” 

Using evidence from original surveys of business 
elites conducted in 1998 and 2000, this essay identifies 
several factors that influence the security of property 
rights. First, managers who expected courts to protect 
their interests in disputes with state agents invested at 
higher rates.” This result holds controlling for other 
factors, including the perceived effectiveness of courts 
in general. Evidence linking the effectiveness of state 
institutions to the security of property rights was mixed. 
Managers who perceived the courts to be ineffective 
invested at lower rates, but this result was sensitive 
to differences in question wording and samples. Per- 
ceptions of the performance of the police and corrup- 
tion in the bureaucracy were not associated with the 
security of property rights. Stronger and more consis- 
tent evidence pointed to the failure of courts to con- 
strain state agents as an impediment to secure property 
rights. 

Second, civic participation was consistently associ- 
ated with secure property rights. Members of business 
associations were more likely to have constructed new 
buildings, extended credit to buyers or suppliers, and 
planned new capital investments in the coming year. In 
contrast, managers who exhibited high levels of trust 
were more likely to extend credit but were no more 
likely to make other investments. 

Finally, there was some evidence that confi- 
dence in President Putin’s commitment to a mar- 
ket economy was associated with stronger property 
rights) That managers conditioned their investment 
on perceptions of President Putin underscores the 
importance of executive partisanship for property 
rights. 


The Role of the State In Transition and 
Developing Countries 


These findings have broader implications for social 
science and public policy. They contribute to our un- 
derstanding of the role of the state in transition and 
developing economies by emphasizing the value of con- 
straints on state power. Many observers identify private 
predators as the main challenge to secure property 
in the region and argue that state bureaucracies are 
too poorly equipped to meet this threat. To increase 
state capacity it is necessary to retrain bureaucrats and 


Z Premdent Putin has made improving the performance of the 
Russian state, mcluding judicial institutions, a priority, but his other 
policies may offset these reforms. Putin has concentrated power in 
executive branches at the federal and regional level, but this research 
suggests that such efforts will produce modest results absent strong 
courts to constrain executive power. 

26 Trust and membership in business organizations are correlated 
only at .02. 
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reequip state police. This argument resonates with the 
literature on governance in political science and eco- 
nomics. 

However, managers did not view state agents so 
much as ineffective, but as not credibly bound by legal 
institutions. State agents posed a threat to property in 
part because they stood above the law, and this lack of 
legal constraint undermined the security of property 
rights. The ability of courts to tie the hands of state 
agents had a strong impact on the security of prop- 
erty rights, even controlling for the perceived effec- 
tiveness of various state institutions. Thus, increasing 
constraints on state agents may bolster the capacity of 
the state to protect property. 

The results contribute to one aspect of an ongo- 
ing debate in economics on the proper reform strat- 
egy for Russia. Although Shleifer and Vishny (1998) 
and Stiglitz (1999) recognize the importance of judicial 
institutions for property rights, the former are more 
skeptical of granting decisionmaking authority to state 
bureaucracies during the transition. Whereas Shleifer 
and Vishny recommend limiting the discretionary au- 
thority of regulatory bureaucracies, Stiglitz draws on 
the China model and suggests giving state regulatory 
bureaucracies far greater sway. The evidence presented 
here is generally more consistent with the Shleifer and 
Vishny view. On balance, firm managers valued bu- 
reaucratic commitment over discretion when making 
investment decisions. 

These results also suggest the value of a more nu- 
anced treatment of the elusive concept of state capacity. 
Because effectiveness varied widely across bureaucra- 
cies it is useful to disaggregate the state. Moreover, 
because the effectiveness of courts varied depending 
on the parties to the dispute, it is helpful to iden- 
tify the policy dimensions along which state capacity 
is measured. Finally, analyses of state capacity typi- 
cally consider what the state can do. Can it extract 
resources from society? Can it create meritocratic bu- 
reaucracies? Can it overcome societal resistance to its 
policies? These are useful components of state capac- 
ity, but this analysis suggests that it is also important 
to consider how constraints on state power influence 
state capacity. Because providing the public good of 
secure property rights is a key aspect of state capac- 
ity, scholars should devote greater attention to this 
issue. 


Informal Institutions 


Debates about the impact of informal institutions on 
property rights in transition and developing economies 
emphasize two different mechanisms. Some point to 
the importance of broad social trust among members 
of a community, whereas other stress the value of civic 
participation as a means to protect property. This essay 
finds that civic participation is a powerful predictor of 
various types of investment, but general social trust is 
only associated with a propensity to give credit. This 
suggests that business associations can be an impor- 
tant means to protect property. More broadly, these 
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results reinforce the argument that constraints on state 
agents—here through the countervailing power of pri- 
vate business organizations—bolster the security of 


property rights.” 


Privatization and the Origins of Secure 
Property Rights 


These findings add a qualification to debates on the 
roots of secure property rights. Arguments taken from 
neoclassical economics emphasize that property rights 
emerge based on private demand for scarce goods. 
Demsetz’s (1967) influential interpretation of the emer- 
gence of private ownership of land among native Amer- 
ican hunters notes that only after the fur trade began 
and land became scarce did local populations have an 
incentive to create secure property rights. This insight 
is a step forward, but its impact is limited by omitting 
state agents (Riker and Sened 1991). 

Arguments drawn from political economy and the 
new institutional economics stress that secure prop- 
erty rights are the result of rulers seeking to maximize 
revenue or other political benefits (Levi 1988). Riker 
and Sened (1991) argue that property rights over air- 
port landing slots in the United States in the 1980s 
became secure when federal bureaucrats saw a benefit 
in enforcing these rights. Only when the revenue gains 
and political benefits from enforcing property rights 
exceeded the costs did state officials make property 
rights over landing slots at airports secure. This essay 
suggests that demand from market participants and 
supply from state agents is insufficient to create secure 
property rights. Where institutions or countervailing 
interest group power do not compel state officials to 
abide by legal norms, property rights remain insecure. 
Secure property rights are not solely granted from 
above by state actors or seized from below by private 
actors, but emerge through political struggle between 
state and private agents. 

These results also suggest that transferring state- 
owned assets into private hands will not necessar- 
ily lead to secure property rights. As others have 
noted, privatization may bolster property rights by 


277 Members of business associations may perceive their 

rights as more secure because the association increases ther lobbymg 
power or because they strengthen social networks. Initial evidence 
suggests the merit of both arguments I asked managers to iden- 
tify what types of benefits, if any, they recerved from a business 
organization. Sixteen percent of all firms identified “representation 
of business interests in the executive and legislative branch” as a 
benefit of membership. Twenty-three percent of all firms said that 
“expanding professional contacts” was a benefit of membership. 
Adding dummy vanables for the responses to these questions to 
Models 1—6 reveals that members of business organizations who 
said that the “representation of business interests” was an important 
benefit of membership were significantly more likely to extend credit 
and plan new investments than were other firms. A dummy variable 
for those who said that “expanding professional contacts” was a 
benefit of membership was significantly associated with extending 
credit and planning new capital investment 
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clarifying competing claims to assets, but support- 
ing institutions are needed to significantly reduce 
the costs of exercising property rights (Roland 2000; 
Stiglitz 1999). 


Legal Reform 


Finally, scholars and policymakers have devoted great 
attention to legal reform in recent years. Carothers 
(1998, 1) noted, “One cannot get through a foreign pol- 
icy debate without someone proposing the rule of law 
as a solution to the world’s troubles.” Extensive surveys 
of business elites on governance issues by the World 
Bank and the EBRD in recent years further attest to 
interest in the issue. Despite great interest in the topic, 
there is little consensus. One partial explanation for 
the slow progress is that few studies have considered 
that the quality of legal institutions may depend sig- 
nificantly on the parties to the dispute. That company 
Managers in Russia have far greater confidence in 
the courts in disputes with private than with state 
agents indicates that such distinctions are essential to 
understanding the legal environment in Russia and 
perhaps elsewhere. Future analyses of legal institutions 
should distinguish between the performance of courts 
in disputes with state and that in disputes with private 
agents. 

This distinction suggests several guidelines for pol- 
icy reform. Most importantly, it emphasizes the value 
of building courts that are less vulnerable to pres- 
sure from state bureaucrats and elected officials. One 
recommendation is to create court jurisdictions that 
do not coincide with electoral jurisdictions. A judge 
whose jurisdiction falls exclusively under one gov- 
ernor (as in Russia) will likely face greater pres- 
sure than one whose jurisdiction falls under sev- 
eral governors. More subtly, this distinction suggests 
that increasing bureaucratic capacity without increas- 
ing judicial constraints on state agents will lead to 
less secure property rights. At a minimum, reform 
of the judiciary should proceed hand in hand with 
the efforts to build the capacity of state police and 
bureaucracies. 

This essay identified a number of social, economic, 
and political factors that influence the security of 
property rights—a key feature of economic perfor- 
mance and state capacity. Future research should 
seek to identify the conditions under which state 
agents make credible commitments to secure property 


rights.* 


% Case studies suggest that a peaceful international environment 
May promote more secure property rights North and Weingast 
(1989) argue that Britsh ehtes frequently sexzed property arbitrarily 
to raise short-term funds to compete with France. Root and Campos 
(1994) argue that organized labor, capital, and state ehtes m East 
Ana created credible bargaining arrangements while facing 
only a moderate threat of the spread of communism. Thus, efforts 
to promote secure property rights in Russia should be coupled with 
policies to foster a more secure international environment. 
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APPENDIX 1 





TABLE A1. Five Hundred-Firm Survey 




















Mean SD Min. Max. 

Supplier or Customer 40 49 0 1 
Credit in Last 2 Years 

New Bullding in Last 21 AI 0 1 
2 Years 

Plan New Investment In 34 47 0 1 
Coming Year 

CredibleCommitment 39 49 0 1 

ArbitrationCourt 3.18 1.20 1 5 

PresidentMarket 3.29 .94 1 5 

Lack of Credit 2.75 1.63 1 5 
(as obstacte) 

Corruption 2.43 1.49 1 5 
(as obstacłe) 

Competition 2.75 1.36 1 5 
(as obstacle) 

Trust 3.04 60 1 4 

Small 49 50 0 1 

Medium 22 41 0 1 

Age 45.82 10.006 23 82 

Education (1 = middle 2.81 54 1 5 
school, 5 = Ph.D.) 

Profit 65 47 0 1 

Private 80 30 0 1 

Business Organization .32 47 0 1 

Police 3.31 11 1 5 





Note. Survey of 500 firms ranging in size from four to 53,000 
employees conducted In November and December 2000 In etght 
cites In Russia. 
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TABLE A2. Survey of Small Firms 
Mean SD 3 


Renovation 
Credible 


3.40 10 
Note. Survey of 190 small businesses of between five and 
50 employees In three cites in Russia conducted m November 
1998. 
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Regulatory Regimes, Agency Actions, and the Conditional Nature 


of Congressional Influence 


CHARLES R. SHIPAN University of Iowa 


States. To examine the extent to which these agency actions are responsive to the preferences of 


Pra bureaucracies make the overwhelming majority of public policy decisions in the United 


elected officials, in particular, Congress, I develop a spatial model of oversight. The most important 
insight of this theory is that agencies make policy decisions within given regimes and may be constrained 
by the preferences of different political actors at different times. To test the theory, I collect and analyze 
data on the monitoring activities of the Food and Drug Administration (FDA). I find that under certain 
conditions, the FDA is responsive to the preferences of committees and floors in Congress, but under 


other conditions the agency can act autonomously. 


embers of Congress delegate broad powers to 

agencies for a variety of reasons. Because of 

competing demands for their attention they 
may, for example, lack the time necessary to make 
policy themselves and prefer instead to delegate this 
responsibility to an agency that is dedicated specifi- 
cally to the policy area in question. Relatedly, they may 
realize that they lack the expertise they would need in 
order to make informed decisions and take appropriate 
actions. Even those members of Congress who special- 
ize in a policy, such as the members of congressional 
committees with jurisdiction over that policy, tend not 
to have the level of expertise found in agencies. These 
reasons, along with a host of others, provide members 
of Congress with the incentive to delegate to agencies, 
as well as the justification for doing so. 

Because these justifications are widely accepted, del- 
egation is usually seen as a necessary part of poli- 
cymaking in a complex society. But delegation can 
also be seen as being at odds with a basic tenet of 
democracy, which holds that there should be a link 
between citizens and public policy. Elections provide 
the link between citizens and legislators; but is there 
a link between elected officials and bureaucrats? To 
answer this question, we need to know whether the 
bureaucrats to whom policymaking power has been 
given are responsive to Congress, the institution that 
delegated power to them in the first place. Do agency 
actions reflect congressional preferences? More specif- 
ically, are current agency actions responsive to the 
preferences of the current Congress? Congress can, of 
course, attempt to structure an agency in specific ways, 
hoping to increase the likelihood of certain kinds of 
outcomes. But does Congress exert contemporaneous 
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influence, so that agency actions are influenced by the 
current group of elected representatives and senator? 
These are basic and essential normative concerns in 
any democratic political system in which legislatures 
delegate responsibilities to agencies. 

Furthermore, we need to understand the mecha- 
nisms by which legislatures can delegate yet still main- 
tain influence policy outcomes and actions. If agencies 
are responsive to congressional preferences, then we 
need to know why they are responsive. What pow- 
ers allow Congress to influence agencies? And how 
does the internal institutional structure of Congress 
affect its ability to influence agency actions? Is it 
possible, or even likely, that agencies sometimes are 
constrained by Congress but other times can act au- 
tonomously? Answers to these questions will help us 
to achieve a better understanding of agency policy- 
making and the relationship between Congress and 
agencies. 

This paper attempts to address these crucial ques- 
tions about the link between elected officials and bu- 
reaucrats. The most general objective of this paper is 
to investigate whether the actions of bureaucrats are 
responsive to the preferences, and changes in the pref- 
erences, of political actors. To address this aspect of 
political influence, the analysis focuses on a specific 
power that Congress has over agencies—the threat that 
it will pass legislation inimical to the agency—and ex- 
amines how this power causes agencies to adjust their 
actions. The analysis also highlights the role that con- 
gressional committees play in this process. More specif- 
ically, I develop a spatial model of committees, floors, 
and agencies that demonstrates that agency policymak- 
ing takes places within specific regulatory regimes and 
that in some of these regimes we should expect to find 
congressional influence and in others we should not. 
I then use this model to structure statistical tests of 
the influence of political actors on the monitoring ac- 
tions of the Food and Drug Administration (FDA) over 
time. 


STUDIES OF POLITICAL INFLUENCE 


The extent to which bureaucratic decisions are affected 
by the preferences of Congress has been a topic of 
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interest among political scientists for decades (see, e.g., 
Cushman 1941 and Herring 1936).! Many of the re- 
cent studies that have analyzed this topic have argued 
that Congress is inattentive to agencies, thereby giving 
agencies a wide berth to do whatever they want. To 
support their arguments, these studies point out that 
although Congress has tools that it can use to over- 
see agencies, such as holding hearings, requesting re- 
ports, and passing laws, it rarely uses these tools (e.g., 
Dodd and Schott 1986, Ogul 1976, Scher 1963; but see 
Aberbach 1990, 2002). 

Beginning with the work of Weingast and Moran 
(1983), scholars began to take a different tack in at- 
tempting to learn whether Congress exercises contem- 
poraneous influence over regulatory agencies.” That is, 
scholars attempted to discern whether agencies take 
into account the preferences and powers of the cur- 
rent members of Congress. To examine whether such 
influence exists, these studies generally would take 
some feature of administrative action, such as the 
types of cases that the agency hears or the level of 
agency activism, and regress it on a set of political 
variables. The guiding idea behind this approach is 
that members of agencies are attuned to the prefer- 
ences of elected politicians, and when the preferences 
of those elected politicians change, so will the agency’s 
activities. 

These studies, and those that followed in their 
footsteps, have used an array of innovative statis- 
tical techniques to examine contemporaneous influ- 
ence. Many found clear evidence of congressional in- 
fluence over agencies (e.g, Moe 1985a, Scholz 1991, 
Weingast and Moran 1983, and Wood and Anderson 
1993). Others, however, found that Congress either 
exerted no influence on the activities of agencies or 
provided only limited support for the presence of 
such influence (e.g., Eisner and Meier 1990, Moe 
1987, Wood 1988). Even with respect to the FDA, 
results have been mixed. Carpenter (1996) concludes 
that although presidential and congressional budgets 
can send signals to the agency, and the agency re- 
sponds to those signals, the ideological preferences 
of the oversight committees have no effect on the 
agency’s monitoring actions. Similar, he finds that the 
congressional variables do not influence the speed 
of approval of “new chemical entities” (Carpenter 
2002). On the other hand, Olson shows that Congress 
and its committees affect FDA approval rates for 
drugs and medical devices (1995) and recalls (1996b) 
and provides some evidence that oversight comm- 
ittees influence the level of inspections (1996b, 
1999). 


1 Because good critical summaries of studies in this area can be found 
elsewhere (e.g , Krause 1999 and Spence 1997), I do not attempt to 
provide a comprehensive overview here but instead highlight only a 
few key studies 

2 Around the same time, Moe (1982) prompted a smular bine of 
research on presidential influence. In this brief synopsis, I focus on 
studies of congressional mfluence, since that will be the primary focus 
of my theory Later in this study, however, I also address the issue of 
when and how the president can influence agencies 
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Thus, a host of studies have used increasingly sophis- 
ticated statistical methods to attempt to determine the 
extent of congressional influence on political agencies. 
These investigations have provided us with a better 
understanding of political influence on agencies as well 
as the level of activities undertaken by different agen- 
cies, What they have not done, however, is settled the 
question of whether Congress exerts contemporaneous 
influence on agencies—that is, whether agencies take 
into account the preferences of the current Congress 
when taking action. Some studies find that Congress 
does matter; others find that it does not. What is lacking 
is a More nuanced examination of the conditions under 
which Congress might matter. 


POLICYMAKING AND REGULATORY 
REGIMES 


At the same time that empirically oriented scholars 
were presenting detailed statistical analyses of con- 
gressional influence on agencies, other scholars were 
developing spatial models of agency policymaking (e.g., 
Ferejohn and Shipan 1990, Hammond and Knott 1996, 
Steunenberg 1992). These models show how agencies 
make decisions within a political context and demon- 
strate that internal institutions, such as the committee 
system in Congress, may have important political reper- 
cussions. Furthermore, they incorporate the knowledge 
that policymaking is sequential and that because the se- 
quence of policymaking is fixed, when the preferences 
of elected political actors change, an agency that is 
anticipating the reactions of these other actors faces 
a different political context. In effect, this context can 
be considered a regime. And under different regimes, 
agencies will take different actions, even if their own 
preferences remain the same. 

Within the study of American politics, the con- 
cept of a regime has been applied most commonly 
(and most fruitfully) to the study of regulation (e.g., 
Harris and Milkis 1989 and Rabin 1986). Such studies, 
however, largely have been historical and sweeping in 
their conception of a regime. In this analysis I define a 
regime simply and specifically as a given configuration 
of institutions and preferences.’ The concept takes on 
importance because, as I demonstrate in the spatial 
model presented below, an agency will choose a policy 
within a certain regime or within a given configura- 
tion of institutions and preferences. And when sta- 
tistically assessing political influence on agencies, one 
must take into account the possibility that at different 
times an agency will be making decisions in different 
regimes. 


3 The analyses most similar to this paper m their use of regimes 
are Nokken and Sala’s (2000) and Snyder and Weingast’s (2000) 
investigations of presidential appointments to agencies and Moraski 
and Shipan’s (1999) study of the President’s choice of Supreme Court 
nominees. See also Morris and Munger (1998) for a related theoret- 
ical exploration of the conditions under which agencies can exercise 
autonomy. 
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A Spatial Model of Regulatory Regimes 
and Congressional Influence 


A spatial model can demonstrate how agency poli- 
cymaking can occur in different regimes. The model, 
which is adapted from Ferejohn and Shipan (1990), 
incorporates the following key assumptions First, a 
legislature has delegated responsibility over some 
policy to an agency, A. I initially assume, for the sake of 
exposition, that this legislature is unicameral, but later 
add a second chamber. Second, the floor of this legis- 
lature, F, has delegated primary responsibility within 
the legislature to a committee, C. Later I also discuss 
how the president can enter into the model. Third, the 
policy and the actors’ preferences can be aligned along 
a single ideological dimension. 

Fourth, I assume that the committee has gatekeeping 
powers. This is a somewhat controversial assumption 
(see, e.g., Krehbiel 1998). Yet adopting such an as- 
sumption is common in spatial models (e.g., Denzau 
and Mackay 1981 and Shepsle and Weingast 1987) and 
bas a theoretical as well as an empirical basis (Epstein 
1997). And more importantly, it comports with the em- 
pirical regularity of how Congress functions. Numerous 
studies have shown that the vast majority of bills die 
after being referred to committees (e.g., Deering and 
Smith 1997) and others have shown gatekeeping to be 
a fairly systematic empirical phenomenon (Adler and 
Still 2001). 

Fifth, action proceeds under an open rule. In reality, 
of course, rules may be closed or restricted, thereby 
protecting the committee’s proposal from amendment. 
Open rules are more common, however; and more 
importantly, when the committee decides whether to 
gatekeep or report a bill, it cannot know whether it 
will receive an open or closed rule. Thus, the committee 
needs to proceed as if it will receive an open rule.* 

Given these players (A, C, F) and the assumptions 
outlined above, the game proceeds in the following 
stages: 


Stage 1: The agency makes a proposal, a. 

Stage 2: In response to the agency’s proposal, the 
committee chooses between introducing a bill, b, 
and gatekeeping. If the committee gatekeeps, then 
the game ends and a is the outcome. 

Stage 3: If the committee introduces legislation, the 
floor can either vote to reject the bill (producing 
an outcome of a), pass the bill as written (b), or 
pass the bill as amended (b*). 


Given the sequential nature of the game, the equilib- 
rium concept used to solve it is subgame perfection. 
Thus, when the agency makes its proposal, it will con- 
sider how the committee will react, and the committee 
will in turn condition its action on the expectation of 
the floor’s action. 

The game can be illustrated by considering the three 
regimes that correspond to the different relative loca- 
tions of A, C, and F. Figure 1 presents an array of pref- 


4 Dion and Huber (1996) and Huxtable (1994) develop spatial mod- 
els in which the choice of the rule us endogenous 


FIGURE 1. The Committee-Floor Regime In a 


Unicameral Model of Agency Pollcymaking 





erences in which A <C <F. The figure also includes 
an additional point, labeled C(F), which is the indif- 
ference point for the committee with respect to F and 
which is located at C(F) = 2C — F In this first example, 
A is located to the left of C(F). The situation shown 
in Figure 1, in which A <C(F) <C <F is called the 
Committee-Floor Regime. 

The committee’s indifference point is of particular 
importance to the agency. If the agency selects a policy, 
a, that is equal to A, its most preferred policy, the com- 
mittee will open the legislative gates and propose legis- 
lation." Then, under an open rule, the floor will amend 
the policy and F will be the outcome (i.e., b* =F). If, on 
the other hand, the agency thinks carefully and strate- 
gically about the situation, it will choose a = C(F). The 
agency realizes that if it selects this point, the commit- 
tee will be indifferent between inaction, which would 
result in C(F) becoming policy, and opening the gates 
and introducing legislation, which, under an open rule, 
would result in a policy of F. Because the commit- 
tee does not prefer F to C(F), it will keep the gates 
closed and allow the agency’s policy choice, a= C(F), 
to stand.® 

What do we learn from this example? First, the policy 
outcome—the subgame perfect equilibrium—is gener- 
ated by the combination of the actors’ preferences and 
the sequence of the game. The legislature’s ability to 
respond to the decision of the agency causes the agency 
to choose a policy away from its most preferred point 
in order to avoid being saddled with a policy that is 
even less desirable. The agency’s choice, in turn, keeps 
the committee from opening the gates and reporting 
legislation that would result in an outcome far from 
the agency’s ideal point. Significantly, the legislature 
does not have to pass new legislation in order to influ- 
ence the outcome. Instead, it is the potential for a new 
law that causes the agency to choose a different policy. 
The legislature might use traditional means of over- 
sight, such as holding hearings or conducting investiga- 
tions, to signal its seriousness and its preferences (e.g., 
Ferejohn and Shipan 1989), but the power underlying 
these activities is the threat of legislation. 


5 The term policy can be used to refer either to a specific policy 
(e.g., the FDA’s decision to require more informative labels on food 
containers) or to a more general level of policy output (e.g , the level 
of FDA activism). In the empirical section of this paper, since I am 
interested in measuring change over time and making generahrvations 
about political influence on agencies, I focus on the latter. 

é Technically, the agency would locate at C(F) +s, which the com- 
mittee prefers to F Alternatively, I assume that if the committee is 
indifferent between two actions, it chooses the least costly action. In 
this case, that would mean keeping the gates closed. 
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FIGURE 2. Regulatory Regimes (by Agency 
Location) 
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Second, as the preferences of the legislature change, 
so will (at least in this regime) the agency’s outputs. 
The model shows that as the committee moves to the 
right, so will the agency’s output. Counterintuitively, it 
shows that as the floor moves to the right, the agency 
can locate farther to the left! Another way to view 
these findings is through the lens of regression analysis. 
If we were to regress some measure of agency output 
on the preferences of the committee and the floor in 
this regime (i.e., where A < C(F) <C<F), we would 
expect the coefficient for the committee to be positive 
but the coefficient for the floor to be negative. 

Consider now a different configuration of prefer- 
ences, one where A € [C(F), F], which is the Gatekeep- 
ing Regime. In this second regime the agency will be 
able to implement its most preferred policy, A, uncon- 
strained by the preferences of the legislature. It is able 
to do so because the committee prefers A, the agency’s 
choice, to F, which would be the policy outcome under 
an open rule. Under these conditions, then, the agency 
receives a considerable amount of autonomy from the 
legislature, and will be unaffected by changes in con- 
gressional preferences. 

Finally, consider the situation where C<F<A, 
which is the Floor Regime. Here, if the agency were to 
choose a policy equal to A, then the committee would 
introduce legislation to overturn the agency’s action. 
In fact, since the committee prefers F, the legislative 
outcome under an open rule, to any point to the right 
of F, the only way the agency can avoid legislation is to 
locate at F. Thus, in this regime we would expect the 
influence of the floor to be significant and positive— 
as the floor moves to the right (or left), so will the 
agency’s chair. However, we would not expect to find 
a significant relationship between the committee and 
the agency. As long as the agency and committee are 
located on opposite sides of the floor, a shift in the 
committee’s preferences will not result in a shift in the 
agency’s preferences. 

The most important result from this model is that 
whether an agency is affected by the legislature, and 
by a committee within the legislature, depends on the 
nature of the regime—that is, the configuration of these 
actors’ preferences. The committee and the floor will 
matter, but only under certain conditions. Figure 2 





1 More specifically, the agency’s output is postrvely affected (1¢., 
ucreased) by having the committee move to the nght (i e., away 
from the agency’s ideal pomt) or having the floor move to the left 
(Le, toward the agency’s ideal point). It ıs negatively affected by 
having the commuttee move to the left or the floor move to the nght. 
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sums up the discussion in the previous paragraphs. As 
the figure shows, there are three regimes, depending 
on the location of the agency relative to the commit- 
tee and the floor. The Committee-Floor Regime oc- 
curs when the agency is to the left of the committee’s 
indifference point (i.e., A < C(F) < C < F), the Gate- 
keeping Regime occurs when A is located between 
the committee’s indifference point and the floor (i.e., 
C(F) < A<F), and the Floor Regime occurs when A 
is on the opposite side of the floor from the committee 
(i.e, C<F <A). 

By now the relevance of this simple theoretical 
model for statistical estimations of political influence 
on agency policymaking should be clear. Most studies 
typically regress some measure of agency output on the 
preferences of legislative actors. Some of these models 
find that the preferences of legislative actors exert a 
significant effect on the agency; others do not. But what 
this model shows is that whether or not legislative in- 
stitutions should influence agency actions depends on 
the nature of the political regime. More specifically, 
the model shows that we should expect the committee 
to influence the agency under the Committee-Floor 
Regime. It also shows that the floor should influence 
the agency under both the Floor Regime and the 
Committee-Floor Regime, but that it should have op- 
posite effects under these two regimes. Thus, political 
influence is contingent. 

As the preceding discussion makes clear, in order 
to achieve meaningful results—or even to expect to 
achieve meaningful results—the nature of the regula- 
tory regime must be taken into account in any statis- 
tical analysis of political influence on agencies. Before 
proceeding to the data analysis, however, we first need 
to consider the effects of two other institutions: the 
president and bicameralism. 


The President 


The primary focus of the model, as mentioned earlier, 
is to pin down the conditions under which the legis- 
lature—in particular, the committee and the floor— 
influences the agency. Of course, another actor that can 
also play an important role in guiding agency decisions 
is the president. Thus, we need to consider how the 
president can influence agency actions. 

To incorporate the president into the analysis, I as- 
sume that the president is able to set the general lo- 
cation of the agency’s ideal point. The president in- 
fluences this ideal point prior to the three stages of 
the game that are spelled out earlier; in fact, we could 
consider the president’s choice of the agency’s ideal 
point as the first stage of the game. This is a strong 
assumption, one that essentially means that the presi- 
dent is in a privileged position, relative to other polit- 
ical actors, with respect to the agency. It is, however, 
an assumption that is based on convincing empirical 
and descriptive accounts of the powers of presidents— 
or, more generally, chief executives—over executive 
agencies (e.g., Howell 2003, Huber and Shipan 2002, 
Meier 1993, and Moe 1982, 1985b). In particular, the 
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president has the advantage of being able to act unilat- 
erally to influence the agency’s goals and preferences 
(e.g., Moe and Howell 1999, Moe and Wilson 1994).8 
Among other things, the president can, for example, 
reorganize agencies, alter jurisdictions, and, in many 
cases, make appointments to key positions, and can 
do all these things without needing the cooperation of 
Congress. 

While I assume that the president has considerable 
powers in setting the agency’s ideal point, I should be 
clear that I am not assuming that A = P, which would 
mean that the president is able to set the agency’s ideal 
point so that it is exactly equal to his own. Rather than 
making this unrealistically (and unnecessarily) strong 
assumption, I make the less stringent assumption that 
because of the president’s powers vis-a-vis the agency, 
the agency’s ideal point will be similar to—in other 
words, located in the same region as—that of the presi- 
dent. Thus, if the president is located to the left of C(F) 
in the model, the agency will also be located to the left 
of C(F). Similarly, if the president is located between 
C(F) and F, or to the right of F, that is where the agency 
will be.? 

Ideally, of course, we would have some independent 
measure of the agency’s location. Such a measure does 
not exist, however, and thus we need to rely on this 
assumption that, although giving a great deal of power 
to the president, is clearly defensible.!° In addition, 
this assumption has three significant implications for 
constructing a statistical model of political influence. 
First, in contrast to the other two regimes, the president 
is most likely to be able to exert influence on the ex- 
act location of the agency’s action in the Gatekeeping 
Regime, since in this regime the committee will not 
introduce legislation, giving the president his strongest 
ability to unilaterally influence the agency. Second, it 
has the practical effect of letting us examine whether 
there is any observable evidence of congressional in- 
fluence even if we give the president the significant 
advantage of being able to set the agency’s ideal point. 
And third, the main effect of assuming that the agency’s 
ideal point is located close to the president’s is to bias 
the coefficients for the congressional variables toward 
zero and to make it more difficult to find significant 
results for congressional variables in the Committee- 
Floor and Floor regimes.” 


8 This is especially true in the case of an agency like the FDA, which 
is located within the executive branch. 

9 Note that in the ftrst and third regimes, ıt does not matter specifi- 
cally where the agency is located; thus, the president 1s less concerned 
about the exact location of A ın these regimes. 

10 Later I explore some alternatives, such as allowing the agency’s 
position to be fixed and determining the agency's position by looking 
at the ideology of the president who was m office when the current 
agency head was appointed. To foreshadow these results, the data 
best support the use of the current president as a surrogate for the 
agency. See footnote 26 and the discussion of Table 2. 

11 For example, if we misclassify some Gatekeeping Regime obser- 
vations as belonging to the Floor Regime, then we make it more dif- 
ficult to find statistical significance for the floor in the Floor Regime. 
Similarty, if we misclassify Gatekeeping Regime observations as be- 
longing to the Commuttee-Floor Regime, then we make it less Hkely 
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While I assume that the president sets the location 
of the agency, I do not include the president as a veto 
player in the model. To begin with, Congress frequently 
can pass legislation even when the president opposes it, 
often by bundling together different policies. Thus, by 
using such a logrolling approach, Congress is able to 
pass policies that the president might not otherwise 
like, and the threat of legislation remains credible. 
Furthermore, in the time period I examine, Congress 
passed dozens of laws having to do with the FDA, 
none of which were vetoed. It is possible, of course, 
that some bills were not vetoed because Congress an- 
ticipated a presidential veto and designed the bill to 
avoid a veto, but it is also likely that Congress passed 
several of these bills despite a lack of support from the 
president. And finally, if we were to include the presi- 
dent and a veto override player in the model (see, e.g., 
Ferejohn and Shipan 1990, Hammond and Knott 1996, 
Spitzer 1990), it complicates things to the point where 
the model is not testable given a limited number of 
observations." Overall, then, the model overstates the 
president’s influence by allowing him to set the agency’s 
ideal point but understates it by not allowing for a 
veto. 


Blcameralism 


In a bicameral legislature like the U.S. Congress, the 
House and the Senate and their respective committees 
will have an interest in the agency’s decisions. Such a 
complication obviously needs to be taken into account. 
At the same time, however, it causes difficulties for 
the analysis. Instead of having three distinct regimes— 
committee-floor influence, agency autonomy, and floor 
influence—we would have five. The agency might be 
influenced by the House committee and floor, the 
Senate committee and floor, the House floor alone, 
or the Senate floor alone, or it might operate en- 
tirely autonomously from both chambers of the legis- 
lature. 

However, since both houses would have to pass leg- 
islation in order for a law overriding the agency to 
be passed, the agency need only pay attention to one 
house. Once that house is satisfied by the agency’s ac- 
tion, the agency no longer needs to worry about over- 
ride legislation being passed and it can disregard the 
preferences of the other chamber. Thus, in principle the 
agency might worry about both floors and both com- 
mittees, but in reality it needs to worry about pleasing 
only one committee-floor pair (in the first regime) or 
one floor (in the third regime). 

An example will illustrate this point. In terms of 
the model presented above, if the House committee’s 
indifference point with respect to the House floor is 
labeled Cy(Fy) and the Senate committee’s indiffer- 
ence point with respect to the Senate floor is labeled 


that the committee and floor coefficients will turn out to be significant 
in the Committee-Floor Regime. 
12 In effect, ıt creates too many distinct regimes 
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TABLE 1. Conditions for Regime Classifications 


Conditions 
f A<Cy(Fy) < Fy and A < Cg(Fs) < Fe 


Reglme 
Committee-Floor 


Gatekeeping 
Floor if 


f Cu(Fu) < A < Fy or Cg(Fs) < A < Fg 
Fu) < Fp < A and Ce(Fg) < Fg <A 


Expected Effects of 
Congressional Varlables 
Committee (positive) 
Floor (negative) 

None 
Floor 


Note. Regimes are classified the same way for mirror Images (e.g , If Fy > P and Fg > P, then It is a Floor Regime) 


FIGURE 3. A Blcameral Model of Agency 
Pollcymaking 


ee a aoyo 


A Cu(Fu) Ce(Fs) CsCu Fs Fu 


Note: Subscripts H and S are used to denote the House and 
Senate, respectively. 


Cs(Fs), then when A < Cy (Fn) < Cs(Fs), as in Figure 3 
below, the agency needs to worry about satisfying 
only the House committee.” In this Committee-Floor 
Regime, we would expect the House committee and 
floor to influence the agency. If, however, the positions 
of the House and the Senate were reversed, the Senate 
committee and floor would be influential. Similarly, if 
either Cs(Fs) < A < Fs or Cy(Fu) < A < Fy, then the 
situation would be classified as a Gatekeeping Regime 
and the agency would be able to act independently 
of Congress. Finally, if Fy <A and Fs <A, then we 
are in the Floor Regime, where the agency would be in- 
fluenced by Fy if Fs < Fy < A and by Fs if Fy < Fs <A. 

Thus, even when there are two chambers, configu- 
rations of preferences can still be classified into the 
three regimes discussed earlier. Either there is one 
committee-floor pair that matters, there is one floor 
that matters, or the agency can act autonomously from 
the legislature. Collapsing the five potential regimes 
into three regimes has both benefits and costs. The 
benefit is that by focusing on only the committee and 
the floor that might matter, regardless of chamber, we 
are able to limit the statistical analysis to three regimes. 
This is important for two reasons: first, to preserve de- 
grees of freedom and, second, because it is possible 
that there might be very few observations in any given 
regime, a situation that would be exacerbated by using 
five regimes. In addition, such an approach still gives 
us leverage on whether congressional institutions influ- 
ence an agency. The cost, however, is that we lose the 
ability to determine whether this influence is coming 
from the House or the Senate. 


Testing the Model 


The model gives us guidance about the conditions un- 
der which committees and floors should matter, guid- 


13 As in the case of a unicameral legislature, Cy (Fy) =2Cy — Fu. 
Alternatively, it could be defined as 2Cy — (Fu + Fs)/2 (Le, with 
respect to the midpoint of the interval [Fp, Fs]). 
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ance that we can use to design a statistical test. Table 1 
spells out the conditions under which each regime oc- 
curs and offers predictions about which congressional 
variables should be significant, and in which direc- 
tion.!* Based on these results, the theoretical model 
calls for the following specification: 


Y, = fo + £1 DcrCommittee; + f2DcrFloor; 
+ bsDoPresident, + 24DrFloor: + AXi + &t, 


where Dcr, Dg, and Dr are dummy variables set equal 
to one for the Committee-Floor, Gatekeeping, and 
Floor regimes, respectively, and zero otherwise, and 
where X; represents a vector of control variables. Y; 
represents the level of agency outputs. As discussed in 
the next section, a greater value of Y, indicates a higher 
level of agency activity and, thus, a more liberal level 
of activity. The predictions for the model are: f; > 0, 
fo <0, 63 > 0, and fy > 0. 


ESTIMATING POLITICAL INFLUENCE 
ON THE FDA 


The Food and Drug Administration (FDA) plays a 
central role in setting public health policy in the United 
States. From drug approval to food safety to the regula- 
tion of medical devices, the agency has a hand in setting 
and enforcing a wide variety of policies. By the agency’s 
own estimate, 25 cents of every consumer dollar spent 
in the United States is spent on products regulated by 
the FDA. , 

While the FDA has a variety of responsibilities in the 
policy areas it regulates, among the most important are 
its monitoring activities. Most notably, the agency con- 
ducts on-site inspections and also takes and analyzes 
product samples. It is through these activities that the 
agency discovers whether regulated firms are comply- 
ing with directives set down by the agency, Congress, 
and the president. If the agency finds, through its in- 
spections and samples, that a firm is acting illegally, or 
is in some other way violating regulatory rules or laws, 
it has a variety of enforcement options available to it, 
ranging from urging the firm to recall its products to 
seizing the firm’s products. 

Monitoring activities serve as excellent proxies 
for the overall level of agency activity. First, they 


14 The model also suggests that specific variables should not matter 
in certain regimes. I return to this point when discussing the results. 
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represent the initial steps in a process that can ulti- 
mately lead to penalties, fines, and even criminal pros- 
ecution. Thus, they are viewed as extremely serious 
by the 95,000 firms that the agency regulates. Second, 
the agency engages in tens of thousands of these ac- 
tivities each year. An individual inspection or sample 
might not seem all that important (other than to the 
firm being investigated), but the combination of all 
of these activities over the course of the year gives a 
good sense of how actively the agency is regulating 
the firms within its jurisdiction. Thus, the measure taps 
into the overall level of agency activism, which matches 
up well with the single-dimensional approach of the 
model. Third, Congress itself views these activities as 
important and has held numerous hearings about the 
style and level of these activities (Carpenter 1996). 
More generally, the notion that these activities are the 
subject of much political interest has been supported 
by both qualitative (e.g., Magill 1995, Quirk 1980) 
and quantitative (e.g., Carpenter 1996, Olson 1996b) 
analyses. 

To analyze the influence of political factors on the 
FDA’s monitoring activities over time, I have collected 
data on the number of inspections and samples taken 
from 1947 to 1995.5 What factors might influence the 
quantity of monitoring activities in each year? First, 
there are the political variables discussed in the previ- 
ous section: the preferences of the president and the 
committees and floors in each chamber. For estimates 
of these preferences, I rely on Common Space Nomi- 
nate scores. 

In order to test the model accurately, it is espe- 
cially important to identify the primary committees 
in each chamber with jurisdiction over the FDA. I 
relied on two approaches to determine these commit- 
tees. First, I used the Public Law database compiled by 
Baumgartner and Jones as part of their Policy Agendas 
project (see, e.g., Baumgartner and Jones 1993).!7 Sec- 
ond, I used the same authors’ congressional hearings 
database to determine which committees held hearings 
on topics related to the FDA. These two sources of data 
produced findings that were consistent with each other. 
In the Senate, primary oversight responsibility was held 


15 These data were drawn from FDA Yearly Reports, FDA Quarterly 
Activity Reports, and “A Statistical History of the Food and Drug 
Admmistration” (an unpublished FDA manuscript). 

16 Common Space Nominate scores for presidents extend back to 
Eisenhower. However, because these scores remain constant over 
the course of a poHtician’s career, for Truman I used the score calcu- 
lated from his years in the Senate. In this paper I reversed the ngn of 
these scores, so that more conservative members received negative 
scores and more Hberal members receive positive scores. This means 
that, for example, all else equal, a higher score for the president 
should result in a greater number of mspections and samples. For 
recent discussions of Nommate scores, see Poole and Rosenthal 1997, 
2001. 

17 These data were originally collected by Frank R. Baumgartner and 
Bryan D. Jones, with the support of National Sclence Foundation 
Grant SBR 9320922, and were distnbuted through the Center for 
Amenican Politics and Pubhc Policy at the University of Washington. 
Neither NSF nor the collectors of the data bear any responsibility 
for the analysis reported here. 


by the Labor and Public Welfare Committee until 1976 
and by the Labor and Human Resources committee 
from 1977 to the present. And in the House, primary 
oversight responsibility was held by the Commerce 
Committees—first, Interstate and Foreign Commerce 
and, later, Energy and Commerce.}® 

Once I obtained the ideology scores for the presi- 
dent, the median member of each committee, and the 
median member of each floor in each year, I was able 
to classify each observation as belonging to a particu- 
lar regime. Overall, from 1947 to 1995, 17 years were 
classified as belonging to Committee-Floor Regime, 20 
were classified as belonging to Gatekeeping Regime, 
and the remaining 12 were classified as belonging to 
the Floor Regime (see Appendix). 

Of course, in addition to these political and institu- 
tional variables, a number of other political influences 
need to be controlled for. Olson (1995, 1996b, 1999), 
using an external signals model, identifies several fac- 
tors that influence the activities of the FDA. First, 
because monitoring activities are relatively high-cost 
activities, the agency’s budget should influence them. 
Second, she contends that industry size should be an 
important determinant of agency actions. Carpenter 
(1996) also argues that the budget and industry size 
need to be accounted for and that an additional fac- 
tor that should enter into the statistical specification is 
the 1962 passage of the Kefauver—Harris amendments, 
which increased the agency’s regulatory authority and 
led to an increased level of activism (see also Olson 
1996a).19 

I control for these variables as follows. First, I in- 
clude the agency’s budget in real dollars (using 1983 
as the baseline). Second, to control for the Kefauver— 
Harris amendments I create a dummy variable that 
was set equal to zero prior to 1962 and one from 
1962 on. Third, I measure industry employment by 
controlling for the number of workers in the food 
and health industries in each year.” The expectation 
for the budget and Kefauver—Harris variables is that 
each will have a positive coefficient. Because Olson 


18 All pubhc laws relating to the FDA that emerged from commit- 
tees came from either the Senate Labor or the House Commerce 
committees, In addition, these committees conducted more hearings 
on the FDA than any other committees. The policy codes from the 
Baumgartner and Jones data set that I examined were 306, 400, 401, 
and 403. I then deleted all laws and hearings that were primarily 
about agriculture, an area in which the FDA has very limited juns- 
diction. 

19 This 1s consistent with a growing literature that contends that 
legislatures and executives design bureaucracies strategically in or- 
der to influence future outcomes Early theories along these lines 
include those of McCubbmms, Noll, and Wemgast 1987 and Moe 1989; 
recent tests include those by Lewis 2002, Potoski 2002, Potoski and 
Woods 2001, and Wood and Bohte 2004. For a recent overview of this 
hterature, see Huber and Shipan 2000. My empirical results remain 
substantially the same regardless of whether or not I include this 
variable. 

%2 I obtained this information from the Bureau of Labor Statistics’ 
Web site (http://stats bls gov/). The data were part of the “Non- 
farm Payroll Statistics from the Current Employment Statistics 
(National).” 


473 


Regulatory Regimes, Agency Actions, and Congressional Influence 


(1999) finds that industry size has a negative influ- 
ence on the number of inspections, while Carpenter 
finds that it has a positive effect, I make no predic- 
tions about the sign of the variables measuring industry 
size. 

In addition to these variables, I include separate con- 
trol variables for inspections and samples to account 
for different reporting methods used by the FDA over 
time. Prior to 1975 the agency reported the count of 
all samples that it collected, whereas after 1975 the 
agency reported only the count of samples that were 
physically analyzed. Thus, in the analysis of samples I 
include a dummy variable that takes on the value of 
one beginning in 1975. Prior to 1955 the agency re- 
ported factory inspections, but since 1955 the agency 
has reported all established inspections. Thus, I include 
a dummy variable for inspections, set equal to zero 
from 1950 to 1954 and one from 1955 to the present. 
I expect the dummy variable for samples to have a 
negative and significant coefficient and the analogous 
dummy variable for inspections to have a positive and 
significant coefficient. 


RESULTS 


Autocorrelation is always a potential problem with 
time-series data. And indeed, when the various mod- 
els presented below are run using OLS, Durbin— 
Watson tests reveal a high level of autocorrelation. 
Thus, to estimate the statistical model I use the Prais— 
Winsten autoregressive technique, which is a GLS 
estimator appropriate for first-order serially corre- 
lated residuals and is especially useful with small 
samples.” 

Before testing the theory, it is instructive to examine 
the results that obtain from an atheoretical approach. 
In Table 2, I present two columns of results, one for 
samples and one for inspections. In each of these athe- 
oretical regressions I include the ideology for the floors 
and relevant oversight committees in each chamber, as 
well as the president. In essence, then, this regression 


21 I should also note several differences between the variables I use 
and those used by Carpenter and Olson. First, Carpenter uses the 
Education and Labor Committee in the House and, in the early years 
of the time senes, the Agriculture Committee in the Senate. Second, 
following Olson, I use the agency’s entre budget, whereas Carpenter 
reles on the agency’s staff budget. Third, Olson (1996b) uses the 
number of oversight hearings (lagged) and the number of adverse 
findings by the FDA. Fourth, Carpenter uses a cascade processing 
model, tn which the agency adapts slowly over time both to the 
amount of the appropriations it receives and to whether appropria- 
tions have been or decreasing in recent years. The model I 
fisg heieinmeve stdlartowharhe labels the “serial updating” model. 
2 It ıs worth noting that mcluding a lagged dependent vanable with 
OLS produces results very similar to those obtained using Prais— 
Winsten. A case could also be made for using either tobit (because 
the dependent vanable is bounded and never falls below zero) or 
negative binomial regression (because the dependent variable con- 
sists of count data). Results using these alternative methodologies 
again are substantially the same as those reported here. 
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TABLE 2. FDA Monitoring Activities 
(In Thousands), Using Atheoretical 
Independent Variables 
Vanable 
Political actors 

House Committee 


House Floor 
Senate Committee 
Senate Floor 
President 


Controls 
FDA Appropnatons 
($ mlllions) 
Kefauver—Hams 
Amendments 
Food Industry 
Employment (thousands) 
Health Industry 
Employment (thousands) 
Samples dummy 


(73.0) 


e. i 
—104.6" 
(9.5) 

Inspectons dummy — 


521.9H+ 
(132.9) 


N 49 
R? 0.76 
p 0.59 
DurbIn—Watson 1.11 
Original Transformed 1.73 
Note: Prals—Winsten estates Numbers In parentheses are 
robust standard errors. For one-tailed tests, “*p< 01 and 
= p< 06 For two-talled tests, +++ p <.01 and ++ p < .05. 


Constant 





assumes that there is a linear relationship between 
these variables and FDA monitoring activities and tests 
whether political influence is unconditional. As such, 
it functions as a sort of alternative to the hypotheses 
stated earlier—namely, these two equations test the 
hypotheses that these political variables matter irre- 
spective of regime. 

The results of these two regressions are striking for 
their lack of significance. None of the congressional 
variables are significant, and indeed, none are even 
very close to being significant. Furthermore, although 
the variable measuring the president’s ideology is sig- 
nificant for inspections, it is indistinguishable from zero 
in the equations about samples. Importantly, the results 
are similarly insignificant if the regressions include only 
one chamber at a time, or only the two committees, 
or only the two floors.” Thus, there appears to be no 


B I tested these alternative specifications to make sure that the lack 
of significance for the political vanables in Table 2 was not sımply 
due to multicollineartty. 
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Varlable Column 1 
Committes-Floor Regime 
Committee 184.77 
(40.4) 
Floor -138.3" 
(28.6) 
Gatekeeping Regime 
President 7.4 
(6.7) 
Floor Regime 
Floor 93.07" 
(16.5) 
Controls 
FDA Appropnations ($ millions) 0.20 
(0.05) 
Kefauver—Harris Amendments 24.57 
(9.4) 
Food Industry Employment —256.0H+ 
(thousands) (53.3) 
Health Industry Employment —4.5+ 
(thousands) (2.0) 
Samples dummy —-114.6" 
(9.2) 
Inspections dummy — 
Nominating President — 
Lagged Appropriations ($ mllllons) — 
Constant 508.41 
(97.6) 
N 49 
Fe 0.88 
p 0.26 
DurbIn—Watson 1.64 
Original Transformed 1.85 


TABLE 3. FDA Monitoring Activities (In thousands), Using Theoretical Predictions 


Samples 


VoL 98, No. 3 









Inspections 
Column 2 Column 3 Column 4 
186.0 129.4* 148.07 
(58.4) (77.0) (70.6) 
—122.3" —132.87 —157.2" 
(57.9) (72.7) (69.5) 
75 7.0" 8.2" 
(7.2) (4.4) (4.1) 
101.4" —15.4 —17.9 
(20.7) (11.6) (16.4) 
0.14" 0.04 0.06 
(0.07) (0.04) (0.08) 
21.3" 13.77 14.0 
(9.9) (4.3) (4.3) 
—265.144+ —29.8 —13.1 
(65.0) (35.4) (30.2) 
—6.14+ —7.1t —B.444+ 
(2.7) (2.6) (2.7) 
—119.4—" — — 
(8.8) 
— 15.5 13.3" 
(6.1) (6.6) 
—1.5 — —6.0 
(8.2) (5.7) 
0.10" — —0.02 
(0.05) (0.06) 
526.61 74.8 47.3 
(118.3) (65.9) (55.7) 
48 48 48 
0.87 0.67 0.68 
0.33 0.30 0.32 
1.62 1.56 1.59 
1.80 1.90 1.88 


Note, Prale—Winsten estimates. Numbers in parentheses are robust standard errors. For one-tailed tests, “*p<.01, *p< 05, 
and * p < 10. For two-tailed tests, +++ P<.01 and ++ p< 05. 





support for the hypothesis that the preferences of 
elected politicians influence agency actions in a linear 
fashion—that is, regardless of regime. 

In Table 3 I turn to the task of testing the theoretical 
predictions. Columns 1 and 3 presents the results from 
the basic model, where the number of samples (in Col- 
umn 1) and inspections (in Column 3) is a function of 
congressional committee and floor preferences in the 
Committee-Floor Regime, the president’s preferences 
in the Gatekeeping Regime, and the floor’s preferences 
in the Floor Regime. As discussed above, I also control 
for the agency’s budget, the Kefauver—Harris amend- 
ments, the size of the industry, and the change in the 
FDA’s reporting methods. 

The results in Column 1 provide strong support for 
the theory’s predictions. Most strikingly, the floor has a 
negative and significant influence in the first regime, but 
a positive and significant influence in the third regime. 
Thus, a more liberal floor will lead to a more activist 


agency in the Floor Regime but to a less activist agency 
in the Committee-Floor Regime. The coefficient for the 
committee is also strongly significant and positive, as 
predicted, in the Committee-Floor Regime. The only 
finding that does not comport with the theoretical pre- 
dictions is that the president does not exert a significant 
influence on the agency in the Gatekeeping Regime. It 
is worth noting that if I add the president’s ideology 
score as a main effect, in addition to including the 
interaction of his ideology with the dummy for the 
Gatekeeping Regime, neither the interacted term nor 
the main effect turns out to be significant. Similarly, if I 
include only the president’s ideology score, irrespective 
of regime, it is not significant. In each case, however, 
the congressional variables remain significant in the 
predicted directions. 

Two of the control variables are also significant in the 
predicted direction. The coefficient for the Kefauver— 
Harris amendments is significant and positive, 
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supporting Carpenter’s (1996) and Olson’s (1996a) 
contentions that these amendments produced a more 
activist culture at the agency. In addition, again consis- 
tent with the work of Carpenter and Olson, the amount 
of money appropriated to the agency also influences 
the agency’s level of output. The variable represent- 
ing the change in the method by which samples were 
reported is, as predicted, negative and significant. In 
addition, the measures of industry employment have 
a negative sign, consistent with the findings of Olson 
(1999). 

The results in Column 3, which focus on investiga- 
tions rather than samples, also provide support for the 
theory, olbeit the support is not as strong as it is for sam- 
ples. Both the committee and the floor have the pre- 
dicted sign in the Committee-Floor Regime, albeit at a 
relatively weak level of significance. Furthermore, the 
president has the expected positive influence in the 
Gatekeeping Regime. Only the floor variable in 
the Floor Regime, which does not differ significantly 
from zero, fails to provide corroboration for the the- 
ory. Taken together, these results in Columns 1 and 
3 provide strong initial confirmation for the theory 
by showing that in specific regimes, identified by the 
theory, Congress does exert influence over the agency’s 
actions. 

The results also provide some indirect support for 
another aspect of the theory. In addition to predicting 
when certain congressional variables should matter, 
the theory also predicts when these variables should 
not matter. More specifically, the House and Senate 
floors should not be significant under the Gatekeeping 
Regime, and the relevant committees should not be 
significant under the Gatekeeping or Floor regimes.* 
Unfortunately, I cannot test these additional hypothe- 
ses directly, since the inclusion of these other variables 
leads to unacceptably high levels of multicollinearity 
that make it impossible to obtain efficient estimates, 
But we can at least draw some tentative conclusions 
from a comparison of Tables 2 and 3. The committee 
variables, for example, are not significant in Table 2 
(or in alternative specifications of the results presented 
in Table 2). Yet the committee variable is significant in 
the results reported in-Table 3. Thus, since Table 3 
shows significance for the Committee-Floor Regime, 
and Table 2 shows a lack of significance overall, this 
implies that the variable is not significant in the pre- 
dicted direction in regimes other than the Committee- 
Floor Regime. A similar conclusion can be drawn for 
the floor variable. Admittedly, this evidence is indi- 
rect, but it is also clearly consistent with the pre- 
dictions of the theory. Overall, then, the results pro- 
vide strong evidence that variables matter when the 
theory says they should matter and some indication 
that the variables do not matter when they should 
not. 


M Furthermore, the committee that is not prvotal in the Committee- 
Floor Regime also should not be significant, and the floor that is 
not used in the Commuttee-Floor or the Floor regimes should not be 


significant. 
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Additional Tests 


In order to check on the robustness of the results shown 
in Table 3, I examined a variety of other variables and 
approaches. First, a number of studies argued that po- 
litical principals attempt to influence the future actions 
of the agency by designing specific agency structures 
and procedures, perhaps to set the agency on “au- 
topilot” (e.g, McCubbins, Noll, and Weingast 1987) 
or perhaps to insulate it from political pressure (e.g., 
Moe 1989). The significance of the Kefauver—Harris 
amendments may be seen as partial support for this 
argument. Another possibility is that the president who 
was in office when the sitting chair of the agency was 
appointed still exerts influence over the agency even 
after a new president takes office (via the chair) and 
that the agency’s actions still reflect that previous pres- 
ident’s preferences.” Thus, to the regressions reported 
in Columns 1 and 3 in Table 3, I add a variable that 
measures the ideology of the president who was in of- 
fice when the chair took over the FDA. Second, I add 
a lagged version of agency appropriations. Although 
this does not completely capture Carpenter’s (1996) 
elegant argument about the budgetary signals that are 
passed down from political principals to agencies, it 
does at least roughly capture the central idea that 
agencies obtain information not only from the current 
budget, but also from past budgets. 

Columns 2 and 4 show the results of the regressions 
containing these two other variables. In general, the 
results are similar to those reported in Columns 1 and 
3. All of the theoretical variables except the president 
are significant for samples, and all except the floor 
are significant for inspections (with the committee and 
floor now achieving higher levels of significance in the 
Committee-Floor Regime). There is some evidence 
that past budgets contain information for the agency, at 
least in the realm of samples. But there is no support for 
the idea that the president who supervised the choice 
of the agency chair continues to exert influence over 
the agency’s actions.” 

In the tests reported thus far I have run separate 
regressions for each of the two dependent variables. 
This follows Carpenter (1996), who makes a good 
case that although there is some overlap between 


> In some cases, a new chairman has been appointed immediately 
after a new president takes office. In other instances, there has been 
a lag of a year or two. And m still others, a chairman who served 
under an earlier president remains in office during the entire term 
of the succeeding president. Although the president did not directly 
appoint the FDA chair until fairly recently, he clearly had influence 
over this appointment, by virtue of its location within the executive 
branch. Consequently, I sometimes refer to this variable (eg., in 
Table 3) as “Nominating President.” 

26 This is also true if instead of adding the nominating president 
to the model, I substitute the nominating president for the current 
president. In an additional test, I also examined what would happen 
if the agency's preferences were relatively fixed over time. I set the 
agency’s ideal point at the midpoint of the two houses of the 80th 
Congress, allowed it to move to the midpoint of the two chambers 
in 1962 (when Congress passed the Kefauver—Haris Amendment), 
and calculated the regimes that way. This approach produced gener- 
ally insignificant results, thereby casting doubt on the idea that the 
agency’s preferences remain fixed over time. 
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inspections and samples, they are generally indepen- 
dent from each other. It is conceivable, however, that 
these two variables act as substitutes for each other. 
Indeed, Olson (1996b) has convincingly argued that 
the FDA has shown a pattern of substituting one type 
of enforcement activity for another. 

Although there is no theory that allows us to pre- 
dict which monitoring tool will be used under which 
conditions, there is an econometric technique that al- 
lows us to account for the relationship between the 
two measures. Seemingly unrelated regression (SUR) 
is appropriate when the error terms of two separate 
equations are correlated. Each equation can be esti- 
mated separately; however, doing so, in effect, ignores 
the information that comes from knowing that the two 
equations are related, in particular, that the error terms 
in the two equations are related (Greene 1990). In this 
case, the FDA can substitute inspections and samples 
for each other. Thus, if we were to regress each of these 
variables separately on the set of independent variables 
included in the previous table, we would be ignoring 
this possible substitution effect. SUR can be used to 
estimate the two equations jointly. 

To test the theory using SUR, I use the variables 
presented in the first and third columns in Table 3. 
However, I need to add one additional variable before 
proceeding. The results in Table 3 are derived using an 
autoregressive technique, Prais—Winsten, to account 
for autocorrelation. Since I cannot use an autoregres- 
sive technique with SUR, I include a lagged version of 
the dependent variable for each equation to account 
for autocorrelation. 

The results using SUR are presented in Table 4. The 
equations for both samples and inspections continue to 
support the theoretical argument. All of the important 
variables, with the exception of the floor in the Floor 
Regime for inspections, are statistically significant in 
the predicted directions. The agency becomes more 
liberal in its monitoring activities as the committee gets 
more liberal and the floor more conservative in the 
Committee-Floor Regime, as the president becomes 
more liberal in the Gatekeeping Regime, and as the 
floor becomes more liberal in the Floor Regime (for 
samples). Finally, the correlation between the resid- 
uals of the two equations is 0.42. A Breusch—Pagan 
test of independence allows us to reject the hypothe- 
sis that the two equations are independent (p <.001), 
indicating that the two equations can be estimated 
jointly. 

Finally, I also tested the statistical models shown in 
Tables 3 and 4 using other alternative approaches and 
variables. First, in an attempt to test whether the courts 
influence the relationship between Congress and agen- 
cies, I created a dummy variable for the Chevron case, 
a 1984 case in which the Supreme Court ruled that 
agencies should be given broad leeway when Congress 
delegates but does not give precise instructions to an 
agency. The results, however, indicate no difference in 
congressional influence before and after this ruling.” 





77 More specifically, I included this dummy variable both as a main 
effect and as a term interacted with the regime variables These 
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TABLE 4. SUR Estimates of FDA Monitoring 
Activities (In Thousands) 
Varlable 


Samples 


Inspections 





Commtttee-Floor Regime 
Committee 114.67 
(64.1) 

—143.1* 


(93.1) 


Floor 


Gatekeeplng Regime 
President 12.7” 
(6.9) 
Floor Regime 
Floor 101.6 
(34.1) 
Controls 
FDA Appropriations 
($ millions) 


0.14 
(0.04) 
18.6 
(6.2) 
217.444 
(43.2) 
2.6 
(2.1) 
88.9" 
(8.3) 
027 
(0.07) 


Health Industry Employment 
(thousands) 

Samples dummy 

Samples (t— 1) 

Inspectons dummy 


Inspections (t— 1) 


tests, ™ p < .01, “= p <.05, and * p <.10. For two-tailed teats, 
+= p < .01 and + p < 0.5. 





Still, the idea that courts can influence the relationship 
between legislatures and agencies is deserving for fur- 
ther attention, as both theoretical work (e.g., Shipan 
2000) and empirical studies of courts and agencies 
(e.g., Canes-Wrone 2003 and Howard and Nixon 2002) 
indicate the potential influence of courts on agency 
actions. 

Second, I investigated two potential alternatives to 
my use of the floor median, both of which represent 
initial investigations into institutions that are omitted 
from my model. One of these was the majority party 
median; the other was the filibuster pivot in the Senate. 
In each case I substituted this other variable for the 
floor median, used this variable to determine the 
regime, and then used it in the regressions in place of 
the floor median. The results using the filibuster rather 
than the Senate floor median were broadly similar to, 


variables were not significant. These results, as well as those discussed 
m the next paragraphs, are available from the author. 
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although sometimes weaker than, those reported in this 
paper, while the results using the party median gener- 
ally were not significant. These results lend support to 
the idea, central to my model, that agencies (and com- 
mittees) are concerned with the median member of the 
floor. At the same time, these should be viewed only 
as preliminary tests, intended more to spur future anal- 
ysis than to be viewed as definitive. The obvious next 
step is for scholars to think more comprehensively and 
systematically about how parties and other institutions 
like the filibuster could be incorporated into a model 
of agency policymaking and then to solve such a model 
and test its predictions (e.g., Chiou and Rothenberg 
2003). 


DISCUSSION AND CONCLUSION 


For more than two decades, scholars have been 
conducting sophisticated statistical analyses of 
contemporaneous political influence on the actions of 
regulatory and administrative agencies. Using a va- 
riety of increasingly advanced statistical techniques, 
these studies have provided insights into the work- 
ings of a number of agencies. Findings about whether 
or not Congress influences agency actions, however, 
have been rather mixed. This paper points out one 
potential reason for these mixed findings: scholars 
have failed to identify the conditions under which 
influence is likely to occur. By mixing together ob- 
servations in which influence is likely to occur with 
those in which it is not, scholars have unintention- 
ally biased their results against finding any significant 
effects. 

To remedy this gap, I developed a simple spatial 
model of agency policymaking, one that identifies a 
series of regulatory regimes, each of which predicts 
that a different set of political actors will influence 
the agency. In essence, the model shows that if the 
question is “Does Congress influence agencies?” then 
the answer is “Sometimes.” But importantly, the model 
shows when Congress can matter, and whether it is the 
floor or the committee that matters, Utilizing the threat 
of legislation, under some conditions committees can 
influence agencies, and under other conditions, floors 
can influence agencies. In addition, it shows that there 
are also specific conditions under which they can oper- 
ate autonomously from Congress. 

The statistical results support the argument that 
Congress can—under certain conditions—infiuence 
agency actions. In particular, when the agency is located 
far to the left of the committee, which is turn is located 
to the left of the floor, the agency will need to be sensi- 
tive to the preferences of the committee. All else equal, 
as the committee becomes more liberal, the agency 
will become more activist; and as the committee be- 
comes more conservative, the agency will become less 
activist. The agency can, however, move in the opposite 
direction of the floor in this regime. On the other hand, 
if the agency is located on the opposite side of the floor 
from the committee, it will have to be attuned to the 
floor’s preferences. 
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The results also show that agencies are influenced by 
other factors when deciding what sorts of policy actions 
to take. Obviously the president, for example, exerts an 
influence over the agency, as does the overall level of 
the agency’s budget. But we also see that the agency 
pays attention to other influences, in addition to the 
preferences of elected politicians, such as the size of 
the industry. All told, then, agencies are sensitive to a 
variety of signals that they receive from the political 
environment. 

One interesting and unexpected result that emerges 
from the model speaks to the recent debate about 
whether the floor is made better off by committees that 
are representative of the floor (e.g., Maltzman 1997). 
The theoretical model presented here indicates that if 
a committee and an agency are on the same side of 
the floor, then the floor has an incentive to make sure 
that the committee is as close to the floor as possible. 
The more distant the committee is from the floor, the 
more likely it is that the agency will be able to choose 
a policy that is far from the floor. Consequently, in 
order to obtain an agency output that is in line with 
its own preferences, the floor should prefer commit- 
tees to be inliers. Furthermore, the model shows that 
floors should have a strong preference for committees 
that are located on the opposite side of the floor from 
the agency. Only when this is the case—when com- 
mittees and agencies are countervailing—can the floor 
exert a positive influence on the actions of the agency 
or expect the agency’s policy outcomes to be in line 
with the floor’s preferences. Interestingly, although the 
model used in this paper assumes the absence of trans- 
action costs, this finding is consistent with the results of 
Epstein and O’Halloran’s (1999) innovative transac- 
tion cost approach. 

Although the results are generally strong, there are 
several obvious directions that future research can 
take. First, using the ideology of the current president 
to determine the location of the agency works well, 
but it certainly would be preferable to have a more 
nuanced and independent measure of the agency’s lo- 
cation. Second, solving and testing competing mod- 
els that include other institutions, such as parties and 
courts, or the agency’s structure or location, would 
provide insight into the effects of those other insti- 
tutions on agencies. Third, this model is built around 
the threat of legislation. It would be useful to develop 
a theory centered around other means that Congress 
has for influencing agencies, one that, like this the- 
ory, produces testable implications about the specific 
conditions under which agencies will pay attention to 
Congress. 

This paper, by setting out a model of agency pol- 
icymaking and then conducting statistical tests of the 
implications of this model, provides a useful and adapt- 
able approach for testing for political influence. It pro- 
duces evidence that contemporaneous influence exists, 
that agencies are responsive to congressional prefer- 
ences. Most importantly, this paper shows that the pri- 
mary focus of research should not simply be whether 
Congress influences the agency but, rather, when, and 
under what conditions, influence exists. 
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APPENDIX 


TABLE Ai. Regime Type by 
Year 


Year Regime Type 
1947 Committee-Floor 
1948 Committee-Floor 
1949 Gatekeeping 
1950 Gatekeeping 
1951 Gatekeeping 
1952 Gatekesping 
1953 Gatekeeping 
1954 Gatekeeping 
1955 Committee-Floor 
1956 Committee-Floor 
1957 Committee-Floor 
1958 Committee-Floor 
1959 Committee-Floor 
1960 Committee-Floor 
1961 Gatekeeping 
1962 Gatekeeping 
1963 Gatekeeping 
1964 Gatekeepling 
1965 Gatekeepling 
1966 Gatekeepling 
1967 Gatekeepling 
1968 Gatekseping 
1969 Committee-Floor 
1970 Committee-Floor 
1971 Committee-Floor 
1972 Committee-Floor 
1973 Committee-Floor 
1974 Committee-Floor 
1975 Committee-Floor 
1976 Floor 

1977 Gatekeeping 
1978 Gatekseping 
1979 Gatekeeping 
1980 Gatekeepling 
1981 Committee-Floor 
1982 Committes-Floor 
1983 Floor 

1984 Floor 

1985 Floor 

1986 Floor 

Floor 

Floor 

Floor 

Floor 

Floor 

Floor 
Gatekeeping 
Gatekeeping 
Floor 


1988 
1989 
1990 


1992 
1993 
1994 
1995 
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low. Our analysis suggests that low bureaucratic capacity diminishes incentives for bureaucrats 


W: analyze a model of delegation and policymaking in polities where bureaucratic capacity is 


to comply with legislation, making it more difficult for politicians to induce bureaucrats to take 
actions that politicians desire. Consequently, when bureaucratic capacity is low, standard principles in 
the theoretical literature on delegation no longer hold. We also use the model to examine the issue of 
political reform in polities with low bureaucratic capacity. The model indicates that politicians in such 
polities will be trapped in a situation whereby they have little incentive to undertake reforms of either the 
bureaucracy or other institutions (such as courts) that are crucial for successful policymaking. 


he central tension motivating contemporary the- 

ories of delegation from politicians to bureau- 

crats lies in the potential conflict between the 
value of bureaucratic expertise, on one hand, and the 
desire for political control, on the other. This tension 
is well-known, and has been debated at least since 
Weber’s classic work on bureaucracy. Modern bureau- 
cracies are staffed with individuals who, by virtue of 
“rational” bureaucratic organization, are highly skilled 
policy experts who in principle should be able to 
help less knowledgeable politicians achieve their goals. 
However, the very skills and expertise that bureau- 
crats enjoy create the possibility that bureaucrats will 
usurp the rightful role of politicians in policymaking 
processes. Because the way this tension is resolved de- 
termines the ultimate compatibility of democracy and 
bureaucracy, existing theories of delegation typically 
focus on understanding how and to what extent politi- 
cians can resolve it.! 

In many contexts, particularly developing democra- 
cies, bureaucracy does not fit the “Weberian” charac- 
teristics that are assumed in the existing delegation 
literature. The bureaucracies may be staffed with in- 
dividuals who lack the personal capacity to execute 
orders effectively. The organizational structure may 
lead to breakdowns within bureaucratic hierarchies, 
making it difficult for senior administrators to control 
their subordinates. Departments and ministries may 
lack basic budgets and resources to undertake imple- 
mentation actions. They may suffer from incentives for 


John D Huber is Professor of Political Saence, Department of Po- 
Htical Science, Columbia Univernty, New York, NY 10025 Gdh39@ 
columbila.edu). 

Nolan McCarty is Professor of Politics and Public Affairs, 
Woodrow Wilson School of International and Public Affairs, Prince- 
ton University, Princeton, NJ 08540 ( ton.edu). 

We would like to thank Jobn Carey, David Epstem, David Lewis, 
and Enc Schickler for comments on early versions of the manuscript. 
We are also grateful for feedback from participants in seminars at 
UC—Berkeley and the University of Michigan and from participants 
in conferences on the Macropolitics of Congress and Brazihan Po- 
Irtical Institutions in Comparative Perspective. Huber is grateful to 
Princeton’s University’s Center for the Study of Democratic Politics, 
where he was a visiting fellow during work on this project 
1 Influential works include informal arguments by McCubbing, Noll, 
and Wemgast (1987), the formal model in Bawn 1995, and the for- 
mal model and empirical tests in Epstem and O'Halloran 1999. See 
Huber and Shipan 2002, chap. 2, for a recent review of the literature. 


corruption. For a variety of reasons, then, it may be 
very difficult for bureaucracies to implement policies 
effectively, even when leaders within the bureaucracy 
have sufficient expertise to understand which policies 
will yield desired outcomes. The problem for politi- 
cians in such contexts is not how to create appropriate 
incentives for high-powered bureaucrats. Instead, the 
problem is how to make policy when bureaucracy is 
known to lack capacity. 

We develop a model of delegation that can help 
understand policymaking processes when bureaucratic 
capacity is lacking. A central feature of our approach is 
to distinguish the problem of information (knowledge 
about the consequences of different actions) from the 
problem of capacity (the ability to accomplish intended 
actions). The information problem has dominated the 
existing delegation literature, whereas the capacity is- 
sue has been essentially ignored. We argue that the 
problems of information and capacity are conceptually 
distinct. Senior bureaucrats, for example, may have pol- 
icy expertise but, because of the problems described 
in the previous paragraph, be unable to execute re- 
liably the policies they intend. By disentangling the 
effects of expertise and capacity on delegation, we can 
achieve important insights into policymaking in situa- 
tions where bureaucratic capacity is low. 

Specifically, we use the model of delegation to low- 
capacity bureaucracies to explore three substantive 
issues. First, we examine how low bureaucratic ca- 
pacity diminishes the ability of politicians to achieve 
their policy goals. Our main argument is that although 
low bureaucratic capacity reduces the general qual- 
ity of bureaucratic outcomes though a straightforward 
efficiency loss, it also changes policymaking in a less 
obvious way. In particular, it diminishes incentives for 
bureaucrats to comply with legislative statutes. Put dif- 
ferently, bad bureaucracies are not only inefficient (i.e., 
less successful at implementing the policies they in- 
tend), but also harder to control because their incom- 
petence diminishes their mcentives to implement the 
policies politicians describe in legislation. For this rea- 
son, politicians can often induce more desirable ac- 
tions from highly competent “enemy” bureaucrats (i.e., 
bureaucrats with policy preferences that differ from 
the politician’s) than from less competent “friendly” 
ones (who share the politician’s policy preferences). 
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Importantly, however, politicians need some level of 
policy expertise to take advantage of bureaucratic com- 
petence. Without it, competent bureaucrats can often 
work successfully to achieve outcomes that politicians 
oppose. 

Second, we use the model to understand how delega- 
tion strategies in situations of low bureaucratic capacity 
differ from delegation strategies that have been de- 
scribed in previous research on high-capacity bureau- 
cracies. Our model suggests that when bureaucratic 
capacity is low, well-established principles in the ex- 
isting literature no longer hold. In models that assume 
high-capacity bureaucrats, for example, politicians typ- 
ically delegate more discretion to bureaucrats when 
the bureaucrats are ideological allies and when ex post 
monitoring possibilities are most effective. We find 
the opposite in low capacity systems. Thus, predictions 
about discretion in legislative statutes may not apply in 
political systems where bureaucratic capacity is low. 

Finally, we use the model to understand incentives 
for political reform in polities with low bureaucratic 
capacity. We argue that politicians in such polities are 
trapped in a situation where they have little incen- 
tive to reform not only the bureaucracy, but other 
institutions as well. The incentives of politicians to 
reform low-capacity bureaucracies increase as politi- 
cians gain more policy expertise, for instance, but the 
incentives of politicians to gain policy expertise are 
smallest when bureaucratic capacity is low. Similarly, 
incentives to reform low-capacity bureaucrats increase 
when enforcement of statutes—which is often done 
by courts—becomes more effective, but incentives to 
create effective enforcement are diminished by low- 
bureaucratic capacity. In addition to these problems, 
we also find that the incentives of politicians to “politi- 
cize” the bureaucracy are largest when bureaucratic 
capacity is low and that, when bureaucrats are “politi- 
cized,” incentives to reform bureaucracy are lowest. 


THE MODEL 


The model we describe in this section focuses on a 
specific notion of bureaucratic capacity that is related 
to the ability of senior members of a bureaucracy to 
implement desired actions. We are motivated, in par- 
ticular, by previous descriptions of inefficiency and in- 
competence in policy implementation in developing 
countries. Geddes (1994, 17), for example, points out 
that in the early 1970s, leaders in Chile’s bureaucracy 
clearly had the knowledge and expertise for successful 
nationalization of Chilean copper mines, but general 
mismanagement, prompted by political incentives to 
treat important political positions as patronage plums, 
led to very poor performance of these mines. Geddes 
(1990) similarly describes a very incompetent bureau- 
cracy in the 1930s in Brazil, when the government could 
not even locate the documents that were necessary 
to determine how much money Brazil owed foreign 
lenders. Other scholars describe specific factors that 
influence basic bureaucratic capacity, such as merit- 
based pay and promotion (e.g., Evans and Rauch 1999), 
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public-sector salaries that are sufficiently high to attract 
skilled individuals into public service (e.g., Besley and 
McLaren 1993), the nature of social networks among 
bureaucrats (e.g., Carpenter 2001), insulation of agency 
budgets from excessive political influence (e.g., Geddes 
1994), and training programs (e.g., McCourt and Sola 
1999). 

Importantly, low bureaucratic capacity can exist even 
when high-level bureaucrats know what they would 
like to accomplish. A senior administrator in a depart- 
ment of education, for example, may have a clear idea 
of which curriculum for students will yield the best test 
scores, but implementation of this curriculum may be 
difficult and uneven given variation in the quality of 
school principals and teachers. Or it may be the case 
that a senior bureaucrat charged with administering a 
pension system can establish a clear set of rules about 
eligibility and payments. But this bureaucrat may be 
unable to enforce adequately the application of these 
rules, as subordinates may favor particular individuals 
over others, refuse pensions where they are deserved, 
or provide them to individuals who are ineligible. Or 
a senior bureaucrat may be assisted by international 
development agencies regarding how to build a water 
project but be unable to motivate subordinates to get 
the job done right. We want to focus, then, on problems 
that senior bureaucrats often face when attempting to 
have their subordinates implement policy and, there- 
fore, to separate this problem of “capacity” in imple- 
mentation from the problem of “expertise.” 

Our approach to modeling policymaking in the pres- 
ence of capacity and expertise problems is as follows. 
The model has two players, a Politician who writes laws 
and a Bureaucrat who implements them.” Interactions 
occur in a one-dimensional policy space where the 
Politician has an ideal point at xp = 0 and the Bureau- 
crat has an ideal point at xg >-xp. Each player has a 
quadratic utility function over policy outcomes. 

The game begins with the Politician adopting a 
statute that establishes the domain of implementation 
actions that the Bureaucrat can take while complying 
with the law. The Politician may lack expertise (i.e., be 
uncertain about which specific policies best serve her 
interests). Following the adoption of the statute, the 
Bureaucrat, who has policy expertise, attempts to im- 
plement policy. The Bureaucrat’s success at executing 
the desired action depends on the Bureaucrat’s capac- 
ity. In the final stage, if the Bureaucrat’s action does not 
comply with the statute, he is caught and punished with 
some probability. We interpret this probability as the 
effectiveness of oversight and of judicial enforcement. 
This general structure is therefore similar to the model 
developed by Huber and Shipan (2002). A central 
difference is that the model here adopts assumptions 
about bureaucratic capacity that follow Chang, Lewis, 
and McCarty (2000). 

To model bureaucratic capacity in the sense de- 
scribed above, we assume that if the Bureaucrat at- 
tempts action a, then the realization of this action is 


* We use in our model a female politician and male bureaucrat to 
avoid confusion in the use of pronouns. 
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a— w, where œw is a random variable with a probabil- 
ity density function f (w) = (Q — |@])/Q on the inter- 
val [—Q, Q]. Therefore, w is distributed symmetrically 
around a mean of zero with a variance o2 = (27/6). 
Because it is directly related to the variance of w and 
therefore to the Bureaucrat’s ability to control the real- 
ized action, Q represents bureaucratic incapacity in our 
model. When Q is very small, the Bureaucrat’s capac- 
ity is large. As Q increases, so too does the possibility 
of large errors in implementation. Because we wish 
to understand how bureaucratic incapacity influences 
institutional reform and policymaking in developing 
polities, we focus on cases where Q is sufficiently large 
(ie., where bureaucratic capacity is reasonably small). 
We make this explicit below. 

Our assumption about policy expertise follows the 
standard approach in the literature. Politicians of- 
ten understand what policy outcomes they want to 
achieve—such as an efficient pension system or a re- 
liable supply of clean water—but may be uncertain 
about which specific policy will achieve this objective. 
A pension system might be accomplished through pri- 
vate contributions to insurance schemes or payments 
from general revenues, for example, and water supply 
might be assured by digging wells or making dams. In 
our analysis, we wish to treat the Politician’s degree of 
policy uncertainty as a variable. Specifically, we assume 
that if some policy a — w is implemented, the ultimate 
policy outcome is a — w — £, where e ~ U[—E, E] with 
variance o? = (E?/3). Each player j’s preferred policy 
action is therefore x, + e. In this paper, we assume that 
the Bureaucrat knows s, capturing the fact that in most 
political systems, senior-level bureaucrats have more 
resources for obtaining policy expertise than politi- 
cians. An important avenue for future research would 
be to examine cases where expertise in the bureaucracy 
is also variable.’ 

The smaller is E, the lower the Bureaucrat’s expertise 
advantage over the Politician. At the extreme, where 
E=0, both the Politician and the Bureaucrat know the 
consequences of adopting any specific policy. We ma- 
nipulate the value of E to examine two issues. The first 
concerns how the Politician’s level of policy uncertainty 
(the value of E) influences policymaking processes. The 
second concerns political development: How does bu- 
reaucratic capacity influence politicians’ incentives to 
create institutions that help them to develop expertise 
(e.g., that make E smaller), and how does the level 
of expertise affect incentives to improve bureaucratic 
capacity? 

It is important to bear in mind that the ultimate out- 
come from any action attempted by the Bureaucrat is a 
function of capacity (a) and the policy shock (e). Thus, 
the final outcome from the action, a, attempted by the 
Bureaucrat is a — w — €. 


2 For our results, only the unimodality and symmetry of w is crucial. 
The densty of œw ıs that of the sum or difference of two uniform 
random variables with zero mean. 
3 Bendor and Metrowitz (2004) provide a model where levels of ex- 
pertise vary across bureaucracies, but each is assumed to implement 
its mtended policy choice perfectly 
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As noted, the game begins when the Politician adopts 
a law. This law specifies the upper and lower bound 
on policies that the Bureaucrat can implement while 
remaining compliant with the law. Formally, this law is 
x= [x, ¥]. Because we assume that xp < xg, the Politi- 
cian will set x as low as possible because the Bureau- 
crat never has an incentive to take actions that are 
lower than what an informed legislature would have 
wanted. To simplify the analysis, we therefore assume 
that x= —oo and focus our analysis on the location of 
x. Following the adoption of the law, the Bureaucrat 
attempts to implement a, the realization of this action 
is a — w, and the policy outcome is a — w — £. 

A central feature of our model is the assumption that 
the policy that the Bureaucrat implements (i.e., a — w) 
may or may not comply with the statute (i.e., it may or 
may not be the case that a — w € [x, X]). We assume that 
if the policy is noncompliant (i.e., a — w ¢ [x, X]), then 
with some probability, y, the Bureaucrat is caught in 
noncompliance and pays a penalty, 6. Our model does 
not assume, then, that the Bureaucrat must comply 
with the law. The Bureaucrat may try to do so, but 
fail, because of a large implementation error. Or he 
may not even try, instead attempting an action that is 
known to be out of compliance with the statute. The 
parameters y and é therefore capture the effectiveness 
of the political system, particularly the courts, at un- 
covering and punishing actions by the Bureaucrat that 
do not comply with statutes. 

It is useful to note that the punishment of bureau- 
crats for noncompliance is based on the realization of 
the Bureaucrat’s attempted action, a — w, rather than 
on the policy outcome itself, a — w — e. We assume that 
it is never possible to observe a, the Bureaucrat’s in- 
tended action. We also assume that the Bureaucrat 
cannot be punished if the realization of his action com- 
plies with the statute (a — w € [x, x]). It may be the case 
that a — w — e ¢ [x, x]. However, the Bureaucrat cannot 
be punished for this state of the world if he actually 
complied with what the statute says. 

Politicians, for example, may want to contain health- 
care costs. A law may state that global budgeting 
procedures should be adopted for individual hospitals 
to achieve this goal. It may be that senior bureaucrats 
instruct their team to develop these budgeting systems. 
If in fact some bureaucrats do not implement the re- 
quired budgeting systems for their hospitals, this would 
be a situation where the policy is not compliant with 
the statute (and a — w ¢ [x, %]). In this case, if caught in 
noncompliance, the bureaucrats could be punished. By 
contrast, it may be the case that all of the budgeting pro- 
cedures were put in place (a — w € [x, x]), but the end 
result did not save money, as was expected. In this sec- 
ond case, a—w—e¢[x, xX], yet the bureaucrat should 
not be punished for doing what the law prescribed. This 
possibility that faithful execution of the law leads to bad 
outcomes represents one of the risks to an uninformed 
legislator (one who is uncertain about the value of e) 
of placing policy constraints on the Bureaucrat—that 
is, the Politician may force the Bureaucrat to take 
actions that, given the actual value of £, are bad from 
both the Bureaucrat’s and the Politician’s perspective. 
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FIGURE 1. Sequence of Interactions In the Model 
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FIGURE 2. The Bureaucrat’s Optimal Response to g 


f(a) 





Note. The example In this figure assumes that s = 0. 





Figure 1 summarizes the model. In the next section, 
we examine how bureaucratic capacity mfluences the 
Bureaucrat’s implementation strategy. 


BUREAUCRATIC CAPACITY AND POLICY 
IMPLEMENTATION 


In the model, one reason the Politician is harmed by a 
low capacity Bureaucrat is because of a straightforward 
efficiency loss. This efficiency loss is simply o2 = 27/6, 
which is increasing in Q. In this section, by exam- 
ining the Bureaucrat’s best response to any statute, 
we show that bureaucratic capacity has two distinct 
effects on policymaking in addition to this efficiency 
effect. On one hand, the best action the Bureaucrat 
can be induced to take improves for the Politician as 
bureaucratic capacity increases. In fact, the Politician 
can often induce a better action from a high-capacity 
“enemy” Bureaucrat (with an ideal point far from the 
Politician’s) than from a low-capacity “friendly” Bu- 
reaucrat (with an ideal point close to the Politician’s). 
On the other hand, a high-capacity Bureaucrat’s action 
is more responsive to his information about the policy 
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environment.‘ This responsiveness can work both for 
and against the Politician’s interests, for reasons we 
spell out below. In what follows, we describe the logic 
of these two effects and state formally the optimal re- 
sponse of the Bureaucrat to any statute. 

The Bureaucrat reacts to a statute by attempting an 
action, a. He knows that the realization of this effort 
is a — w but does not know the value of w (only that 
w €[—Q, Q]). In choosing the optimal a, then, the Bu- 
reaucrat must weigh the (potential) costs and benefits 
of moving a toward his ideal point. Such moves can 
lead to the benefit of better policy outcomes, but at the 
cost of possibly being punished for noncompliance. 

Figure 2 illustrates the Bureaucrat’s decision prob- 
lem given a statute, x, and given an arbitrary level of ca- 
pacity, Q, which has a probability density function f (œ). 
In Figure 2 we assume that the Bureaucrat knows that 
e =0. The figure depicts three different actions—a;, a, 
and a3—along with the distributions of realizations 
of these actions, which are denoted by the triangles 
centered above each action. The attempted action 


4 Bendor and Meirowitz (2004) make a similar point in a model with 
variable expertise. 
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a,=X—Q would ensure compliance with the statute 
for any possible w because even if w= —Q, the real- 
ization of the Bureaucrat’s action complies with x. By 
contrast, by attempting action a2 = xp, the Bureaucrat 
completely ignores the statute by attempting the action 
at his ideal point. This will often lead to a better policy 
outcome for the Bureaucrat, but it will also lead to 
substantial noncompliance. In this example, whenever 
@<xg—, the realization of the Bureaucrat’s action 
will be noncompliant, and if caught in noncompliance 
(which occurs with probability y), the Bureaucrat pays 
a punishment penalty, 6. The probability that the re- 
alization of the action at a2 is noncompliant is repre- 
sented by the area under the triangle centered above 
a that is to the right of x. 

Obviously, as the Bureaucrat moves his attempted 
action from a; to az he obtains a better policy outcome 
but he increases the probability that the realization of 
his action will not comply with the statute, and that he is 
caught and punished. In equilibrium, the Bureaucrat’s 
optimal attempted action equates the marginal policy 
benefit of moving the action toward his ideal point 
with the marginal cost associated with being caught in 
noncompliance with the law, point a3 in Figure 2, for 
example. It is useful to note that the optimal action will 
at times result in noncompliance for some realizations 
of w. 

Note that for some locations of X%, the Bureaucrat’s 
optimal response will always be to attempt to im- 
plement his most preferred policy. If x>xg+, for 
example, such as %; in Figure 2, then the Bureau- 
crat can attempt his most preferred action without 
ever risking noncompliance. If £ is sufficiently far to 
the left of xp (relative to the cost of noncompliance, 
ie, X<xg—Q),° the Bureaucrat prefers taking his 
most preferred action over taking any action that at- 
tempts to comply with the statute. In general, for any 
X¢[xg+e—Q, xp +e4+Q], the Bureaucrat will at- 
tempt an action at his ideal point. We can therefore see 
that as the bureaucratic capacity improves ( declines), 
the range of statutes that result in a= xg + increases 
(i.e., the interval [xg +8 —Q, xp +£ + Q&Q] shrinks). 

Although a high-capacity Bureaucrat will respond to 
a wider range of “extreme” statutes by attempting an 
action at his ideal point, a higher-capacity bureaucrat is 
also easier to “control” with nonextreme statutes. That 
is, when x € [xa +£- Q, xp +e+4+Q], a higher-capacity 
Bureaucrat can be induced to take actions that are 
closer to the Politician’s preferred policy than can a 
low-capacity Bureaucrat. The reason can be seen in 
Figure 3, which depicts two different levels of bureau- 
cratic capacity, a higher-capacity Bureaucrat in the top 
plot and a lower-capacity Bureaucrat in the bottom 
plot. 

Given the x depicted in Figure 3, consider the con- 
sequences for the Bureaucrat of moving his attempted 
action from a; to a2. For both levels of capacity, if the 
Bureaucrat attempts an action at a; =x, the probability 


5 Note that this boundary point implies that the cost of punishment 
for noncompliance is sufficiently small because we assume in this 


paper that dy < Q? (see discusnon below). 
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FIGURE 3. Bureaucratic Capacity and 
Implementation Actions 


f(a) 





of noncompliance is one-half. For the higher-capacity 
Bureaucrat depicted in the top plot, attempting an ac- 
tion æ guarantees full compliance with the statute (i.e., 
because a2 + 2 <¥ for all realizations of w, a2 — w <X). 
By contrast, for the low-capacity Bureaucrat depicted 
in the bottom plot, adopting a2 reduces the risk of non- 
compliance by much less (i.e., for many realizations of 
w, @2 —@ >). Thus, the expected benefit for the low- 
capacity Bureaucrat of attempting greater compliance 
with the statute (by moving from a, to æa) is much 
less than this expected benefit for the higher-capacity 
Bureaucrat. The policy loss (when noncompliance does 
not occur or is not detected) of moving from aj to az is 
the same for the high-capacity and the low-capacity 
Bureaucrat, but the low-capacity Bureaucrat is less 
willing to pay this cost because he reduces the risk of 
noncompliance much less than does the high-capacity 
Bureaucrat. Thus, all else equal, the lower-capacity Bu- 
reaucrat is harder for the Politician to control. Because 
the results of the low-capacity Bureaucrat’s actions are 
less sensitive to his efforts, he has less to lose from 
risking additional noncompliance and thus, for a given 
statute, will attempt actions that are closer to his pre- 
ferred policy than will the high-capacity Bureaucrat. 

To establish formally the precise location of the 
Bureaucrat’s optimal action, recall that we assume 
Xp < Xp, X= —oo, and we multiply the quadratic policy 
utility by one-half. With these assumptions, the Bu- 
reaucrat’s expected utility of attempting to implement 
the policy a in response to statute X is given by 

Q 
1 2 
-f z6- 2- 6 — xB) f (w) dw 
a-t 
at 1 
- f [F@-0- 2-25)? +78] (0) do 
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The first term in this expression is the expected 
utility over the range of œ for which a leads to a 
compliant policy by the Bureaucrat. The second term 
in this expression is the expected utility over the 
range of w’s that lead to noncompliant policies. Eval- 
uating the integrals reveals that the expected utility 
for the Bureaucrat from an Ser a, is EUg(a|xX) = 
—4(a — 8 — xp)? — yF (a — Z) — o}, where F(w) is the 
gumulativé distribution TEE for w. Thus, the Bu- 
reaucrat’s optimal implementation strategy a* solves 
the first-order condition 3EUpg(a | ¥)/3a = xp — a* + & — 
vif (a* — 2) =0. 

The first-order condition has a very intuitive inter- 
pretation. It says that in equilibrium the marginal utility 
of increasing the intended action toward xp, xp — a* + € 
must equal the marginal increase in the likelihood of 
punishment yf (a* — xX) times the penalty 8. Below we 
refer to xg —a* + € as the marginal policy benefit and 
yéf (a — x) as the marginal compliance cost of action a. 
Thus, the key feature that determines how much the 
Bureaucrat is willing to choose actions far from his 
ideal point is f (a* — x), the value of the density of w at 
a* — ï. 

Figure 4 plots the marginal policy benefit curve and 
the marginal compliance costs for three values of x as 
a function of a. The marginal benefit line is the thick 
downward-sloping line. The marginal policy benefit is 
independent of the location of x and declines as the 
Bureaucrat’s action approaches the Bureaucrat’s most 
preferred action, xg +£. The marginal cost curves de- 
pend on the location of X and are depicted in Figure 4 
by the three triangles centered at X1, X2, and x3. Outside 
these triangles the marginal cost is zero. If x were too 
high or too low, the marginal costs would be zero at the 
Bureaucrat’s ideal intended action, xg + £. This would 
lead the Bureaucrat to choose his ideal action. For 
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nonextreme statutes, the Bureaucrat’s best response 
lies at the intersection of the marginal benefit curve 
with the relevant marginal cost curve. Given xj, for 
example, the optimal action is aj. For any a < œ}, the 
marginal policy benefits of increasing the policy action 
(toward the Bureaucrat’s most preferred) exceed the 
marginal compliance costs, and for any a > at, the re- 
duction in compliance costs of moving the action away 
from the Bureaucrat’s most preferred action exceed 
the policy losses. 

We can see from Figure 4 that the effect of changes 
in X¥ on the Bureaucrat’s best response depends on 
whether the apex of the “compliance cost” triangle is to 
the left or right of the policy benefit line (i.e., to the left 
or right of x2 in Figure 4). For %; << X2,a* > x. In this 
range, increases in ¥ increase the marginal compliance 
costs of any a>X, inducing the Bureaucrat to move 
a* toward the Politician’s ideal point. At %2, however, 
this effect reverses. For X > X2, a* <X, and increases in 
x decrease the marginal compliance cost of any a* <i, 
inducing the Bureaucrat to adjust his action closer to 
his ideal point. 

We focus in this paper on the case where Q? > yê. 
This simplifies the analysis by ensuring that the 
marginal benefit and cost lines intersect only once and 

tees second-order conditions for a maximum 
hold at that solution. Substantively, this implies that 
we are focusing on cases where the variance in the 
implementation errors is relatively large compared to 
the Bureaucrat’s expected costs of noncompliance. This 
restriction is reasonable given our focus on democ- 
racies where bureaucratic capacity is limited and the 
probability of the courts rigorously enforcing statutes 
is relatively small. 

Proposition 1 specifies the best-response function for 
the Bureaucrat. 





Marginal policy benefit of increasing a 
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FIGURE 5. The Bureaucrat’s Best Response 








Proposition 1. When SÈ > y8, the following intended 
actions by the Bureaucrat are optimal given X: 


(i) Ifx—e<xg—Q, a*=xgpte. 

(ü) If xp—-Q<E—6 <xp—(8/Q), a* =[R(xg + 
e) — y8(E + Q)]/(Q? — y8). 

(iii) If xp — (8/Q) <E—€<xgp+Q, a® =[N7 (xp + 
6) + w(x — 2)]/ (Q? + 8). 

(iv) IfR-—e>xg+Q, a* =x +6. 


Proof. See the Appendix. 


An example of the best-response function of the Bu- 
reaucrat to any x is given in Figure 5. The horizontal 
axis at the top of the figure depicts x, along with the key 
points in the Bureaucrat’s best response as described by 
Proposition 1. The horizontal axis at the bottom of the 
figure depicts x — e, which determines the Bureaucrat’s 
best response in Proposition 1. The vertical axis depicts 
the Bureaucrats best response, minus the policy shock 
(i.e. it depicts a*(% — £) — 8). 

There are several points to note about the best 
response. First, as we noted above, extreme statutes 
induce the Bureaucrat to attempt to implement its 
most preferred policy a* — € = xp. This is because the 
marginal compliance cost is zero for sufficiently high or 
sufficiently low x.° This occurs when %— € < xp — Q or 
when x—s>xg+. Thus, the range of statutes for 
which the Bureaucrat tries to implement his own 
preferences increases as the Bureaucrat’s capacity in- 
creases. 


6 For large £, a* — s = xp is always comphant so the Bureaucrat will 
choose it. For low ¥, a* —s=xg 1s never compliant and marginal 
concessions have no impact on compliance, so the Bureaucrat is 
willing to choose it. 
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Second, the Politician can induce any action 
a* — 8 € [xz — (Sy/Q), A by adopting intermediate 
values of %. For x—¢ € [xg — Q, xp — (y/Q)], the Bu- 
reaucrat’s equilibrium action moves farther from his 
ideal point as discretion increases (with the specific 
value of the best response given by part ii of Propo- 
sition 1). The Politician obtains a better action by 
giving more discretion because ¥<a* when łž— e8 € 
[xp — 9, xp — (dy/Q)]. As noted in the discussion of 

4, when x < a*, the effect of expanding discre- 
tion is to increase the marginal compliance cost, which 
leads the Bureaucrat to make greater concessions from 
his ideal policy. For X—« € [xg — (8y/Q), xp + Q], the 
Bureaucrat’s equilibrium action moves closer to his 
ideal point as X — e increases (with the specific value of 
the best response given by part iii of Proposition 1). The 
Politician obtains a better action by giving less discre- 
tion because ł > a* when X—¢ € [xg — (6y/Q), xg + Q]. 
Thus, the effect of increasing discretion lowers the 
marginal compliance cost, leading the Bureaucrat 
to move his action away from the Politician’s ideal 
point. 

The Bureaucrat’s best response, a*—e, is thus a 
piecewise, linear function and V-shaped for X—e€ 
[xp — R, x +9]. At the bottom point, or lowest in- 
ducible policy outcome, a* —s=xX—e and this leads 
to a probability of noncompliance equal to 0.5. The 
Politician, then, can induce any action in the inter- 
val a* — e € [xp — (Sy/Q), xg]. If the Politician is rela- 
tively distant from the Bureaucrat, or if the Bureau- 
crat has low capacity (ie., if xp <xg— (8y/Q) +e), 
then the best action that the Politician can induce is 
a* — e = xp — (Sy/Q). Because we wish to focus in this 
paper on cases where Bureaucratic capacity is low, we 
will concentrate only on cases where the Politician 
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cannot induce his most preferred action (i.e, where 
xp > (5y/Q)). 

Third, the Bureaucrat’s V-shaped best response is 
asymmetric, with a steeper downward slope in the first 
interval than the upward slope in the second interval. 
This asymmetry can also be seen in Figure 4, where 
although £; and X3 produce the same action, X; is much 
closer to X2 than is ¥3. To understand why the Bureau- 
crat’s action is more responsive to changes in ¥ for 
low values of x — s than for high values, note that the 
effect on a* of a change in ¥ can be decomposed into 
two components. The first-order effect is that increas- 
ing x changes the marginal compliance costs, giving 
the Bureaucrat an incentive to adjust its action. The 
second-order effect is that the change in the action 
itself changes the marginal policy benefits and com- 
pliance costs, leading to an additional adjustment in 
a*.’ When these first- and second-order effects move 
in the same direction, the total effect of x is larger, lead- 
ing to policy choices that are more responsive to the 
statutes. 

If a* > x, the marginal compliance cost is decreasing 
in a and increasing in £. The first-order effect of increas- 
ing ž is therefore a decrease in a*. The second-order 
effect is that the decrease in a* also increases marginal 
compliance costs further decreasing a*. Because both 
effects work in the same direction, the intended ac- 
tion is very responsive to % when a* > <x. For a* <x, 
the marginal compliance cost is increasing in a and de- 
creasing in X. Thus, the first-order effect of increasing Ë 
is an increase in a*, whereas the second-order effect is 
increased compliance costs, which depresses a*. Since 
the first- and second-order effects move in opposite 
directions, the Bureaucrat is less responsive to changes 
in the statute. 

As Proposition 1 and Figure 5 make clear, as the 
Bureaucrat’s ideal point moves away from the Politi- 
cian’s, so too does the best action that the Politician 
can obtain. Consistent with many theories of delega- 
tion, then, when we ignore the impact of bureaucratic 
capacity, the Politician in our model always prefers a 
Bureaucrat with an ideal point closer to the Politician’s 
to one with an ideal point farther away. But if we take 
bureaucratic capacity into account, the Politician may 
not prefer a Bureaucrat who has preferences similar 
to the Politician’s. Note that the best action the Politi- 
cian can induce, a* — £ = xp — (dy/Q), moves closer to 
the Politician’s ideal point as bureaucratic capacity im- 
proves (Q gets smaller). Thus, when our assumption 
that xp > (dy/Q) is satisfied, in comparing any two 
Bureaucrats, there is always a level of bureaucratic 
capacity that will make the Politician prefer the “less 
friendly” one to the “more friendly” one (in terms of 
preferences) because the high-capacity Bureaucrat is 
easier to control. 


7 More technically mclined readers will recognize this intuition as 
that underlying the implicit function theorem. 

8 This asymmetry ın the response function does not depend on our 
specification of the distribution of œ. It is true for any distmbution 
function that is symmetric and unmodal at zero. 
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THE POLITICIAN’S OPTIMAL DELEGATION 
STRATEGY 


We now consider the Politician’s optimal statute. First, 
assume that the Politician has complete policy exper- 
tise, corresponding to the case where E=0. From 
Proposition 1, the best action the Politician can 
induce is a*—e=xg—(dy/Q), which occurs when 
īï—e=xp-— (8y/Q). The more difficult case occurs 
when the Politician does not know s (which occurs 
when E > 0). The Politician’s goal is to set x — e so that 
the Agent’s expected response is as close as possible 
to xp —(dy/Q), but policy uncertainty precludes the 
Politician from doing this effectively. 

Return to Figure 5 for an example of how the Politi- 
cian solves the problem. The Politician wishes to po- 
sition X so as to maximize his utility over the range 
of possible s. If the Politician chooses x,, for exam- 
ple, then x, — € € [£1 — E, %,+E], as depicted by the 
two dashed vertical lines. The expected utility for the 
Politician from x, is represented by the area that is 
bounded by the dashed lines on each side, the horizon- 
tal axis on the bottom, and the best response function 
on the top. The Politician wants to choose ¥ to min- 
imize this area (and, of course, would maximize her 
utility if the best response was a flat horizontal line at 
a*(%—e)=0). 

We can see that x, cannot be an equilibrium statute 
for the Politician. By moving from %; to X2, for ex- 
ample, the Politician eliminates the outcomes in the 
shaded area at the left in the diagram and adds the 
outcomes in the shaded area at the right in the diagram. 
Because the shaded area at the left is larger than the 
shaded area at the right (and the Politician wants to 
minimize the area under the best-response function), 
the Politician does better adopting ¥2 than adopting 
X1. 

In equilibrium, the Politician will choose £ such that if 
the Politician moves ¥ in either direction, the expected 
utility of the policy losses created on the right side of 
the V in the Bureaucrat’s best response is exactly equal 
to that of the expected losses on the left side of this V. 
This occurs only when a*(x, E) — E = a* (ž, —E) + E, as 
with £2 in Figure 5. 

Note that if the policy uncertainty is so large that 
E> Q, there is no unique optimal statute for the Politi- 
cian. In such cases of extreme uncertainty, the Politi- 
cian’s decision problem in designing her statute is triv- 
ial because all she can do is center X so as to en- 
sure that the interval [xg —Q, xg+Q] is spanned by 
[zx —E, +E]. For this reason, in what follows, we fo- 
cus on the case where Q > E, an assumption that is also 
consistent with our focus on polities with low-capacity 
bureaucrats. 

This intuition about the optimal statute is formal- 
ized in Proposition 2, which also states the Politician’s 
expected utility from this statute. 


Proposition 2. If Q> min{E, (8y/xg), /}, then 
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EUp(x"*) = —— (e — *) + (xs — es 
+ Am( 8) +o 
3 92 is 


Proof. See the Appendix. 


The results described in Proposition 2 can be used to 
explore two substantive issues. On one hand, since the 
Proposition describes x*, it can be used to examine del- 
egation incentives when bureaucratic capacity is low. 
We explore this issue in the remainder of this section. 
On the other hand, since it describes the Politician’s 
expected utility from optimal behavior, the Proposition 
can be used to examine the payoff to the Politician of 
particular types of political reform, such as improved 
ex post enforcement (which occurs when Sy increases). 
We explore this issue in the next section. 

The contemporary literature on bureaucratic policy- 
making has developed a number of standard predic- 
tions about delegation to highly competent bureaucra- 
cies, as follows. 


The uncertainty principle. Politicians should dele- 
gate more policymaking autonomy to bureaucrats 
when politicians are more uncertain about which 
policy will yield the best outcome. 

The ally principle. Politicians should delegate more 
policymaking autonomy to bureaucrats when 
politicians and bureaucrats share similar policy 
preferences. 

The monitoring principle: Politicians should dele- 
gate more policymaking autonomy to bureau- 
crats when politicians have more opportunities for 
ex post monitoring and sanctions. 


Our model suggests that neither the ally nor the 
monitoring principle should hold in polities with low- 
capacity bureaucrats. Note that an increase in X* 
implies greater policymaking discretion for the Bu- 
reaucrat in our model. Direct inspection of ¥* in Propo- 
sition 2 shows that the uncertainty principle holds in 
our model (i.e., x* is increasing in E). However, neither 
the ally principle nor the monitoring principle holds 
since x* is increasing xg and decreasing in yê. 

The reason the allied principle fails is straightfor- 
ward. As we have demonstrated, low-capacity bureau- 
crats are make fewer concessions toward the ideal point 
of the Politician because the impact of their policy 
choice on the probability of compliance is low. Thus, 
low capacity is especially harmful to the Politician when 
the Bureaucrat has divergent preferences. So how does 
the Politician mitigate the harm and make the Bureau- 
crat more responsive? One way is to increase x. This 
leads to a higher marginal impact of policy choice on 
compliance for low values of ¢ and a lower impact for 
high values. However, as we know from Figure 5, this 
leads to a sharper increase in concessions for lower 
values of s than the decrease in concessions for higher 
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values of s. Because politicians with divergent prefer- 
ences value this net increase in concessions the most, 
they are the most willing to grant additional discretion 
to bureaucrats. 

The mechanism that leads to a violation of the mon- 
itoring principle is similar. As ex post enforcement im- 
proves, the Bureaucrat will make more concessions 
to the Politician’s preferences due to the increased 
cost of noncompliance. In particular, these increased 
concessions are larger for low values of s than for 
high values. By reducing discretion, the Politician can 
transfer some of the gains for low-s to high-e states of 
the world. Thus, effective ex post enforcement creates 
opportunities for the Politician to use policy details 
in statutes to induce greater responsiveness by a low- 
capacity Bureaucrat. This turns the standard monitor- 
ing principle from studies of high-capacity bureaucrats 
on its head. With high-capacity bureaucracies, policy 
details and ex post enforcement are substitutes for each 
other, but with low-capacity bureaucracies, they are 
complements. 

Given these findings, our empirical predictions about 
delegation in developing systems are quite different 
from those of the advanced democracies for whom the 
bulk of the empirical knowledge about delegation has 
originated. Our model clearly suggests that findings 
consistent with the ally and monitoring principals are 
contingent on a high bureaucratic capacity and that 
these principles should be applied to developing sys- 
tems with a great degree of caution. 


POLITICAL REFORM IN POLITIES WITH 
LOW-CAPACITY BUREAUCRACIES 


Proposition 2 states that the Politician’s expected utility 
from adopting X* is 


We can use the equation to examine how bureaucratic 
capacity influences the Politician’s willingness to pay 
for different types of institutional reforms. In the Ap- 
pendix, we define these propositions formally and pro- 
vide proofs. In what follows, we state the Propositions 
informally and discuss their intuitions. 

Our first proposition concerns the interaction of pol- 
icy expertise and bureaucratic capacity. 


Proposition 3. When bureaucratic capacity is low, 


© the Politician’s benefit from increasing bureaucratic 
capacity increases as policy expertise increases, and 
© the Politician’s benefit from increasing policy ex- 
Pertise decreases as bureaucratic capacity declines. 


Proof. See the Appendix for a formal statement of 
the proposition and its proof 
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When politicians are relatively expert (E low), they 
can optimally choose statutes that capture large con- 
cessions from the bureaucrat for most values of e. For 
example, in the extreme case of E=0, they can induce 
the best possible action of a* — 8 = xg — (8/0). Be- 
cause these concessions get larger as capacity improves, 
highly informed politicians have strong incentives to 
increase capacity. 

On the other hand, consider the incentives to ac- 
quire information when capacity is low. As we have 
shown, low-capacity bureaucrats tend to bias policy to- 
ward their ideal point and the expected policy outcome 
a* — e tends to be less responsive to e. This change in 
responsive of the expected policy outcome to the policy 
shock means that politicians are unable to generate 
substantially more concessions by learning £. Thus, in 
systems with low-capacity bureaucrats, politicians are 
least likely to want to develop policy expertise. 

Next consider reforms related to enforcement. In 
our model, this enforcement, which is typically done 
by courts but may also be done by politicians them- 
selves, is represented by dy, the probability that bu- 
reaucratic actions that do not comply with a statute 
are discovered, multiplied by the punishment cost for 
noncompliance. A country with a judiciary that rigor- 
ously enforces statutes will have a large value of dy. 
We want to understand how reforms of bureaucracy 
are related to the quality of the judiciary and how in- 
centives to create strong judiciaries are influenced by 
bureaucratic capacity. Our main result is summarized 
in Proposition 4. 


Proposition 4. When bureaucratic capacity is low, 


© the Politician’s benefit from increasing bureaucratic 
capacity increases as enforcement becomes more 
effective, and 

e the Politician’s benefit from increasing the effec- 
tiveness of enforcement decreases as bureaucratic 
capacity declines. 


Proof. See the Appendix for a formal statement of 
the proposition and its proof 


We have noted that the actions of a high-capacity Bu- 
reaucrat are more sensitive to the location of statutes. 
The Politician can therefore best take advantage of a 
high-capacity Bureaucrat when strong ex post enforce- 
ment intensifies the relationship between actions and 
punishment. But the Politician has the least to gain 
from improving ex post enforcement when bureau- 
cratic capacity is low. One way to see this is to recall 
that the best policy the Politician can ever induce the 
Bureaucrat to take is given by xg — (8y/Q). Thus, the 
Politician can push policy toward its ideal point by im- 
proving enforcement (increasing ôy) or by increasing 
bureaucratic capacity (decreasing Q). But as the equa- 
tion makes clear, the benefit of improving enforcement 
will decline as Q increases, and the benefit of improving 
bureaucratic capacity will decline as dy decreases. 

Finally, consider how bureaucratic capacity influ- 
ences incentives to “politicize” the bureaucracy. Our 
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model speaks to this issue in the following respect. The 
preferences of bureaucrats are largely determined by 
the rules and structures that politicians adopt. If the 
rules and procedures give politicians great latitude to 
appoint, promote, and dismiss bureaucrats, then we 
might expect bureaucrats to have preferences that are 
relatively close to the politicians’. By contrast, if bu- 
reaucratic hiring and promotion are largely governed 
by neutral rules and procedures of administrative law, 
then we might expect bureaucrats to have preferences 
that are more divergent from the politicians (at least 
in comparison to bureaucrats who are chosen by a 
more politicized process). We can therefore ask how 
bureaucratic capacity influences the value of having a 
“politicized” bureaucracy—that is, of having a Bureau- 
crat with preferences aligned with the Politician. 
Our main result is summarized in Proposition 5. 


Proposition 5. When bureaucratic capacity is low, 


e the Politician’s benefit from decreasing policy diver- 
gence with the Bureaucrat increases as bureaucratic 
capacity declines, and 

e the Politician’s benefit from increasing bureaucratic 
capacity declines as policy divergence with the Bu- 
reaucrat decreases. 


Proof. See the Appendix for a formal statement of 
the proposition and its proof 


Proposition 5 describes another trap that is cre- 
ated by low bureaucratic capacity. When bureaucratic 
capacity is low, the benefit to the Politician of an 
“allied” bureaucrat (with policy preferences similar 
to the Politician’s) is greatest. This is because low- 
capacity bureaucrats bias policy choices so heavily to- 
ward their own ideal points. Thus, one way to eliminate 
those losses of political control is to try to politicize 
the bureaucracy by moving it to the Politician’s ideal 
point. 

But consider how this logic affects the decision of a 
politician to reform the capacity of an allied bureau- 
cracy. Since a low-capacity ally produces policies close 
to the Politician’s ideal point, there is significantly less 
reason to engage in capacity enhancing reforms. The 
Politician’s benefit from reforming the bureaucracy 
(decreasing N) declines when x, is close to xp. 


CONCLUSION 


In any polity governed by the rule of law, bureaucratic 
behavior during policy implementation is conditioned 
by the bureaucrats’ fear that they could be punished if 
caught taking actions that the law forbids (or not taking 
actions that the law requires). This fear is central to a 
politician’s ability to use laws to successfully delegate 
policymaking authority to bureaucrats. Without it, bu- 
reaucrats could ignore the policy goals of politicians, 
thereby severing any democratic link between citizens 
and the policymaking process. 

Factors that influence bureaucrats’ concern about 
the consequences of noncompliance with legislation 
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therefore have a significant influence on delegation and 
policymaking. Our analysis identifies one such factor— 
-low bureaucratic capacity—that has received little 
attention in formal studies of delegation. Our model 
indicates that low bureaucratic capacity diminishes the 
ability of politicians to influence the actions of bureau- 
crats. As bureaucratic capacity declines, bureaucrats 
recognize that their ability to take actions that comply 
with legislation also declines, diminishing their incen- 
tive to try to do so. Politicians, then, are less able to 
use legislation to influence bureaucratic actions when 
bureaucratic capacity is low. 

We argue that this diminished ability of politicians to 
control bureaucrats with low capacity has a significant 
impact on how we understand delegation and polit- 
ical reform in developing democracies. Our analysis 
suggests, for example, that when bureaucratic capacity 
is low, well-established principles in the existing dele- 
gation literature no longer operate. In particular, the 
“monitoring principle” is turned on its head, as is the 
“ally principle.” 

With respect to political reform, we argue that poli- 
ties with low bureaucratic capacity are caught in a trap. 
Although politicians always benefit from reforming 
low-capacity bureaucracies, they will be most willing 
to pay for such reforms when politicians themselves 
have technical expertise, and when ex post enforce- 
ment is effective. But politicians will be least willing to 
improve their technical expertise or ex post enforce- 
ment when bureaucratic capacity is low. In addition, 
when bureaucratic capacity is low, politicians have the 
greatest incentives to politicize the bureaucracy, and 
this politicization diminishes incentives to improve bu- 
reaucratic capacity. 

The existence of this trap suggests that “compre- 
hensiveness” is crucial to successful political reform. 
In studies of political development, scholars often un- 
derline the importance of reforming courts to enhance 
enforcement, or reforming bureaucracies to enhance 
capacity, or reforming legislatures or executives to en- 
hance technical expertise. Although each of these re- 
forms is valuable, what prior research neglects is the re- 
lationship of such reforms to each other. Politicians will 
have little incentive to enhance bureaucratic capacity 
if they lack the technical expertise necessary to influ- 
ence bureaucrats to implement policies that politicians 
desire or if ex post enforcement is too ineffective to 
condition bureaucratic behavior. Politicians will have 
little incentive to improve their own technical expertise 
if they are delegating to low-capacity bureaucrats who 
ignore statutory details or if lax enforcement makes the 
content of statutes largely irrelevant. And politicians 
will have little incentive to improve enforcement by 
the judiciary if low bureaucratic capacity diminishes 
the incentives of bureaucrats to comply with the law 
in the first place. The benefits of reform in any one 
area, such as the bureaucracy, then, depend crucially 
on reforms in other areas, such as the courts. 

This argument suggests an avenue for empirical 
study of political reform in polities with low bureau- 
cratic capacity. Our model suggests that reforms will 
be most successful when they are comprehensive, with 
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simultaneous efforts focusing on bureaucracies, courts, 
and politicians themselves. Although such comprehen- 
siveness is a tall order, our argument suggests the con- 
trary as well—reforms that are not comprehensive are 
likely to fail. Efforts to improve bureaucratic capacity 
in polities with ineffective enforcement by judiciaries, 
for example, are likely to flounder and unravel. This 
need for comprehensiveness, we suspect, may help ex- 
plain the substantial difficulties polities face as they 
attempt to develop politically. 

Because there has been little effort to model pol- 
icymaking in polities with low bureaucratic capacity, 
there is clearly a need to develop additional theoret- 
ical models that can further explore the issues raised 
here. Bureaucratic capacity can be conceptualized in 
a number of ways, and our analysis focuses narrowly 
on only one of the possibilities. A particularly interest- 
ing issue to introduce would be policy uncertainty by 
bureaucrats themselves. In addition, our policymaking 
model is “institution-free,” and it would be useful to 
consider how the constitutional setting, such as the sep- 
aration of legislative and executive power, influences 
delegation to low-capacity bureaucrats. And we have 
obviously offered a rather reduced-form specification 
of the factors within bureaucracies that create low 
capacity. Modeling these factors within bureaucracies 
explicitly should have a central place in future studies. 
The further development of such models should signif- 
icantly improve our understanding of challenges faced 
in governing those polities that are in the process of 
political development. 


APPENDIX 


Proposition 1. The following intended actions by the Bu- 
reaucrat are optimal given &: 


(v) [fX-—e<xp-Q a =xg +8. 
(vi) If xg — Q<- e < xp — (~8/Q), a” = [R(x +£) — 
WE + Q)]/(D? — y). 
(vii) If xp — (8/Q) <¥—e < xp +9, at = [R(x +8) + 
WE — 2)]/(% + yb). 
(vii) Ifk-—s>xg+Q, a*=xg+6. 


Proof. The Bureaucrat’s expected utility is 
EUs(a |) =—F(a— 6- x5} — 503 — p8F(a—2) 


where F(œw) is the cumulative distribution function for w. 
Thus, the Bureaucrats optimal implementation strategy a” 


solves the first-order condition 
QEUp(a|%)__,, y wla*—3| 
= re —&—Xg)— ot gm 72: 


Due to the absolute values operator, there are two relevant 
cases depending on the relationship between a and 7. 


Case I.a<% In this case the first-order condition be- 
comes a = xp + £ — (82/2?) + [y8(k — a)/ N°], which implies 
that a* = (X (xs +8) + vêl — 2))/(Q? + 78). We can verify 
that a* < £ when xg + £ — (78/Q) < X. Because it is dominated 
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strategy for the Bureaucrat to choose an action higher than 
her ideal action, we must also verify that a* < xg + e. This is 
guaranteed by¥—-Q>xg+eork>xrgpt+E+Q. 


Case 2. a>& In this case, the first-order condi- 
tion becomes a= ¢ + xp — (R/R) + [)8(a — ¥)/27], so that 
a* = (07 (xg +6) — [t + Q])/(2? — 38). Similarly to above, 
we must establish that a* > ¥. This inequality will hold when 
xp +8-— (38/9) > ¥. It can be verified that a* < xp +£ if and 
only iff>xg+e—-—Q. 


Proposition 2. If Q> min{E, (by/xs), VYŠ) ¥* =xB—- 
(18/2) + (8/M)E and 


EUp(*) = -i(e-£) + (ao *) 5% 


Proof. Note that the expected policy outcome given a 
choice of £ is 


E{—(a"(Z, £) — 6 —@)"] = E] —(a* (£, £) — 8)*] — 03. 


From the results of Proposition 1, we can compute 
a" (Z, 6) —s: 


a" (3, 8)-é= axs + BE-Q—«) 


02 + 8 
ô 
if 3-xs+ 1 >e>k—2p—0 
Xr- W(E+Q— 
Poea ee ETS) 


2? — y 
if B4+Q—xp>6>2-x9+ E 


a" (X,6)-—e=xg if I—xgp—Q>e 


or if eé>¥—xg+Q 


Note that because we assume that Q > (8y/xp), it must be the 
case that xp > a*(2, £) —s > 0 so that 


(a"(z, 8) ~ 8} € (= 7 2) a] 


Now we claim that since Q > E, * € [xg +E — R, x —E + 
Q]. Suppose that ¥<xg+E-Q or E>x,—E+Q. We 
can show that the distribution of utilities generated by 
these choices is stochastically dominated by a distribu- 
tion generated by some other choice of %. First, con- 
sider ¥>x,;—E+. This generates a utility of — x} — o? 
Pied eae and —(a"(%, 6) — e} —o2 > -xi — of 
for s ¢€[t-Q-—x,,E]. Now we compare this distribu- 
tion of utilities to that of Y=xgs—E+Q. Note that 
a*(¥’, 8) —e=a"(¥,e+2—-¥)—s—k¥+¥. Thus, because s 
is distributed uniformly, the distribution of outcomes 
for e e FE,E—2+4+2] under ¥ is the same as for se 
[% — 9 — xz, E] under £. However, for s € [E—¥+¥, E] un- 
der Y, —(a* (¥', €) — 6)? — 02 > —x} — o2, which 1s strictly bet- 
ter than the —x} — o% for the same width interval under 
£. Thus, the distribution of utilities under ¥stochastically 
dominates the distribution under ¥. We can show that any 
ł <x +E -— Q is similarly dominated. 
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Now consider the Politician’s optimization problem, which 
is to choose ¥ to maximize 


E 

1 2 
-g | 00D- de-o 

-E 
The first-order condition is 

1 F da 

-JE f (a(à, 8) =) ds =0 
-E 


Note that if ¥+(8y/2)—x,>E, then [@a*(%, e)/at] = [y8/ 
(Q?-+8)]>0 for all s e [-E, E], which guarantees that 
the first-order condition is always negative. Thus, we 
know that the optimal policy satisfies E — (8y/Q)+xg>£. 
By a similar argument, we can show that optimality 
also requires —E + xp — (78/Q) <¥. Combining these con- 
ditions with our first claim, the optimal £ must sat- 
isfy both —E + xp — (y8/Q) <¥ < —E + xs + Q and E+ xp — 
Q <7 <E+ xg — (8/Q). Therefore, the first-order condition 
can be written 


nox 
1 yö 
-E | Raps 1 (a(ž, £) — £) ds 
E 
-a f (a(z, 6) — e) de | =0 
Hou 


Integrating both terms we can rewrite this condition (1/4)E 
[(a(2, E)—E)* — (a(2, -E) + E)*] =0. Because both squared 
terms are positive, the only solution is a(ł, E)-— E= 
a(%, -E) + E.’ Thus, we can solve for £ from 


Yarr tW- I+E) Vx- wW(F+2—E) 
22+ 8 E -y 


which leads to ¥* = xp — (⁄8/2) + (y8/Q*)E. It is easy to 
check that x* satisfies both —E + xp — (y8/9) < ł* < -E + 
Xp+Q and E +xp— Q <3* < E + xp — (78/Q) because 2? > 
W. Finally, given £*, we can compute the Politician’s expected 
utility at the optimum as 


2 
cue) = -4 [ (+(e 


1/8 \" 
+3E (5) +a 


Proofs of Propositions 3—5. Each of Propositions 3—5 
relates to the interaction of Q with one other substantive 
variable in our model. Proving the propositions therefore 
involves (a) examining the first-order partial derivatives of 
EUp with respect to Q and with respect to the substantive 
variable of interest and (b) demonstrating that the cross- 
partial derivative of these two variables has the correct sign 
(given the signs of the first order partials). In Proposition 3, 
for example, we claim that (i) the benefits to the Politician 


9 The second-order condition is [~N (a(t, E) — E)]/[E(aQ? — 8) 
(Q? + y8)], which is negative because a(t, E) — E > 0 and 27 > 4, 
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of increasing bureaucratic capacity increase as policy exper- 
tise increases, and (ii) the marginal benefits to the Politician 
of i increasing policy expertise decrease as bureaucratic ca- 

pacity declines. As we show below, (8E Up /an2) <0. Thus, 
to show that the benefit to the Politician of i increasing bu- 
reaucratic ity increases as policy expertise increases, 
we need to show that (PEUp /JER) > 0. By a similar logic, 
given that (@EUp/dE) <0, we again need to establish that 
(P EUp/3E3) > 0. For Propositions 3—5, we simply estab- 
lish a sufficient condition on the size of Q to ensure that the 
various partial derivatives have the correct sign. In so doing, 
each of the three propositions takes advantage of Lemma 1. 


Lemma 1. ForQ> min{E, (8y/xg), /y3}, (@EUp/IQ) <0. 
Proof. From proposition 2, 


BD 9-H 


Note that the first term is clearly negative because Q > E. The 
sum of the second and third terms term is negative as long 
as E(w} (4E — 32) < NŚ. Because Q > E, the left-hand side 
of this inequality can be no larger than (y8). Because 
Q? >ê, the inequality holds and the partial derivative is 
negative. E 


Proposition 3 





Informal Statement 
When bureaucratic capacity is low, 


e the benefit to the Politician of increasing bureaucratic 
capacity increases as policy expertise increases, and 

© the benefit to the Politician of increasing policy expertise 
decreases as bureaucratic capacity declines. 


Formal Statement 


If R> min{E, (38y/2xg), /8y}, 
(#EUp/8E8Q) > 0. 


Proof. As noted above, given Lemma 1, both parts of 
the proposition are true if, for sufficiently low bureau- 
cratic capacity, es a and (#EUp/aB8n) > 0. Let 
mf = min{E, (36y/2xg), /8y}. If Q > 2, then the Politician’s 


(@EUp/aE) <0 and 








utilities are given by Proposition 2. Thus, 
EU; 1 ¥\ 2 2p 
-a (*— a) ta ae | <2 
and P 
EUr _ y8 3w \ 4.4 
JESN = 3 [(2- za) à Pa] = 
E 
Proposition 4 
Informal Statement 


When bureaucratic capacity is low, 


© the benefit of increasing bureaucratic capacity increases 
as enforcement becomes more effective, and 

e the benefit of increasing effectiveness of enforcement 
decreases as bureaucratic capacity declines. 
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Formal Statement 


For Q> minfĝ, (8y/xs), VEY}, 3EUp/Xőy)>0 and 
#EUp/HSy)8Q <0, where © is defined implicitly by 
r(Ĝ) m 82 — E—(p8/xg)(2 + (4E?/3027) — (3E/22)) =0. 


Proof. Given Lemma 1, both parts of the proposition 
are true if, for sufficiently low bureaucratic capacity, 
dE Up/& Sy) <0 and #EUp/Hsy)8N <0. Note that T” m1 — 
(y8/xe)((3E/2) — (8B*/30P)) > 0 because (x~/y8) > 1. 
Because T(E) = —(}8/3x3) < 0 and T” > 0, & > E. Therefore, 
if Q> min{ĝ, (8y/xs), VEY}, then Q> min{E, (3y/xB), 8y} 
so the politician’s expected utility is given by Proposition 2. 
Therefore, the i benefit of y8 is then dEUp/ 
Xy) = [(22 — E)/207)(ap — (78/2))(WE/20°) — (1/ 
3)(E*y8/O%)]. For Q>E and R> (sy/xg), this marginal 
utility must be positive. » These conditions are also implied by 
Q>2. 

The cross of Q and y8 is #EUP/HSy)@Q= 
PIE — 2) (x5 — (8/Q)) + 8 + [(4E — 6Q)/307] KE 
This cross partial is negative if Q-—E- (y8/xp)(2-+ (4E2/ 
39?) — (3E/Q)) m T(N) > 0. By definition of 2 and the fact 
that I” > 0, this implies that the cross partial will be negative 
as long as 2 > %2. | 


Proposition 5 


Informal Statement 
When bureaucratic capacity is low, 


© the Politician’s benefit from decreasing policy diver- 
gence with the Bureaucrat increases as bureaucratic ca- 
pacity declines, and 

e the Politician’s benefit from increasing bureaucratic ca- 
pacity declines as policy divergence with the Bureaucrat 
decreases. 


Formal Statement 
If R> min{E, (8y/xg), /3y}, (GEUp/axg) <0 and (#EUp/ 


axgaR) <0. 
Proof. Given Lemma 1, to prove both ee 

sition we need to show that (@EUp/axg and (ZEUp/ 
axg8Q) <0. If N > min{E, (Sy/xs), /3y}, He ie: conditions 
for Proposition 2 hold, implying that (3EUp/3xp)= 
—(xs — (⁄8/2)) — E(8/207) <0 and (PEUp/ðxpðN) = 
—(78/27) + E(8/2) <0 because E < R. a 


REFERENCES 


Bawn, Kathleen. 1995. “Political Control versus Expertise: Congres- 
sional Choice About Administration Procedures.” American Po- 
tical Science Review 89 (1): 62—73. 

Bendor, Jonathan, and Adam Metrowitz. 2004. “Spatial Models of 
Delegation.” Amertcan Political Science Review. 98 (2): 293—310. 

Bealey, Timothy, and John McLaren. 1993. “Taxes and Bribery: The 
Role of Wage Incentives.” Economic Journal 103: 119-41. 

Carpenter, Daniel 2001. The Forging of Bureaucratic Capacity: Rep- 
utations, Networks, and Policy Innovation in Executive Agencies, 
1862—1928. Princeton, NJ: Princeton University Press. 

Chang, Kelly, David E. Lewis, and Nolan McCarty. 2000. “The Tenure 

presented at the 2000 American 


Epstein, David, and Sharyn O'Halloran. 1999. Delegating Powers 
New York. Cambridge University Press. 


493 


Bureaucratic Capacity, Delegation, and Political Reform 


August 2004 





Evans, Peter, and James Rauch. 1999. “Bureaucracy and Growth: A 
Cross-Natonal Analysis of the Effects of ‘Weberian’ State Struc- 
tures on Economic Growth.” American Sociological Review 64: 
748-65. 

Geddes, Barbara. 1990. “Building ‘State’ Autonomy in Braz, 1930— 
64.” Comparative Politics 22 (2): 217-35. 

Geddes, Barbara. 1994. Politician’s Dilemma: Building State Ca- 
pacity in Latin America. Berkeley: University of California 
Press. 


494 


Huber, John D., and Charles Shipan. 2002. Deliberate Discretion? 
Institutional Foundations of Bureaucratic Autonomy New York: 
Cambndge University Press. 

McCourt, Willy, and Nazar Sola. 1999. “Using Training to Pro- 
mote Civil Service Reform: A Tanzanian Local Government Case 
Study.” Public Administration and Development 19: 63—75. 

McCubbins, Mathew D., Roger G. Noll, and Barry R. Weingast. 1987. 
“Administrative Procedures as Instruments of Political ControL” 
Journal of Law, Economics, and Organization 3 (2): 243-77. 


American Political Science Review 


Vol. 98, No.3 August 2004 


A Model of Path-Dependence in Decisions over Multiple Propositions 
CHRISTIAN LIST London School of Economics and Political Science 


model sequential decisions over multiple interconnected propositions and investigate path- 

dependence in such decisions. The propositions and their interconnections are represented in propo- 

sitional logic. A sequential decision process is path-dependent if its outcome depends on the order in 
which the propositions are considered. Assuming that earlier decisions constrain later ones, I prove three 
main results: First, certain rationality violations by the decision-making agent—individual or group—are 
necessary and sufficient for path-dependence. Second, under some conditions, path-dependence is un- 
avoidable in decisions made by groups. Third, path-dependence makes decisions vulnerable to strategic 
agenda setting and strategic voting. I also discuss escape routes from path-dependence. My results are 
relevant to discussions on collective consistency and reason-based decision-making, focusing not only 
on outcomes, but also on underlying reasons, beliefs, and constraints. 


ocial choice theory has improved our understand- 

ing of many collective decision problems: elec- 

tions, referenda, decisions in legislatures, com- 
Mittees, judiciaries, Classical models of social choice 
represent collective decision-making as the aggrega- 
tion of individual preferences or votes into collective 
outcomes (Arrow 1953; Sen 1970). Although they il- 
luminate many aspects of collective decision-making, 
these models focus mainly on the ranking of, or choice 
among, alternative outcomes: candidates, policies, ac- 
tions. They do not capture the reasons given in support 
of choices, the beliefs choices are based on, and the 
constraints choices impose on other choices. The im- 
portance of all these is emphasized, for example, in 
theories of democratic deliberation, which often crit- 
icize classical social-choice-theoretic models for their 
narrow focus on preference aggregation.! 

In this paper, I develop a model that seeks to cap- 
ture such reasons, beliefs, and constraints. Combining 
social choice theory and propositional logic, my model 
addresses decisions on multiple propositions based on 
one or several individuals’ views, beliefs, or judgments 
on these propositions. The propositions are intercon- 
nected: The views on some propositions logically con- 
strain the views on others. The decisions are sequential: 
Different propositions are considered not simultane- 
ously, but one after another—as in many governments, 
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and aggregation, see Dryzek and List 2003, Knight and Johnson 
1994, and D. Miller 1992. 


political parties, legislatures, committees, judiciaries— 
and earlier decisions constrain later ones. I investi- 
gate whether such decisions are path-dependent, i.e., 
whether their outcome depends on the order in which 
propositions are considered. 

For example, if a political body decides first to en- 
dorse a tax cut, it may reject subsequent proposals to 
increase expenditure, although it might have accepted 
these proposals if it had considered them before decid- 
ing the tax issue. In an individual decision, there might 
be two arguments that both begin with propositions the 
individual finds plausible and both use valid inferences, 
yet they support opposite conclusions. Political rhetoric 
or “heresthetics”—the art of political manipulation— 
might exploit this phenomenon (Riker 1986). 

Interest in decisions over multiple interconnected 
propositions was sparked by the “doctrinal para- 
dox” in law and economics (e.g., Chapman 1998 and 
Kornhauser and Sager 1986). A three-member court 
has to decide three propositions. P: The defendant did 
some action X. Q: The defendant had a contractual 
obligation not to do action X. R: The defendant is li- 
able. Legal doctrine requires that R (the pae 
be accepted if and only if both P and Q (the premises 
are accepted. The first judge accepts both P and Q and 
therefore accepts R. The second accepts P but not Q 
and therefore rejects R. The third accepts Q but not P 
and therefore also rejects R. This leads to the apparent 
paradox that a majority (of two-thirds) accepts P, a 
majority (of two-thirds) accepts Q, and yet a majority 
(of two-thirds) rejects R, although P and Q imply R 
and all judges’ views are individually consistent. There 
is more than one way of resolving the issue. The court 
might privilege the majority votes on the premises and 
find liability, overruling the majority vote on the con- 
clusion. Or it might privilege the majority vote on the 
conclusion and find no liability, overruling the majority 
votes on (one or both of) the premises. The order in 
which votes are taken and privileged may matter. 

The example highlights not only the final outcome 
(R), but also the supporting reasons (P, Q) and the con- 
straint linking reasons and outcomes (R if and only if P 
and Q). In general, such reasons and constraints need 
not be exogenously fixed, say, by legal doctrine; they 
may themselves be contested. A model of decisions 
over multiple propositions seeks to capture cases where 
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decisions are made on reasons and constraints in addi- 
tion to outcomes—more broadly, cases where a system 
of beliefs or judgments is built up, constrained by con- 
sistency and mutual support, a requirement sometimes 
called “integrity” (Dworkin 1986). 

Are such cases politically relevant? Are there any 
political bodies that do not just make separate deci- 
sions—at best mutually detached, at worst mutually 
inconsistent—but that build up a coherent system of be- 
liefs or judgments? Some scholars think that, at most, 
judiciaries have this property. Others, particularly de- 
liberative democrats, see the construction of such a 
coherent system as central to democratic politics (Pettit 
1997, 2001a). Here the “doctrinal paradox” has been 
interpreted as a paradox of “collective coherence” or 
“discursive dilemma,” illustrating a trade-off between 
majoritarian responsiveness and consistency in collec- 
tive decisions (Brennan 2001; Pettit 2001a). Many so- 
cial choice theorists have argued that a consistent col- 
lective set of views—more strongly, “collective will”— 
is democratically infeasible (e.g., Riker 1982). Others 
have defended its feasibility or argued that the issue is 
still open (e.g., Coleman and Ferejohn 1986). But the 
debate has been centered around results on preference 
aggregation, such as Arrow’s theorem (1951) and its 
descendants. For a richer analysis of collective consis- 
tency, it is desirable to dévelop a model of how groups 
can form collective views on multiple propositions. I 
aim to contribute to that development. 

The first model of decisions over multiple propo- 
sitions inspired by the “doctrinal paradox” is pre- 
sented in List and Pettit 2002 and has been generalized 
by Pauly and van Hees 2003. Earlier precursors are 
Guilbaud 1966 and Murakami 1968.7 But so far only 
simultaneous decisions on multiple interconnected 
propositions have been modeled. 

My model represents sequential decisions—indi- 
vidual and collective—and allows the study of path- 
dependence. I show that the order in which propositions 
are considered may affect the decision outcome even 
when the initial views of the decision-maker(s) on each 
proposition are held fixed. In individual decisions, cer- 
tain violations of perfect rationality are necessary and 
sufficient for path-dependence. A boundedly rational 
individual may be susceptible to path-dependence. In 
collective decisions, path-dependence may occur even 
when all individuals are perfectly rational. This fol- 
lows from an impossibility result analogous to Arrow’s 


2 Nehring and Puppe (2002), discussed below, have also modeled 
decisions on multiple propositions, but with less general intercon- 
nections. A recent contnbution is Dietrich 2003. 

3 Both reformulated preference aggregation m terms of the aggre- 
gation of ranking propositions, where different ranking propositions 
(eg, “A > B” “B > C” “A> C”) are logically interconnected. Both 
observed that majority voting, or some other propositionwise aggre- 
gation, over such ranking propositions does not generally preserve 
consistent interconnechons between these propositions. Guilbaud 
abstracted this effect (which he called the “Condorcet effect”) from 
ranking propositions to general binary propositions, so he arguably 
anticipated the more recent results on the “doctrinal paradox.” Nei- 
ther Gullbaud nor Murakami developed a general model of decisions 
on multiple interconnected propositions based on propositional logic 
(e.g, allowing decisions on atomic and compound propositions) 
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(1951) theorem. Path-dependence may be the price a 
group has to pay for achieving collective consistency. 

I also investigate two types of strategic manipula- 
tion opened up by path-dependence. Unsurprisingly, 
path-dependence may enable an agenda-setter to ma- 
nipulate the outcome by controlling the decision-path 
(Riker 1982). More surprisingly, the mere existence of 
an alternative decision-path that would change the out- 
come—even if that path is never adopted—may create 
incentives for strategic expression of untruthful views. 
This finding is related to the Gibbard—Satterthwaite 
theorem (Gibbard 1973; Satterthwaite 1975), albeit in 
a sequential context.‘ Finally, I explore some escape 
routes from path-dependence. 

Previous social-choice-theoretic work on path- 
dependence has addressed cases where a winning out- 
come or ordering is determined in sequential pair- 
wise majority voting over multiple alternatives (e.g., 
Plott 1973), but not cases where a system of accepted 
propositions with logical interconnections is built up 
over time. In a judicial context, Kornhauser (1992) 
has observed that if judges’ preferences are nonsep- 
arable—their judgments on different cases are inter- 
dependent—then sequential decisions may be path- 
dependent. This finding motivates developing explicit 
models of such interconnections. Page (1997, 2003) 
has modeled sequential decisions on multiple projects 
made by a single decision-maker. The projects create 
externalities for each other, and negative externalities 
may lead to path-dependence. Page’s results also em- 
phasize constraints between decisions, but these are 
due to externalities between projects, not logical inter- 
connections between propositions. 

My model is a first approach to representing how de- 
cisions on multiple interconnected propositions evolve 
over time. It makes many simplifications that call for 
generalization in future work. But, methodologically, 
I use propositional logic to model a class of decision 
problems not adequately captured by classical models. 
Substantively, I aim to identify how path-dependence 
arises in such decision problems and what its implica- 
tions are. 


TWO STYLIZED EXAMPLES 
A Collective Decision Problem 


The first example concerns a multimember government 
making multiple decisions that constrain each other 
across time (Pettit 2001b). The propositions under con- 
sideration are as follows. 


P: A new (costly) education project shall be imple- 
mented. 

Q: A new (costly) health care project shall be imple- 
mented 


4 Taylor (2002, 2004) discusses recent developments on strategic 
voting 

5 Stone Sweet (2002) discusses path-dependence in precedent-based 
legal reasoning. 
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P Q 
(Implement (implement 
education health care 
project) project) 


R If P and Q 
(Implement 
defense 


and A, then T 
(keep the budget 





Individual 1 Yes Yes 
Individual 2 Yes No 
Individual 3 No Yes 


project) 
No 


T 
No Yes 

Yes No 

Yes No 





R: A new (costly) defense project shall be imple- 
mented. 
T: Taxes shall be increased. 


The pursuit of all three projects leads to a budget 
deficit, given current tax revenues, whereas the pursuit 
of two or fewer projects is possible within the existing 
budget. I here assume that the government members 
unanimously accept a balanced budget constraint: P, 
Q, and R can be accepted simultaneously only if T is 
also accepted, formally: 


“Tf P and Q and R, then T.” 


My model also allows disagreement on such a con- 
straint. Let the government have three members, with 
views as shown in Table 1. Each member’s views are 
consistent. 

The propositions are considered one by one. At each 
point, a majority vote is taken on the proposition under 
consideration. If the majority opinion is consistent with 
all prior decisions, the government decides the new 
proposition according to that opinion; if it is not, the 
government decides the new proposition based on what 
those prior decisions imply. So earlier decisions con- 
strain later ones. Consider two alternative sequences 
in which the propositions might come up. 


Case 1. The balanced budget constraint is considered 
in January and accepted unanimously. The tax increase 
proposal is considered in February and rejected unan- 
imously. The education and health proposals are con- 
sidered in March and April and are each accepted by 
a two-thirds majority. When the defense proposal is 
considered in May, it is supported by a two-thirds ma- 
jority, but the government is already committed to two 
costly projects, to not increasing taxes and to balancing 
the budget. To avoid losing credibility (and the next 
election), the government rejects the defense proposal, 
overruling its majority opinion on that proposal. Over- 
all, the government accepts P and Q (and the budget 
constraint) and rejects R and T. 


Case 2. Again, the balanced budget constraint is 
considered and accepted in January. But the educa- 
tion, health, and defense proposals are considered in 
February, March and April and are each accepted by 
a two-thirds majority. As a tax increase has not been 
ruled out yet, accepting all three proposals is still con- 
sistent with the balanced budget constraint. But in May 
a dilemma arises. All government members oppose a 
tax increase, but the prior commitments to balancing 
the budget and to pursuing all three costly projects 





necessitate accepting a tax increase. Overall, the gov- 
ernment accepts P, Q, R, and T (and the budget con- 
straint). 

Cases 1 and 2 lead to mutually contradictory out- 
comes. The individual views are identical in both 
cases, as is the government’s method of deciding each 
proposition by a majority vote, unless prior commit- 
ments intervene. The two cases differ only in the or- 
der in which the propositions are considered: The 
decision process is path-dependent. Could the gov- 
ernment use a different decision rule to avoid such 
path-dependence? 


An Individual Decision Problem 


The second example concerns political argumentation. 
It is related to work by Aragones et al. (2002) on 
rhetoric, suggesting that one can change an agent’s be- 
liefs without giving her new information, just by orga- 
nizing existing information differently. Aragones et al 
analyze the effects of rhetorical analogies. The present 
example suggests that similar effects can be achieved by 
adjusting the order in which propositions are presented 
to the agent(s) for consideration. As this phenomenon 
can be strategically exploited, it is relevant to political 
manipulation (McLean 2002; Riker 1986). Imagine an 
individual agent who has a disposition to accept each 
of the following propositions: If she were to consider 
each proposition in isolation, then she would find the 
proposition sufficiently plausible to accept. 


P: Young people are entitled to unrestricted freedom 
after school. 

Q: Compulsory national service reduces crime among 
young people. 

R: Compulsory national service is justifiable. 

“If P, then not R”: If young people are entitled to 
unrestricted freedom after school, then compulsory na- 
tional service is not justifiable. 

“If Q, then R”: If compulsory national service re- 
duces crime among young people, then compulsory 
national service is justifiable. 


One may notice an inconsistency between these 
propositions, but I leave this point aside until later.® 
The agent considers the propositions one by one. At 


6 The example is not affected by assuming that the agent has no initial 


view at all on R (instead of assuming that she is disposed to accept R 
in isolation). The agent's initial views on all other propositions—that 
these propositions are plausible in molation—are essential. 


497 


A Model of Path Dependence 


each point, she finds the proposition under considera- 
tion initially plausible, as assumed. So if the proposition 
is consistent with her previously accepted propositions, 
she accepts it. If it is not, she rejects it, to respect her 
prior decisions. As in the government example, the 
agent’s earlier decisions constrain her later ones. Con- 
sider two alternative sequences in which the proposi- 
tions might come up—or in which a Rikerian “heres- 
thetician” might present them to the agent: 


Case 1. At times 1 and 2, the agent considers whether 
young people are entitled to unrestricted freedom af- 
ter school (P) and whether this implies that compul- 
sory national service is unjustifiable (“If P, then not 
R”). She finds each proposition plausible and accepts 
it. So, at time 3, when she considers whether com- 
pulsory national service is justifiable (R), she rejects 
that proposition. At time 4, she considers and accepts 
that compulsory national service would reduce crime 
among young people (Q). At time 5, she considers 
whether such crime reduction would be sufficient to 
justify compulsory national service (“If Q, then R”) 
and rejects the proposition, to respect her previous 
decisions. Overall, the agent accepts P, Q, and “If P, 
then not R” and rejects R, and “If Q, then R.” 


Case 2. At time 1, the agent considers and accepts 
that, if compulsory national service were to reduce 
crime among young people, this would make compul- 
sory national service justifiable (“If Q, then R”). At 
time 2, she considers and accepts that it would reduce 
crime (Q). So, at time 3, when she considers whether 
compulsory national service is justifiable (R), she con- 
cludes that itis. At time 4, she considers the proposition 
that, if young people were entitled to unrestricted free- 
dom after school, this would make compulsory national 
service unjustifiable (“If P, then not R”). She accepts 
this proposition, having not yet considered whether 
young people are entitled to such freedom. At time 5, 
she considers this issue (P) and rejects the proposition, 
following her prior decisions. So she concludes that 
compulsory national service takes priority over young 
people’s freedom after school. Overall, the agent ac- 
cepts Q, “If Q, then R”, R, and “If P, then not R” and 
rejects P. 


Cases 1 and 2 lead to mutually contradictory out- 
comes. Only a single agent is involved, who in both 
cases is initially disposed to accept each proposition. In 
each case, the agent uses only valid logical inferences. 
The two cases differ only in the order in which the 
propositions are considered: The decision process is 
path-dependent. What has happened? 


THE MODEL 


The Propositions 


I use a simple language of propositional logic, which 
includes both atomic and compound propositions. 
Atomic propositions are the most basic ones; they are 
represented by the (finitely many) letters P, Q, R, S,.... 
Compound propositions state interconnections be- 
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tween other propositions; they involve logical connec- 
tives, such as — (not), A (and), v (or), > (implies), <> 
(if and only if). 


Definition. Theset of all propositions of the language, 
L, is defined by three rules: 


e Each of P, Q, R, S, ...is a proposition. 
e If ġ and y are propositions, then so are —¢, (¢ A W), 
($ v Y), (¢> v), (Q e ¥). 


e There are no other propositions. 


Examples of Atomic and Compound Propositions. 
Examples of atomic propositions are the premises 
(P, Q) and the conclusion (R) in the court example, 
the policy and tax proposals (P, Q, R, T) in the gov- 
ernment example, and the propositions about freedom 
(P), crime reduction (Q) and the justifiability of na- 
tional service (R) in the argumentation example; ex- 
amples of compound propositions are the legal rule “R 
if and only if P and Q,” formally (R + (PA Q)); the 
balanced budget constraint “If P and Q and R, then 
T,” formally, ((P A QA R) > T); and the propositions 
relating freedom and crime to the justifiability of na- 
tional service, “If P, then not R” and “If Q then R,” 
formally, (P—> —R) and (Q > R). 


Truth-Values. An assignment of truth-values (true or 
false) to the propositions has the following properties. 
For any two propositions ¢, Y, 


e —¢ is true if and only if ¢ is false; 

© (¢ A 4) is true if and only if both ¢ and ware true; 

e ($ v y) is true if and only if at least one of ¢ or Wis 
true; 

e (¢ — w) is true if and only if it is not the case that [ 
is true and wis false]; 

e (¢ < wp) is true if and only if ¢ and y are both true 
or both false.” 


Properties of Propositions. A proposition ¢ is a tau- 
tology if it is true for every assignment of truth-values; 
¢ is a contradiction if it is true for no assignment of 
truth-values. A set of propositions ® logically entails 
a proposition ¢—denoted ©| = ¢—if, for every assign- 
ment of truth-values, [if all propositions in ® are true, 
then ¢ is also true]. Two propositions ¢ and y are 
logically equivalent if each entails the other. A set of 
propositions ® is logically consistent if there exists an 
assignment of truth-values for which every proposition 
in @ is true; © is logically inconsistent if there exists no 
such assignment. 

The Propositions under Decision. The set of proposi- 
tions on which decisions are to be made is a finite subset 
X of L. The propositions in X must be neither tautolo- 
gies nor contradictions (so they are “open”). I assume 
that X contains at least two distinct atomic proposi- 
tions, P and Q, and the compound proposition (P A Q). 
The use of conjunction (A) implies no loss of general- 
ity; using other logical connectives would yield similar 


7 Formally, such an assignment is represented by a truth-function v: 
L — {1, 0}, where, for any proposition ¢, »(¢)=1 means that ¢ is 
true, and »(¢) =0 means that ¢ is false. 
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results. For every ġe X, ——¢ is identified with ¢. I 
further assume that X contains proposition-negation 
pairs: Whenever ¢ € X, then —¢ € X. In the government 
example, X is the set containing the propositions P, Q, 
R,T, ((P A Q A R) > T), and their negations. 


Propositions in Politics. Individual proposals offered 
for acceptance or rejection in political decisions, such as 
the policy and tax proposals above, can be represented 
as atomic propositions. There are also many instances 
of compound propositions in politics, although they are 
not usually so construed. I have interpreted a budget 
constraint rule as a compound proposition in a polit- 
ical decision-making context and propositions on the 
implications of freedom and crime for the justifiability 
of national service as compound propositions in a polit- 
ical argumentation context. For further examples, think 
of some proposal, P, and some amendment clause, A. 
Then the proposal with that amendment can be repre- 
sented as the compound proposition (P ^ A).° Or think 
of some political goal, G, and some action, A. Then 
the claim that A is a necessary means for achieving 
G can be represented as the compound proposition 
(G— A). Political argumentation frequently involves 
claims about how different political propositions, val- 
ues, goals, or actions are interconnected. Such claims 
can be represented as compound propositions. Cru- 
cially, compound propositions need not be exogenously 
fixed; they can themselves be the subject of disagree- 
ment and decision. 


The Propositional Attitudes of an Agent 


I first assume that there is one agent. The agent can be 
an individual or a group of individuals acting collec- 
tively. I discuss the collective case explicitly below. 


Propositional Attitudes. I assume that, for each 
proposition ¢, the agent has a propositional attitude 
toward œ, which can be interpreted in (at least) two 
ways. On one interpretation, it is the agent’s fully en- 
dorsed view on ¢: acceptance or nonacceptance of ¢. On 
another, it is the agent’s initial disposition on œ: the 
verdict (acceptance or nonacceptance) he or she would 
give on ¢ if he or she were to consider ¢ in isolation, 
not referring to other propositions. Which interpreta- 
tion applies depends on whether the agent has already 
considered ¢. If the agent has, he or she may have a 
fully endorsed view on ¢. But even if not, he or she 
may still have an initial disposition on ¢ if he or she 


8 The set of connectives {—, A} is expresstvely adequate: For any 
proposition from L, there exists a logically equivalent proposition 
whose only connectives are — and A. 
9 The famous killer amendment problem can be restated as a path- 
problem in a decasion on multiple propositions. Suppose 
that there are three mdividuals: The first supports the proposal P and 
the amendment A, the second P but not A, the third A but not P. 
If all individuals reveal them views truthfully, there is a two-thirds 
majority for P, a two-thirds majority for A, but a two-thirds majority 
against (P A.A), the amended proposal. If the group votes on P 
first and on A next, then both are accepted; by implication (P A A) 
is also accepted. If the group votes on A first and on (P AA) next 
(having accepted A), then (P A A), and by implication P, is rejected. 
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were to consider ¢. That initial disposition might be a 
function of the prima facie plausibility of ¢ from the 
agent’s perspective. The agent’s propositional attitudes 
are represented by an acceptance/rejection function. 


Definition. An acceptance/rejection function (“AR- 
function”) is a function 8: X—> {1, 0}. 


For each proposition ¢ € X, 6(¢) =1 means that the 
agent accepts ¢ (or has an initial disposition to accept 
¢); 6(¢) =0 means that the agent does not accept ¢ (or 
has an initial disposition not to accept ¢). Note that 
5(¢) =0 does not by itself mean that the agent accepts 
the negation of ¢ (or has such an initial disposition). 
The agent accepts that negation (or has such an initial 
disposition) if and only if §(-@) = 1. For brevity, I usu- 
ally refer to 8(@) =1 as “acceptance” of ¢ and to 4(¢) =0 
as “rejection” of ¢, bearing in mind the more precise 
interpretation offered here.!° As discussed below, an 
agent’s propositional attitudes may or may not satisfy 
certain rationality conditions." I also discuss various 
rationality violations. 


The Propositional Attitudes ofa Group. If the agent is 
a group, ascribing propositional attitudes to that group 
does not presuppose any metaphysical assumptions 
about “group minds.” As illustrated by the government 
example and discussed below, the group’s propositional 
attitude on a proposition can be its majority verdict on 
that proposition, or the outcome of some other aggre- 
gation over the propositional attitudes of the group 
members. 


A Sequential Decision Process 


A Decision-Path. A decision-path is the order in 
which the propositions are considered in a sequential 
decision process. It can be interpreted in (at least) two 
ways. On one interpretation, it is the temporal order in 
which the propositions are considered: Earlier proposi- 
tions in the path come up earlier in time than later ones. 
On another, it is the order of importance or priority 
assigned to the propositions: Earlier propositions in 
the path are more important or “weightier” than later 
ones. 


Definition, A decision-path on X is a one-to-one func- 
tion Q: {1, 2,..., k} —> X, where k is the number of 
propositions in X. 


10 The agent’s propositional attitudes are modeled as acceptance 
or rejection and do not allow degrees of belief. This 1s realistic in 
a political context where propositions are ultmately accepted or 
rejected, particularly by voting. My model might be generahzed by 
using a logical system with more than two truth-values or by using 
real-valued credence functions of the form 8: X > [0,1] Pauly and 
van Hees (2003) have extended the analysis of judgment aggregaton 
to many-valued logics, allowing different (discrete) degrees of belief, 
and shown that results simular to Propositions 4 and 5 below continue 
to hold. This suggests that results similar to Theorems 4 and 5 may 
also continue to hold. 

11 Tf the agent is perfectly rational, his or her AR-function can be 
extended to a truth-function. 
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Here (1), (2), ..., Q(x) are the first, second, ..., 
kth propositions to be considered.” 


The Notion of a Sequential Decision-Process. Indefin- 
ing a sequential decision process, it is useful to interpret 
the agent’s propositional attitudes as initial disposi- 
tions: The agent enters the decision process without 
fully endorsed views on the propositions, but with ini- 
tial dispositions. The agent considers the propositions 
in the order given by a decision-path Q and decides 
at each point whether or not to accept the proposition 
under consideration, say, ¢. The outcome of the deci- 
sion process can then be interpreted as the set of fully 
endorsed views the agent has formed on the proposi- 
tions after having considered them one by one along 
the path 2. How does the agent decide whether or not 
to accept each proposition in the sequence? Suppose 
that proposition ¢ is under consideration. There are 
two cases. Hither the agent’s initial disposition on ¢ is 
consistent with the propositions (if any) the agent has 
accepted earlier: If so, the agent accepts or rejects @ 
according to that initial disposition. Or there is a logical 
inconsistency between the initial disposition on ¢ and 
some previously accepted propositions: If so, the agent 
requires a method of resolving this inconsistency. 


A Conflict Resolution Rule. I call a method by which 
the agent can resolve an inconsistency between his or 
her disposition on a new proposition and previously 
accepted propositions a conflict resolution rule. Under 
the priority-to-the-past rule, as in the two examples, the 
agent resolves the inconsistency by accepting the log- 
ical implications of previously accepted propositions 
and overruling the initial disposition on the new propo- 
sition. So earlier decisions constrain later ones, but not 
vice versa. Precedent-based decision making is a form 
of priority-to-the-past decision-making. For simplicity, 
I only consider the priority-to-the-past rule here. This 
is a restriction, as there are cases where other conflict 
resolution milesi are more plausible, but the model can 
be generalized.” 


12 In the examples, I assumed that each proposıtion-negation pair 
is considered at the same tme in the decision-path and that the 
individuals reject a proposition if and only if they accept its negation. 
My formal model is more general: It allows ¢ and —ẹ to be considered 
at different (possibly even nonadjacent) tmes in the decision-path. 
DB Other rules include the priority-to-the-present rule, by which the in- 
consistency is resolved by accepting the initial disposition on the new 
proposition and suitably revising previously accepted propositions 
The agent might use this rule in cases where, given new evidence, he 
or she would rather reject some previously held beliefs than deny 
the new evidence. There are interesting research avenues here; see 
the literature on belief revision, e g , Rott 2001. My model also al- 
lows generalizatons that preserve the priority-to-the-past rule. For 
example, suppose that certain propositions are nonnegotiable. The 
agent would never overrule them even given a logical conflict. Ex- 
amples might be strongly believed factual premises or propositions 
considered a priori truths. The constramt that the priority-to-the-past 
rule should never overrule such propositions can be formalized in 
my model by restricting the set of admissible decision-paths to those 
where nonnegotiable propontions occur first. If the set of nonnego- 
tiable propositions is logically consistent, under the modified model 
the agent would never overrule a nonnegotiable proposition in a 
priority-to-the-past decision process. 


500 


When Is the Priority-to-the-Past Rule Plausible? Let 
me suggest an illustrative list of cases (neither ex- 
haustive nor mutually exclusive) in which the priority- 
to-the-past rule is plausible. First, suppose that the 
decision-path is the temporal order in which the propo- 
sitions are considered. Then the priority-to-the-past 
rule is plausible in cases where decisions, once made, 
are to hard to overrule. This may be because: 


e Earlier decisions may have created commitments 
that cannot (easily) be overruled, for several rea- 
sons: (i) Legal reasons: Earlier decisions may be 
legally binding; they may have involved passing cer- 
tain legislation, changing constitutional rules, or they 
may serve as precedents. (ii) Social or cognitive rea- 
sons: Earlier decisions may be so entrenched in the 
system of beliefs or judgments of the agent(s) that 
they cannot be overruled without a substantial cost. 
(iii) Strategic reasons: If earlier decisions are publicly 
known, the agent may lose credibility by not respect- 
ing those decisions; this loss of credibility may be 
costly; the agent might lose the next election, for 
example. 

e Earlier decisions may have led to actions that (i) are 
irreversible, or (ii) are perceived by the agent(s) to 
be irreversible, or (iii) are too costly to reverse in 
practice. 


Second, suppose that the decision-path is the order of 
importance or priority among the propositions. Then 
the priority-to-past rule is a natural way of making de- 
cisions based on that order: Decisions on more impor- 
tant propositions constrain decisions on less important 
ones. My model is neutral between different interpre- 
tations of the priority-to-the-past rule. 


Definition. A priority-to-the-past decision process is 
the following procedure. Consider the propositions 
along the decision-path Q: 4) :=Q(1) at time 1, ¢2:= 
(2) at time 2,..., ¢k:= Q(k) at time k, where k is the 
number of propositions in X. For each t = 1,2,...,k, 
®, is the set of all propositions accepted up to time t, 
where ®, is defined inductively as follows: 


t=0 (added): ®o is the empty set. 
t>0: Proposition ¢, is under consideration. There 
are two cases. 


Case I: (i) Previously accepted propositions entail 
$: (Pr-1| = ¢y) or (ii) they entail ~g, (,-1| = -4,). Then 
F a U {ø} if (i), 
[OU {py} if ci 


Case II: , and —@, are each consistent with pre- 
viously accepted propositions (neither ,_1|=, nor 


14 If we already have ¢ € O;_1 or >¢ € D1, then P; = ;_1 under 
ths definition. 
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;-1| = —,). Then 


D1 U {h} if8(@) =1, 

D if 6(¢,) = 0.5 
Define the outcome set of the decision process, where 
the agent’s AR-function is ô and the decision-path is 
Q, to be M(8, Q) := D+. 

By definition, a proposition that is inconsistent with 
previously accepted propositions is never accepted in 
a priority-to-the-past decision process. 

Proposition 1. For any & and any Q, M(8, Q) is logi- 
cally consistent. 


pe 


Finally, I define a priority-to-the-past decision pro- 
cess to be decisive if, for every proposition in X, either 
the proposition or its negation is accepted in the pro- 
cess. 


Definition. M(8, Q) is decisive if, for every ¢ €X, ei- 
ther ¢ € M(8, Q) or ~ e M(8, Q). 


An Illustration from Politics. In a discussion of cred- 
ible commitment and path-dependence, Miller and 
Schofield (2003) give two empirical illustrations of how 
prior commitments can constrain political decisions. 
First, they argue that in 1972, as a result of the Re- 
publican Party’s socially conservative and anti-civil- 
rights decisions in the 1960s (including Goldwater’s 
presidential candidacy in 1964), Nixon had little choice 
but to continue his party’s ideological course and to 
pursue a “Southern strategy” that included a socially 
conservative law-and-order position. They suggest that 
Nixon would have been so constrained even if, hypo- 
thetically, he had considered an alternative “Northern 
strategy” that included a pro-civil-rights position—a 
position that Republicans had been inclined toward up 
to the early 1960s, to appeal to black voters. Second, 
they argue that in the mid-1990s, “Clinton was under a 
great deal of pressure from liberals in his party to re- 
store the Democratic Party to the economic liberalism 
of the New Deal” (258) But to do so, Clinton would 
have had to appeal to Southern and other social con- 
servatives, which “was no longer credible for the party 
that had supported civil rights and affirmative action 
for 30 years” (258). So Clinton had little choice but to 
pursue a more moderate economic policy. Miller and 
Schofield argue that “candidates are constrained by 
recent historical events that introduce an asymmetry in 
[their] calculations” (258), where those events include 
their parties’ previous decisions and commitments. The 
analysis suggests that Nixon and Clinton had strong 
strategic reasons—in the sense defined above—for re- 
specting their parties’ prior commitments. But note 


15 The reason for not defining ®; := ,_1 U {—¢;} if 8(¢,) =0 is to al- 
low consideration of ¢, and —¢ at separate steps in the decision-path 
If 3 is Incomplete (defined below), this allows indecisive outcome 
sets (¢ ¢ M(8, 2) and —¢ ¢ M(8, Q)) when X$) =0 and &(-$) = 0 and 
neither ¢ nor —¢ is entailed by propositions accepted before ¢ or —¢ 
is considered. 
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that the presence of constraints on decisions due to 
prior commitments is not the same as path-dependence 
and leads to path-dependence only under certain con- 
ditions, as shown below. 


The Notion of Path-Dependence 


A priority-to-the-past decision process is invariant un- 
der changes of the decision-path if its outcome is the 
same for all decision-paths. It is weakly path-dependent 
if there exist (at least) two decision-paths with differ- 
ent outcomes. It is strongly path-dependent if there 
exist (at least) two decision-paths with contradictory 
outcomes. Let ô be given. 


Definition. M(8, Q) is invariant under changes of the 
decision-path if, for any two decision-paths Q; and Qh, 
M(6, 21) = M(6, Q2). 


Definition. M(8, Q) is weakly path-dependent if, for 
some proposition ¢ € X, there exist two decision-paths 
Qı and Rz such that de M(8, Q1) and ¢¢ M(8, Q2). 


Tf ¢ is such a proposition, I say that M(é, Q) is weakly 
path-dependent with respect to 9. 


Definition. M(8, Q) is strongly path-dependent if, for 
some proposition ¢ € X, there exist two decision-paths 
Qı and Q2 such that ¢ eM(ê, Q1) and -ge M(, R2). 


If @ is such a proposition, I say that M(8, 9) is 
strongly path-dependent with respect to ġ. Strong path- 
dependence implies weak path-dependence. The con- 
verse holds only in special conditions: 


Proposition 2. Suppose that M(8, &) is decisive for 
all decision-paths &. Then M(6, 82) is strongly path- 
dependent if and only if it is weakly path-dependent. 1$ 


Weak and strong path-dependence may differ when 
M(6, Q) is indecisive for some decision-path 2. Below I 
state necessary and sufficient conditions for both kinds 
of path-dependence, but I focus mainly on strong path- 
dependence. 


RATIONALITY VIOLATIONS AND 
PATH-DEPENDENCE 


Ratlonallty Conditions on Propositional 
Attitudes 


I introduce four rationality conditions that an agent’s 
propositional attitudes may or may not satisfy. Com- 
pleteness requires that the agent should accept at least 
one member of each proposition-negation pair. Weak 
consistency requires that the agent should accept at 
most one member of each proposition-negation pair. 


16 Tf M(8, Q1) and M (8, Q2) are both decisive, then the existence 
of $e X such that de M(8, R1) and ¢¢ M(4, Q2) implies that ¢ € 
M(8, Q1) and —¢ € M(8, Oz). 
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Strong consistency requires that the propositions ac- 
cepted by the agent can be simultaneously true. De- 
ductive closure requires that the agent should accept 
all implications of other propositions she accepts. These 
concepts provide rationality criteria over and above the 
standard criterion of transitivity of preferences. Let 8 
be the agent’s AR-function: 


Definition. & is complete if, for any ¢ € X, 6( ¢)=1 or 
8(=) =1. 


Definition. 8 is weakly consistent if there exists no 
e X such that 8(¢) =1 and 6(-¢) =1. 


Definition. 38 is strongly consistent if the set {pe X : 
§(¢) = 1} is logically consistent.’ 


For any subset CX, I write (©) = 1 to mean 
[3(¢) =1 for all ġ € ẹ]. 


Definition. & is deductively closed if the following 
holds: For any logically consistent  C X and any ¢ € X, 
if 5(®) =1 and >| =@, then 6(¢) =1. 


The conditions are interrelated. Strong consistency 
implies weak consistency, but not vice versa. To illus- 
trate, if the agent accepts P, (P— Q) and ~Q, then 
weak, but not strong, consistency is satisfied: The agent 
accepts no proposition and its negation simultaneously, 
but P, (P > Q), and ~Q cannot be simultaneously true. 
Weak consistency and deductive closure jointly imply 
strong consistency.!® In the Appendix, I prove the fol- 
lowing result: 


Lemma 1. §& violates strong consistency if and only if 
there exist two logically consistent subsets Y1, Yı CX 
and a proposition ¢€X such that [8(%,)=1 and 
Wi| = 4] and [8(2) = 1 and 42| = —4]. 


So anecessary and sufficient condition for a violation 
of strong consistency is the existence of two logically 
consistent sets of propositions Y}; and W and a proposi- 
tion ¢ such that the agent accepts all the propositions in 
4; and all in 47, but Y; logically entails ¢, whereas Y2 
logically entails —¢. If this condition is met, I say that 8 
violates strong consistency with respect to ¢. I say that 
ô violates weak consistency with respect to ¢ if 6(¢) =1 
and 8(-¢) = 1.19 I say that ô violates deductive closure 
with respect to ¢ if there exists a logically consistent 
set of propositions &(CX) such that the agent accepts 
all the propositions in ®, ® logically entails ¢, but the 
agent does not accept ¢, formally 6(®) =1,61| = ¢ and 
5(p) =0. 


17 3 is strongly consistent if and only if 8 can be extended to a truth- 
function » on L such that, for all ¢ € X (CL), (6) = W(¢) 

18 All these properties can easily be proved in the propositional 
calculus. 

19 All violations of weak consistency are violations of strong consis- 
tency, but not all violations of strong consistency are violations of 
weak consistency. 
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Necessary and Sufficient Conditions 
for Path-Dependence 


Let me revisit the two examples. Consider the argu- 
mentation example. Are the agent’s propositional atti- 
tudes consistent? They are weakly, but not strongly, 
consistent: The agent is disposed to accept Q and 
(Q— R), which entail R, and to accept P and (P> 
=R), which entail =R. They also violate deductive clo- 
sure, as the agent is disposed to accept P and (P> =R) 
but not =R. Consider the government example. Each 
government member’s views are complete, weakly and 
strongly consistent, and deductively closed. But what 
about the propositional attitudes of the government 
acting collectively, as determined by majority voting 
on each proposition? There are majorities for P, Q, R, 
and ((PA Q ^ R) > T), which entail T, but there is no 
majority for T. So the majority views violate deduc- 
tive closure with respect to T. They also violate strong 
consistency, as there is a majority (unanimity) for =T. 
This suggests that violations of the rationality condi- 
tions, particularly of strong consistency, might lead to 
path-dependence. The following results confirm this 
suggestion. Consider any agent—individual or group— 
making decisions on multiple propositions, where ô is 
the agent’s AR-function. 


Theorem 1. For any ¢€ X, M(8, Q) is strongly path- 
dependent with respect to $ if and only if 8 violates strong 


consistency with respect to œ. 
Proof. The proofs of all theorems are in the Ap- 
pendix. E 


By Theorem 1, if (and only if) the agent’s propo- 
sitional attitudes violate strong consistency with re- 
spect to some proposition ¢, there exist (at least) two 
decision-paths such that under one path ¢ is accepted, 
whereas under the other —¢ is accepted. An agent’s 
propositional attitudes violate strong consistency either 
when they violate weak consistency or when they are 
weakly consistent but an inconsistency is “hidden” by 
a violation of deductive closure. 


Proposition 3. Suppose that 8 is complete and weakly 
consistent. For any ġ € X, & violates strong consistency 
with respect to ġ if and only if 8 is not deductively closed 
with respect to one of $ or >. 


The conjunction of Theorem 1 and Proposition 3 
yields a necessary and sufficient condition for strong 
path-dependence in the case where the agent’s proposi- 
tional attitudes are complete and weakly consistent. In 
this case, there exist (at least) two decision-paths with 
mutually inconsistent outcomes on some proposition ¢ 
if and only if the agent’s propositional attitudes violate 
deductive closure with respect to ¢: 


Theorem 2. Suppose that 8 is complete and weakly 
consistent. For any ¢€X, M(8, Q) is strongly path- 
dependent with respect to $ if and only if 8 is not de- 
ductively closed with respect to one of ¢ or 4. 


Although strong path-dependence is ruled out 
when the agent’s propositional attitudes are strongly 
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consistent (by Theorem 1), weak path-dependence is 
still possible. 

Theorem 3. Suppose that 8 is strongly consistent. For 
any pe X, M(8, Q) is weakly path-dependent with 
respect to $ if and only if 8 is not deductively closed 
with respect to . 


Theorem 3 states that if the agent’s propositional 
attitudes are strongly consistent, then there exist (at 
least) two decision-paths with different outcomes on 
some proposition ¢@ if and only if these proposi- 
tional, attitudes violate deductive closure with respect 
to ¢. 

If the agent is an individual, the conditions for 
avoiding path-dependence are clear. If this individ- 
ual’s propositional attitudes are weakly consistent and 
deductively closed (hence strongly consistent), he or 
she is immune to path-dependence in a priority-to- 
the-past decision process. By contrast, if the agent’s 
propositional attitudes violate strong consistency or 
deductive closure, he or she may be manipulable by 
a “heresthetician” presenting the propositions in some 
strategic order. Whereas it is unsurprising that irra- 
tionality can make individuals manipulable, my results 
imply that even a boundedly rational individual may 
be manipulable. As shown, a violation of deductive 
closure is sufficient for path-dependence, even when 
weak consistency is satisfied.” Deductive closure vio- 
lations are empirically plausible: Whereas individuals 
might be sufficiently rational not to accept a propo- 
sition and its negation simultaneously, they may lack 
the computational resources needed to derive all log- 
ical implications of the beliefs they hold. My results 
therefore suggest that bounded rationality might be 
of relevance to certain phenomena of political ma- 
nipulation. This suggestion raises interesting empirical 
questions on whether individuals satisfy or violate the 
identified rationality conditions and on whether there 
are cases of political manipulation targeted specifically 
at the fact that agents are boundedly, but not perfectly, 
rational. 

If the agent is a group, the conditions for avoiding 
path-dependence are less clear. It would be desirable to 
find a method of aggregating the group members’ views 
into a collective AR-function that satisfies complete- 
ness, weak consistency, and deductive closure (and thus 
strong consistency). By Theorems 2 and 3, this would 
rule out path-dependence. The multimember govern- 
ment example has illustrated that majority voting does 
not generally have these properties. I next show that 
this is not an accidental property of majority voting: 


2 Under Theorem 2’s conditions—the agent’s propostional atti- 
tudes are complete and weakly consistent—violations of deductive 
closure are necessary and sufficient for strong path-dependence, here 
violations of deductive closure imply violations of strong consistency. 
By contrast, under Theorem 3’s conditions—the agent’s proposi- 
tional athtudes are strongly consistent—violations of deductive do- 
sure are necessary and sufficient for weak path-dependence; here 
violations of deductive closure imply violations of completeness. 

21 Tf 3 is strongly consistent, a deductive closure violation umplies 
weak path-dependence, by Theorem 3. If 8 is not strongly consistent, 
this nnplies strong path-dependence, by Theorem 1. 
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Under certain conditions, no aggregation function with 
the desired properties exists. 


PATH-DEPENDENCE AT THE COLLECTIVE 
LEVEL: A GENERAL RESULT 


The Setting 


I now assume that the agent is a group of individuals, 
represented by the set N={1,2,...,m} (n>2). The 
views of each individual, i € N, on the propositions in 
X are represented by an AR-function ô, : X > {1, 0}, 
defined as before. I here interpret the individuals’ 
propositional attitudes as fully endorsed views and as- 
sume, as a best-case scenario, that each individual’s 
AR-function 4, is complete, weakly consistent, and de- 
ductively closed (and thus strongly consistent). 


Definition. A profile of individual AR-functions 
(“profile”) is a vector of AR-functions across the in- 
dividuals, (8;) = (61, &, as | 8n)- 


Different individuals may hold different views not 
only on atomic propositions, but also on compound 
ones.” This captures the fact that in politics individu- 
als may disagree not only on specific policies, actions, 
or reasons, but also on relations between reasons and 
actions, means-ends relations, constraints between dif- 
ferent policies, budget constraints, and so on. 


Aggregation Functions 


To determine the group’s propositional attitude to a 
proposition ¢—its initial disposition on ¢—a method 
of aggregating the n individual views on ¢ into a single 
collective propositional attitude on ¢ is needed. For 
each @¢, the vector of zeros and ones, (51(@), 62(@), ..-, 
3n(@)), must be aggregated into a single value of either 
zero (nonacceptance of ¢) or one (acceptance of ¢). 


2 Nehring and Puppe’s (N&P; 2002) model can also represent de- 
cisions on multiple propositions, but with less general mterconnec- 
tions In N&P, agents do not have AR-functions over propositions 
from propositional logic, but preferences over vectors of properties, 
(a1, @2,..., am) (€ {0, 1}*). If all individuals accept the same com- 
pound propositions, the logical structure in my model can be repre- 
sented in N&P’s property structure, by identifymg each property in 
N&P with an atomic proposition and representing the unanimously 
accepted compound propositions by restricting the set of alterna- 
tives, Le., the set of admissible (a), a2,...,@,) vectors For example, 
if all individuals accept (R + (PA Q)), then each vector (a1, a2, a3) 
corresponds to an assignment of truth-values to P, Q, and R, the rule 
(R + (PA Q)) can be captured by restricting the set of alternatives 
to {(1,1,1), (1,0, 0), (0,1, 0), (0, 0, 0)}. But if individuals disagree 
about compound propoaitions, then they also about the 
set of admusmble alternatives. If individual 1 accepts (R + (P A Q)), 
whereas mdividual 2 accepts (~R + (P ^ Q)), then for individual 
1 the set of admissible alternatives is {(1, 1, 1), (1,0, 0), (0,1, 0}, 
(0, 0, 0)}, whereas for indrvidual 2 it is {(1, 1, 0), (1,0, 1), (0,1, 1), 
(0, 0, ih N&P’s model requires a single set of alternatives There- 
fore my model can represent more general logical connections than 
N&P’s model. 
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A Model of Path Dependence 


Definition. An aggregation function is a function 6: 
{0, 1}" — {0, 1}. 


Examples of aggregation functions are majority vot- 
ing, as used above, and a dictatorship of one indi- 
vidual, analogous to the definition in Arrow’s (1951) 
preference-based model. 


Definition. Majority voting is the aggregation func- 
tion ô: a Da {0, 1} defined as follows. For any 
(di, do, ., da) € {0, 1)", 


1 if} d>n/2, 
tEeN 


8(d,, dh, ..., dq) = 


0 otherwise. 


Definition. A dictatorship of individual ie N is the 
aggregation function å : {0, 1}"— {0, 1} defined as 
follows. For any (dı, d2,..., dn) € {0, 1}", 8(di, d2, ..., 
dn) = d,. 


Majority voting has the attractive property of giving 
all individuals equal weight in determining the group’s 
propositional attitude on any proposition; it satisfies 
anonymity. A dictatorship violates not only anonymity, 
but also the weaker condition of nondictatorship. 


Definition. An sabe function ô is anonymous 
if, for any (di, dp, .. ) € {0,1} and any permutation 


ao.N>N, 8(d1, da, ... Ren 3(do1); dof)» + + - , do(n)); f] 
is non-dictatorial if it is not a dictatorship of any ieN. 


How can an aggregation function generate a single 
collective AR-function based on a profile of individ- 
ual AR-functions? For each profile (8,), an aggregation 
function 6 induces a collective AR-function êg) : X > 
{0, 1}, where 


foreach $EX, 8)(4):=4(61(#), 820), - +s n(A). 


The collective AR-function å) represents the group’s 
propositional attitudes, as determined hee the 
aggregation function 6 to the profile (6 

A restriction is built into my denaii of an aggre- 
gation function: The aggregation over the n individual 
views is performed on a proposition-by-proposition 
basis. This mirrors the restriction introduced by in- 
dependence of irrelevant alternatives in Arrow’s clas- 
sical model. Under Arrow’s independence condition, 
preferences are aggregated based on pairwise compar- 
isons between alternatives, i.e., based on (what might 
be seen as) pairwise ranking propositions. Whether 
such a restriction is plausible depends on the con- 
text. In many sequential decision problems, collective 


2 When a profile (3,) has been fixed and there is no risk of am- 
biguity, I sometimes identify the aggregation function ô with the 
collective AR-function ô) induced by 8 for the given profile (4). 
On a precise notation: hN M(éy,), N) is the outcome set of a de- 
cision process for the collective AR-function 84) (induced by the 
aggregation function 8 for the profile (8,)) and the deamon-path 2. 
Gi) M( 8, Q) is a function mapping each profile (8,) to a corresponding 
outcome set M(8q,), Q) as defined in (i). Oa ae a simplify 
the notation by As the subscript (8;) and writing M(8, Q) for 
M(Sq,), Q). 
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decisions are taken on a proposition-by-proposition 
basis, by voting separately on each proposition as 
it arises. But, as discussed below, a possible gen- 
eralization of my model is to relax propositionwise 
aggregation. 


Impossibility Results on Determining 
Collective Propositional Attitudes 


I noted that majority voting may sometimes fail to in- 
duce a complete, weakly consistent, and deductively 
closed collective AR-function. Are there other aggre- 
gation functions which always generate collective AR- 
functions with these properties? I now show that, under 
some minimal conditions, the answer is negative. 


Definition. The universal domain, U, is the set of all 
logically possible profiles of individual AR-functions 
satisfying completeness, weak consistency and deduc- 
tive closure. 


Proposition 4 (Corollary of List and Pettit 2002). 
There exists no aggregation function ô (satisfying 
anonymity) that induces, for every (8,) € U, a complete, 
weakly consistent and deductively closed collective AR- 
function OCSE 

Pauly and van Hees (2003). have recently shown that, 
if we drop anonymity, the unique aggregation function 

satisfying the remaining conditions is a dictatorship of 
one individual. 


Proposition 5 (Corollary of Pauly and van Hees 
2003). An aggregation function & induces, for every 
(8,) € U, a complete, weakly consistent and deductively 
closed collective AR-function 8,3) if and only if 8 is a 
dictatorship of some individual i € N. 

These results are analogous to Arrow’s theorem on 
preference aggregation. They are not corollaries of 
Arrow’s theorem, as there is no straightforward map- 
ping from views on multiple interconnected propo- 
sitions into preferences (see also List and Pettit 
2003).24 


24 The exact relation between sets of views on multiple intercon- 
nected propositions and preferences is nontrivial. On one mterpre- 
tation, preferences are a special case of sets of views on multple 
propositions (from predicate logic). An agent’s preference order- 
ing is here identified with a set of binary ranking propositions 
the agent accepts; eg, the preference ordering xPyPz 1s identi- 
fled with the set of propoutions {xPy, yPz, xPz}. Propositions like 
YxYyYz((xPy A yPz) + xPz) can represent conditions hke transitiv- 

. Preference aggregation is then a special case of judgment aggre- 
gation on multiple propositions It ıs arguably harder to interpret 
judgment aggregation as a special case of aggregation. 
Identifying each proposition with a single alternative m a prefer- 
ence context is insufficient, as judgment aggregation requires de- 
termining a collective set of judgments on these propositions, not 
a preference ordermg over them. Identifying a set of propositions 
with a single alternative might be another route, but it might fail 
to make exphat the logical structure of such a set of proposi- 
tions. A third route mght be to identify each proposition with 
a set of alternatives, but this leads to the problem that classical 
models typically take as the unit of choice (or ordermg) not such 
sets but single alternatives Although this might be solved via ag- 
gregation functions that produce winning sets rather than winning 
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impossliblitty Results on 
Path-(iIn)Dependence 


Consider the implications of Propositions 4 and 5. 
Any anonymous or just nondictatorial (proposition- 
wise) aggregation function generating complete and 
weakly consistent collective AR-functions necessarily 
produces deductive closure violations for some profiles 
in the universal domain. By Theorem 2, for these pro- 
files, there exist different decision-paths with mutually 
inconsistent outcomes. This mechanism underlies the 
following result. 


Theorem 4. There exists no aggregation function ô 
(satisfying anonymity or nondictatorship) such that, for 
every (6,) € U, M(8(g), Q) is decisive and invariant under 
changes of the decision-path. 


Moreover, the unique aggregation function that 
guarantees decisiveness and path-independence for all 
profiles in the universal domain is a dictatorship of one 
individual: 


Theorem 5. Let 8 be an aggregation function. For ev- 
ery (8,) EU, M(8a), Q) is decisive and invariant under 
changes of the decision-path if and only if ô is a dicta- 
torship of some individual i € N. 


In summary, suppose that a priority-to-the-past deci- 
sion process satisfies two conditions. Universal domain: 
Any logically possible profile of (rational) individual 
views on the propositions is acceptable as input. De- 
cisiveness: The decision process produces a determi- 
nate decision on every proposition. Then the unique 
(propositionwise) aggregation function for which the 
decision process is generally path-independent is a 
dictatorship of one individual. Add a further condi- 
tion. Anonymity: All individuals have equal weight 
in the decision on each proposition. Or, even more 
weakly, nondictatorship: There does not exist a dic- 
tatorial individual. Then there exists no (proposition- 
wise) aggregation function for which the decision pro- 
cess is generally path-independent. This does not mean 
that the decision process is path-dependent for ev- 
ery profile. But no anonymous or nondictatorial ag- 
gregation function guarantees path-independence. Un- 
der any such aggregation function, there exist some 
profiles for which the decision process is (strongly) 
path-dependent.* 

Does this imply that, at a collective level, path- 
dependence is in principle unavoidable? And, if so, 
what is the cost of this? I first address the second ques- 
tion, and show that path-dependence opens up two 
types of strategic manipulation. I then turn to the first 


alternatives, one would still require a corresponding way of repre- 
senting logical mterconnections It is also unclear what sets of propo- 
sitions would be on that route (e.g., sets of sets of alternatives). The 
most promusng route might be Nehring and Puppe’s (2002), dis- 
cussed above, but even there the representable logical interconnec- 
tions are restricted. It is too early to draw a conclusion. Future work 
might identify more fundamental parallels between the different 
frameworks. 


* Because I consider a decistve M(8, 2) here, strong and weak path- 
dependence coincide. 
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question, and explore some escape-routes from path- 
dependence. 


THE COST OF PATH-DEPENDENCE: 
STRATEGIC MANIPULABILITY 


Manipulation by Agenda Setting 


Whenever the agent’s propositional attitudes violate 
strong consistency with respect to ¢, an agenda-setter— 
whoever chooses the decision-path—may have power 
to determine whether the outcome of the decision pro- 
cess will be ¢ or —¢. By Theorem 1, given a violation 
of strong consistency with respect to ¢, there exists one 
decision-path leading to ¢ and another leading to >œ. 
Given sufficient information and computational power, 
an agenda-setter can determine the decision-path re- 
quired to bring about the preferred outcome. In the 
multimember government example, an agenda-setter 
who cares strongly about the defense proposal (propo- 
sition R) might advocate the decision-path of case 2, 
which results in the acceptance of R. An agenda-setter 
who opposes the defense proposal might advocate the 
decision-path of case 1, which results in the rejection of 
R. This parallels the problem of agenda-dependence in 
preference-based models, albeit within the new domain 
of decisions on multiple interconnected propositions 
(e.g., Plott and Levine 1978; Riker 1982, chap. 7). 


Manipulation by Expression of 
Untruthful Views 


Suppose that a decision process is (strongly) path- 
dependent but a particular decision-path has been 
fixed. If some individual (or group) cares strongly 
about certain later propositions in the decision-path, 
they might strategically express untruthful views on 
certain earlier ones, as decisions on the later proposi- 
tions will be affected by decisions on those earlier ones. 
In the government example, suppose that individual 3 
cares most about the defense proposal (proposition 
R) and is willing to sacrifice his or her support for 
the health proposal (proposition Q), to get his or 
her way on the defense issue. Suppose that the 
decision-path is as in case 1. In April, when Q is con- 
sidered, individual 3 might untruthfully vote against 
Q, bringing about a majority rejection of Q. Then, in 
May, when R is considered, there would no longer be a 
conflict between prior commitments and the majority 
verdict on R; the government would be able to follow 
that majority verdict and accept R. Without individ- 
ual 3’s strategic intervention, the outcome set of the 
decision process would have been {((PA Q ^A R) > T), 
P, Q,—R,—T}. With that intervention, the outcome 
set is {(((PA QAR) > T), P,— Q, R, >T}, an outcome 
individual 3 prefers: Individual 3 has an incentive to 
express an untruthful view on Q. 


26 To formalize this example using the definitions below, one needs 
to assign to individual 3 a preference ordering with respect to which 
individual 3 strictly prefers the outcome set {((PA QA R)> T), 
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I now define strategic incentives formally. Let M be 
the set of all possible outcome sets M(8, Q) (CX) 
of a priority-to-the-past decision process. I assume 
that each individual has certain preferences over these 
possible outcome sets. Individual i’s most preferred 
outcome set—for brevity, denoted ô, like the individ- 
ual’s AR-function—is the one that includes precisely 
those propositions that she individually accepts, {¢ € 
X : & (¢)=1}. The closer an outcome set is to 
this most preferred one—in a sense defined below— 
the more he or she prefers that outcome set. The 
individual’s preferences over the outcome sets are 
represented by an ordering R, on M (reflexive, 
transitive and, connected). The subscripts i and 4, 
indicate that the ordering depends not only on indi- 
vidual i, but also on his or her AR-function 4,. For 
any two outcome sets 1, 2, 2, R,s 2 means that 
individual i weakly prefers Dı to 2. I write © P;,5 P2 
to mean that individual i strictly prefers 2, to 2, 
formally ©1R, 5 P2 and not 2R;, 1). I now formalize 
the assumption that the closer an outcome set is to 
individual ?s most preferred one, the more he or she 
will prefer that outcome set. 


Definition. An outcome set Q; is at least as close to ô, 
as an outcome set > if, for every ¢ € X, 5,(¢) = A2(¢) 
implies 8,(@) = A1(¢), where, for j=1, 2, A,(¢)=1 if 
ped and A,(¢)=O0if d ¢ D. 


Informally, one outcome set is at least as close to an 
individual’s most preferred outcome set as another if 
there is no proposition that the individual accepts and 
that is included in the second outcome set but not in 
the first, and no proposition that the individual rejects 
and that is excluded from the second outcome set but 
included in the first. 


Assumption 1. If an outcome set %; is at least as 
close to 6, as another outcome set 2, then individual i 
weakly prefers D; to 2, i.e., P1R,3 Po. 


As the “at least as close” relation is only a partial, 
not generally complete, ordering over the possible out- 
come sets, this assumption is typically consistent with 
more than one preference ordering; so it is not maxi- 
mally restrictive. My definition of strategic incentives is 
a translation of the classical definition into the frame- 
work of decisions on multiple propositions (compare 
Gibbard 1973 and Satterthwaite 1975). 


Definition. In the decision process M(8, Q) defined 
on the domain D, an individual ie N has an incen- 
tive to express an untruthful AR-function at the pro- 
file (6,)€D if there exists an AR-function 6*(# 6,) 
(where (61,...,6%,...,6,)€D) such that ®*P,, 9, 
where ®* = M(sq,., ane Q) and ® = M(Sy), Q).77 


P,Q, R,-T} to the outcome set {((PAQAR)> T), P, Q, 
=R, =T}. 
27 Here ® is the outcome set when every individual ı € N expresses 
the “truthful” AR-function &; &* is the outcome set when individual 
ı expresses the “untruthful” AR-function &f and every other individ- 
ual j expresses the “truthful” AR-function 6,. 
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Informally, individual i has an incentive to express 
an untruthful AR-function at the profile (6,) if three 
conditions hold. (i) If individual i expresses her 
truthful AR-function 4, (holding the other individuals’ 
AR-functions fixed), the decision process leads to 
the outcome set ®. (ii) If individual i expresses 
the “untruthful” AR-function 8* (holding the other 
individuals’ AR-functions fixed), the decision process 
leads to the outcome set ©”. (iii) Individual i strictly 
prefers &* to È. 

The government example shows that path-depen- 
dent decision processes may give individuals incentives 
to express untruthful AR-functions. Whether or not an 
individual has such an incentive in a given case depends 
on several factors: the individual’s preference ordering, 
the decision-path, and whether the individual’s views 
are pivotal for the collective outcome on some propo- 
sitions. Below I prove that (weak) path-dependence is 
a necessary condition for the existence of individuals 
with incentives to express untruthful views. It is not 
a sufficient condition. Even in cases of (strong) path- 
dependence there may not exist a single individual who 
is pivotal for the outcome on a relevant proposition; 
so there may not exist an individual who can single- 
handedly manipulate the outcome. A more technical 
analysis might be used to show that, under some con- 
ditions, (strong) path-dependence implies that there 
exists a coalition of individuals with an incentive to 
express untruthful views. 


Avolding Strategic Manipulation 


Neither type of strategic manipulation is possible when 
a decision process is invariant under changes of the 
decision-path. In the case of manipulation by agenda- 
setting this is obvious. Agenda-setting—determining 
the order in which the propositions are considered— 
has no effect when the decision process is path- 
independent. In the case of manipulation by expression 
of untruthful views, the following result holds. 


Definition. An aggregation function ô is weakly 
monotonic if, for any (di, d2, . . . , dn), (€1, €2) +++) On) E 
{0, 1}", [for every ie N, d, > e,] implies 8(d,, d2,..., 
dy) > 5(€1, €2,---, €n). 


Propositionwise majority voting is a weakly mono- 
tonic aggregation function. 


Theorem 6. Suppose that ô is a weakly monotonic agg- 
regation function, and M(8,) is invariant under 
changes of the decision-path for every profile (8,) in 
some domain D. For every individual ie N and every 
profile (8,)<€D, the following holds. For every 
AR-function 5°(# 8,) (with (81,..., 5%,...,8:) € D), 
D=M(8q), N) is at least as close to 8, as O*= 
M(ês, 8th Q). 


Informally, suppose that the priority-to-the-past de- 
cision process is path-independent for every profile 
in the domain D, and the aggregation function used 
satisfies the weak monotonicity condition just intro- 
duced. Then, for every individual i, the following holds. 
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Regardless of the AR-functions expressed by the other 
individuals (where all resulting profiles are in D), if 
individual i expresses his or her truthful AR-function 
6, the outcome set produced by the decision process is 
at least as close to his or her most preferred outcome 
set as the one produced if he or she expresses any 
alternative, “untruthful” AR-function 5. 

So, if all individuals’ preferences satisfy Assump- 
tion 1, no individual has an incentive to express an un- 
truthful AR-function at any profile in D. More strongly, 
in the domain D, expression of truthful AR-functions 
is a (weakly) dominant strategy for every individual. 
Thus, in D, expression of truthful AR-functions by 
all individuals is a (weakly) dominant-strategy Nash 
equilibrium. In short, if a priority-to-the-past decision 
process M(8, Q) is path-independent for every profile 
in the domain D, then it is strategy-proof in D. 


Strategy-Proofness in D. For every individual ie N 
and every profile (8,)¢D, the following holds. For 
every AR-function 67(#8,) (with (8,..., 8%,..., 
ôn) ED), OR, *, where D=M(d4), R) and d*= 
M(6(s,,8%,- 8x) &)- 


Corollary of Theorem 6. Suppose that 8 is a weakly 
monotonic aggregation function, and D is a domain 
of profiles. If M(8, Q) is invariant under changes of the 
decision-path for every (8,) € D, then M(8, Q) is strategy- 
proof in D. 


Although related to the Gibbard—Satterthwaite the- 
orem, Theorem 6 and its Corollary make a subtly 
different point. The Gibbard—Satterthwaite theorem 
shows that certain minimal conditions on preference 
aggregation imply a violation of strategy-proofness.8 
My result shows that path-independence of a se- 
quential decision process implies strategy-proofness. 
A question for future research is whether closer ana- 
logues of the Gibbard—Satterthwaite theorem can be 
established in the context of decisions on multiple 
propositions. 

Despite the earlier impossibility results (Theorems 4 
and 5), Theorem 6 and its Corollary are not vacuous. 
As discussed below, if the aggregation function is the 
unanimity rule or a dictatorship of one individual, the 
resulting priority-to-the-past decision process is always 
path-independent—and thus strategy-proof. If the ag- 
gregation function is majority voting, but the domain 
of admissible profiles is suitably restricted, then the re- 
sulting decision process is also path-independent, and 
thus strategy-proof, in this restricted domain. 


ESCAPE ROUTES FROM 
PATH-DEPENDENCE AT A 
COLLECTIVE LEVEL 


Suppose again that all individuals’ views are comp 
lete, weakly consistent, and deductively closed. By 


28 The social choice function satisfies unrversal domain, satisfies non- 
dictatorship, has at least three alternatives in its range, and always 
produces a determinate winner. 


Theorems 4 and 5, there exists no anonymous or even 
just nondictatorial propositionwise aggregation func- 
tion such that a priority-to-the-past decision process is 
decisive and path-independent for every profile in the 
universal domain. To avoid path-dependence it is nec- 
essary to relax at least one of the conditions underlying 
this impossibility result: universal domain, nondictator- 
ship, decisiveness, or propositionwise aggregation. By 
Theorems 1 to 3, these conditions must be relaxed so 
as to allow the existence of an aggregation function 
that always generates a weakly consistent and deduc- 
tively closed (hence strongly consistent) collective AR- 
function. 


Relaxing Decistveness: The Special 
Support Approach 


This escape route relaxes the requirement that the 
decision process should produce a determinate ver- 
dict on every proposition, and allows that, for some 
propositions, neither the proposition nor its negation 
is accepted. The group’s propositional attitude on each 
proposition can then be defined by the unanimity rule 
or by a supermajority rule. 


Definition. The unanimity rule is the aggregation 
function 8: {0, 1}" > {0, 1} defined as follows. For 
any (d1, d2,..., da) E {0, 1}", 


1 ifd, = l foreveryi eN, 


di, h, ..., da) = 
ee dh) 0 otherwise. 
On this rule, for each proposition ¢, the group has 
a disposition to accept @ if and only if every individ- 
ual accepts ¢. The group’s propositional attitudes are 
then weakly consistent and deductively closed (hence 
strongly consistent), but not necessarily complete.” 
They are incomplete whenever there is a lack of una- 
nimity on some propositions and their negations. The 
resulting priority-to-the-past decision process is invari- 
ant under changes of the decision-path, but not gen- 
erally decisive: It fails to produce a verdict on those 
propositions that are neither unanimously accepted 
nor unanimously rejected by the individuals. The ap- 
proach gives veto power to every individual and is, thus, 
prone to “stalemate.” Can a less demanding superma- 
jority requirement be used instead? 


Definition. Supermajority voting with parameter q is 
the aggregation function ô: {0,1}"—> {0, 1} defined as 


follows. For any (dj, d2,...,d,) € {0, 1}*, 
1 ff d> qn, 
ê(di, d, ..., dp) = 1eN 
0 otherwise. 


2 Given that mdividual views are complete, weakly consistent, and 
deductively closed. 
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For a result on supermajority voting, note that the set 
of propositions X can be partitioned into 2m equiva- 
lence classes of logically equivalent propositions, where 
m> 1. The number of equivalence classes is even 
because, as assumed, X always contains proposition- 
negation pairs. It is greater than one because, as as- 
sumed, X contains more than one such pair. 


Theorem 7. Let 8 be supermajority voting with param- 
eter (m — 1)/m (where there are 2m equivalence classes 
of logically equivalent propositions in X). Then 6 satis- 
fies anonymity and induces, for every (8,) € U, a strongly 
consistent (but not generally complete and deductively 
closed) collective AR-function 8,,). 


Under supermajority voting with parameter 
(m—1)/m, the group has a disposition to accept each 
proposition ¢ if and only if a proportion of more than 
(m—1)/m of the individuals accepts ¢. The group’s 
propositional attitudes are then strongly consistent but 
not generally complete and deductively closed. They 
are incomplete whenever some propositions and their 
negations lack the required supermajority. They violate 
deductive closure whenever there is no supermajority 
on some implications of propositions that each get a su- 
permajority. The resulting priority-to-the-past decision 
process is not strongly path-dependent (as the group’s 
propositional attitudes are strongly consistent). It may 
be weakly path-dependent (as the propositional atti- 
tudes may violate deductive closure), and it may be 
indecisive (as they may violate completeness). The ap- 
proach does not give veto power to every individual, 
but it gives veto power to every group of a proportion 
of 1/m or more of the individuals. It may thus still be 
prone to “stalemate,” albeit less so than the unanimity 
approach. 

Theorem 7 mirrors results by Craven (1971) and 
Ferejohn and Grether (1974) in the context of pref- 
erences showing that suitable supermajority rules can 
generate acyclical collective preferences; it is not a 
direct corollary of these results, because of the dif- 
ferences between views on multiple interconnected 
propositions, on the one hand, and preferences, on the 
other cone between the associated consistency condi- 
tions. 


Relaxing Nondictatorship: The Dictatorship 
Approach 


This escape route not only relaxes the requirement that 
all individuals should have equal weight in determining 
the group’s propositional attitudes, but also allows the 
existence of a dictator. By Theorem 5, a dictatorship of 
one individual is the unique aggregation function that 
guarantees decisiveness and path-independence for all 


X At least ifn >m [ie if (n/m) > 1]. 

31 For example, strong consistency, on the one hand, and acyclicity, 
on the other. The relations between consistency conditions in the 
two frameworks are discussed in List and Pettit 2003 
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profiles in the universal domain (where the dictator’s 
views are complete, weakly consistent, and deductively 
closed). Under a dictatorship, the outcome set of a 
priority-to-the-past decision process is the set of propo- 
sitions accepted by the dictator and will typically fail to 
reflect the views of any other individuals. Here path- 
independence is achieved at the cost of democratic 
responsiveness. 


Relaxing the Universal Domaln: The 
Domain Restriction Approach 


This escape route relaxes the requirement that the de- 
cision process should accept as admissible input any 
logically possible profile of individual AR-functions. 
Suppose that not all such profiles will occur in col- 
lective decisions, for one of the following reasons. 
(i) Coercive reasons: Certain profiles are explicitly 
ruled out by restrictions on the views individuals are 
permitted to express. (ii) Empirical reasons: Certain 
profiles do not occur in practice, as a matter of con- 
tingent fact. (iii) Deliberative reasons: Although all 
logically possible profiles can in principle occur, the 


_ individuals engage in group deliberation prior to a col- 


lective decision, where such group deliberation trans- 
forms the individual views and thereby restricts the 
domain of postdeliberation profiles. 

Parallel mechanisms of domain restriction have been 
discussed in relation to classical models of social choice 
(Dryzek and List 2003; Knight and Johnson 1994; 
D. Miller 1992). I here remain neutral on how feasi- 
ble such mechanisms are. I identify a structure con- 
dition with the property that, if the domain of ad- 
missible profiles includes only ones satisfying that 
condition, then a priority-to-the-past decision process 
is path-independent. The condition—called unidimen- 
sional alignment—is similar in spirit, but not in form, 
to Black’s (1948) condition of single-peakedness in the 
context of preferences. A profile satisfies unidimen- 
sional if there exists a single ordering of all 
individuals from left to right—a structuring ordering— 
-such that, for every proposition in X, the individuals 
accepting the proposition are either all to the left or all 
to the right of those rejecting it. The profile shown in 
Table 2 satisfies unidimensional alignment. The corre- 
sponding structuring ordering is 4,1, 5, 2,3. Individual 5 
is the median individual, as defined below. 

For a formal definition, fix a profile (8,). For each 
geXxX, let N be the set of individuals accepting 
ġ, and Nrerect-s the set of individuals not accepting @, i.e., 


TABLE 2. Unidimenslional Alignment 
Individual No. 
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Nacy: = {iEN: 8(p)=1} and Nypececeg = {iE N: 
3,(¢) =0}. A structuring orderingis a linear ordering 
œ on the set of individuals N, where iwj means that 
individual i is to the left of individual j. For any two 
subsets N1, N2 CN, I write NjwN2 as an abbreviation 
for [for all i € N; and all j € N2, i wj ],i-e., the individuals 
in N are all to the left of those in N2. 


Definition (List 2003). A profile (8,) satisfies unidi- 
mensional alignment if there exists a structuring order- 
ing w such that, for every ¢ € X, either Noccep-¢@Nreect¢ 


Definition. The unidimensional alignment domain, R, 
is the set of all logically possible profiles of individ- 
ual AR-functions satisfying completeness, weak con- 
sistency, deductive closure, and unidimensional align- 
ment. 


Suppose that a profile (ô,) satisfies unidimensional 
alignment. Order the n individuals along a correspond- 
ing structuring ordering, say w. There are two cases. 


nisodd. Individual m is the median individual with re- 
spect to w if there are as many individuals to m’s left as 
to m’s right, formally, {ie N: i@m}|=|{ieN:moi}|. 
By unidimensional alignment, the median individual 
shares the majority view on every proposition. Hence 
the collective AR-function induced by majority vot- 
ing is the AR-function of the median individual, ôm. 
Provided that the median individual’s AR-function is 
complete, weakly consistent, and deductively closed, 
so is the collective AR-function. 


n is even. Here there exists no single median indi- 
vidual, but a median pair. Individuals mı and m are 
the median pair with respect to w if they are adjacent 
(with m left of m2) and there are as many individu- 
als to my’s left as to m?’s right, formally, (i) mı @ m, 
(ii) there is no ie N such that mı wi and iwm, and 
(iii) {ie N:iw@m,}|=|{ieN:mzqi}|. By unidimen- 
sional alignment, if (and only if) the median pair of 
individuals agrees on a proposition, then their view 
is also the majority view on that proposition. So the 
collective AR-function induced by majority voting is 
the product (the “intersection”) of the AR-functions 
of the median pair of individuals, ôm, 3m, Provided 
the AR-functions of the median pair are each complete, 
weakly consistent, and deductively closed, the collec- 
tive AR-function is also weakly consistent and de- 
ductively closed. It may violate completeness, namely 
when the median pair of individuals disagree on a 
proposition ¢.™ But this happens only if there is a 
majority tie, i.e., only if |Noccep-¢| = |Nrayect-¢|- Thus the 
collective AR-function is “almost” complete. 


Definition. Given a profile (8;), 8) is almost com- 
plete if, for any pe X, eee F |Nrapect¢| io that 
5)($) =1 or 8a (H) = 


* Note that this definition permits Naccept-y = Ø or Nrepect-¢ = Ø. 
D Here jm is defined as follows: for any 6X, den ôm ($) = 
Ben (Xim (P) 

~ In that case, Sm ($) Æ m (9), 30 Sm (P) Smp (P) = 0 = San (~A) 
öm (9). 
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Proposition 6 (List 2003). Let & be propositionwise 
majority voting. Then & satisfies anonymity and in- 
duces, for every (8,) eR (with structuring ordering w), 
an almost-complete, weakly consistent, and deductively 
closed collective AR-function å}, where: 


(i) Ifn is odd, 55) = ôm, where m is the median indi- 
vidual with respect to w. 

(ii) If n is even, 84) = ôm ôm, Where my and mz are 
the median pair with respect to œ. 


This result mirrors Black’s (1948) median voter the- 
orem, albeit in a new context; it is not a corollary of 
Black’s result, as unidimensional alignment and single- 
peakedness are formally different (for a more general 
domain restriction result in a preference context, see 
Sen 1966). By Proposition 6, for any profile satisfying 
unidimensional alignment, the group’s propositional 
attitudes, determined by majority voting, are almost 
complete, weakly consistent, and deductively closed. 
I next show that the resulting priority-to-the-past de- 
cision process is path-independent. It is not generally 
decisive if n is even, but “almost” decisive. 


Definition. Given a profile (8;), M(8ig), Q) is almost 
decisive if, for every ġ € X, |Naccepe-¢| 4 |Nreect-o| implies 
that ge M (êi) Q) or =p eE M(S«a), Q). 


Theorem 8. Let 8 be propositionwise majority voting. 
Then 6 atisfies anonymity and, for any (8,)€R (with 
structuring ordering w), M(8i3, &) is almost decisive 
and invariant under changes of the decision-path, where: 


(i) Ifnisodd, M(5q),2) = {p E€ X:8m(p) =1}, where 
m is the median individual with respect to œ. 

(i) If n is even, M(3q), 9)={0 EX: 8m (H)Sm 
(¢) =1}, where m; and mz are the pair 
with respect to a. 


So if the group uses majority voting as its aggregation 
function, then the resulting priority-to-the-past deci- 
sion process is almost decisive and path-independent 
for every profile in the restricted domain R. The out- 
come set of the process is the set of propositions ac- 
cepted by the median individual (if n is odd) or the 
intersection of the sets of propositions accepted by the 
median pair of individuals (if n is even). 

Is unidimensional alignment just an artificial condi- 
tion, or can it be met in plausible situations? Suppose, 
first, that different individuals disagree on the proposi- 
tions, but they reach some “meta”-agreement on what 
their disagreement is about: They agree on a single 
dimension (e.g., from “most liberal” to “most conser- 
vative”) on which their positions can all be placed; each 
individual takes a position on that dimension. I call it a 
left-right dimension, but different interpretations are 
possible. Suppose further that, for each proposition, 
the extreme positions on the left—right dimension cor- 
respond to either clear acceptance or clear rejection of 
the proposition, and there exists an “acceptance thresh- 
old” on the dimension (possibly different for different 
propositions) such that all individuals to the threshold’s 
left accept the proposition and all individuals to its 
right reject it (or vice versa). These conditions entail 
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unidimensional alignment; i.e., meta-agreement of the 
form described can induce unidimensional alignment. 
This suggests that, to the extent that such meta- 
agreement is feasible, the escape route from path- 
dependence opened up by unidimensional alignment 
may have some promise. But, for my purposes here, 
the specific interpretation of unidimensional alignment 
is less relevant than the general insight that path- 
dependence can be avoided if there is a sufficient level 
of structure among the individuals’ views. 


pelexing Eropostlonn ee Aggregation: The 
General ggregation Approach 


In my model the group’s initial dispositions are 
defined by propositionwise aggregation, in analogy to 
Arrow’s independence requirement. The aggregation 
function maps each vector of zeros and ones across 
individuals to a corresponding single collective 
disposition of either zero or one. Although plausible 
in sequential decisions where separate votes are taken 
on each proposition, this is a restrictive definition 
of aggregation.» A generalized aggregation function, 
F, maps each profile (8,) to a collective AR-function 6: 
X — {0, 1}. Such a generalized aggregation function 
F provides an escape-route from path-dependence 
if and only if it generates a weakly consistent and 
deductively closed AR-function 6. 

A function F with this property can be defined by 
“feeding” each profile (8,) into an appropriate se- 
quential decision process. For each profile (8,), define 
F((3,)) := ôr by: 


1 if ọ E€ M(êa)» Qr), 


foreachgex, 3 = | Sta 


where ô;ą) is the result of applying propositionwise ma- 
jority voting to (ô,} and Qr is some fixed decision-path. 
By definition, ôr is (almost) complete, weakly consis- 
tent, and deductively closed for any profile (8,) € U. 
Suppose now that ôr is itself used as an AR-function 
in a sequential decision process. The weak consistency 
and deductive closure of ôr imply that M(ér, Q) is path- 
independent. In this technical sense, the generalized 
aggregation function F provides an escape route from 
path-dependence. 

Some aggregation functions that have been pro- 
posed for solving the “doctrinal paradox” are of this 
form. The “premise-based procedure,” for example, 
applies majority voting to the premises and decides 
other propositions based on what the votes on the 


3 My assumption of propostionwise aggregation us demanding, re- 
quinng that (i) the group’s propositional attitude on each proposi- 
tion be determmed by aggregation over the indrvidual views on that 
propontion (independent of other propositions), and (ii) the same 
aggregation function be used for all propositions. Pauly and van 
Hees (2003) and Dietrich (2003) have recently investigated weaker 
assumptions of propositionwise aggregation—e.g., keeping (1) but 
droppmg (i1}—and obtained results broadly similar to Propositions 4 
and 5, suggesting that results similar to Theorems 4 and 5 continue to 
hold. So even a weaker assumption of propositionwise aggregation 
may not provide an escape route from path-dependence. 
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premises imply (Chapman 2002; Pettit 2001a). This can 
be represented as a generalized aggregation function 
F, defined in terms of a decision-path Qr where the 
premises occur before the conclusion. 

Now the decision process M(ér, Q) is path- 
independent after êp has been determined in the 
manner outlined. But the path-dependence problem 
has not been resolved; it has been shifted one level up. 
The path-independence of M(ér, 2) has been achieved 
at the cost of making the aggregation function F used 
for defining ôr itself “internally” path-dependent. As 
F is defined by a sequential decision process—i.e. 
ôr (¢)=1 if and only if ¢€ M(3,), Qr)—the collective 
AR-function é- is not generally invariant under 
changes of the path Qr used for defining F. All the 
problems I have discussed—particularly strategic 
manipulability—will therefore recur at the level of the 
aggregation function F. 

So, if the aim is not just to shift path-dependence one 
level up, the escape route via relaxing propositionwise 
aggregation seems not very promising. An open chal- 
lenge is to find a generalized aggregation function that 
both relaxes the assumption of propositionwise aggre- 
gation and is not itself “internally” path-dependent in 
any problematic way. 


CONCLUDING REMARKS 


I have modeled sequential decisions over multiple in- 
terconnected propositions and illustrated my model 
by investigating path-dependence in such decisions. 
The model is intended to complement, not replace, 
existing social-choice-theoretic models. It seeks to cap- 
ture a class of decisions that are not straightforwardly 
captured by classical models: reason-based decisions, 
where the focus is not only on outcomes, but also on un- 
derlying reasons, beliefs, and constraints. Let me briefly 
summarize my main substantive results. 

I have shown that certain violations of perfect ra- 
tionality by the relevant agent are necessary and suffi- 
cient for path-dependence. I have discussed the impli- 
cations of this result for both individuals and groups. 
Whereas a perfectly rational individual can avoid path- 
dependence, a boundedly rational individual may be 
susceptible to it. An example is an agent who never 
accepts a proposition and its negation simultaneously 
but who fails to foresee all the logical implications of 
the propositions he or she accepts. As path-dependence 
makes the agent manipulable by the presentation of 
the propositions in some strategic order, this finding 
suggests a possible link between bounded rationality 
and Rikerian “heresthetics,” the art of political ma- 
nipulation. Exploring this link may be an interesting 
avenue for future work. 

Path-dependence is particularly serious at the col- 
lective level. Whereas individuals might try to avoid 
path-dependence through a self-imposed “discipline” 
of rationality, no such option is easily available to 
groups. Under certain conditions, any group that deter- 
mines its propositional attitudes by aggregation over its 
members’ views will necessarily run the risk of those 
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rationality violations that lead to path-dependence. 
Path-dependence makes collective decisions vulner- 
able to manipulation both by agenda-setting and by 
expression of untruthful views. I have identified some 
formal escape routes from path-dependence at the col- 
lective level, and future research might ask how sub- 
stantively promising these routes are. 

My results are also relevant to the debates on col- 
lective consistency referred to above. As noted, so- 
cial choice theorists have traditionally been skeptical 
toward the feasibility of a consistent collective set of 
views. Related to the classical results on collective in- 
consistencies—usually defined as voting cycles—there 
are also several results suggesting that collective out- 
comes are inherently unstable. This, in turn, can be 
strategically exploited. Schwartz’s (1981) universal in- 
stability theorem, for instance, shows that, under mild 
assumptions, any collective outcome involving certain 
generalized exchange among agents is unstable: Some 
group of agents has the preference and power to over- 
turn that outcome in favor of some other outcome. 
Further, some results show that, even when equilibria 
exist in politics, these may not be unique, which raises 
the problem of equilibrium selection. The Muller— 
Satterthwaite theorem (Muller and Satterthwaite 1977; 
Myerson 1996), for instance, suggests that, if we in- 
terpret a political system as a voting game and there 
are three or more possible outcomes, then, absent a 
dictator, the game does not generally have a unique 
equilibrium (if any). 

Are my results yet another addition to the large set 
of social-choice-theoretic inconsistency and instability 
results? Not quite. Given a decision-path, the outcomes 
of a priority-to-the-past decision process in my model 
are neither inconsistent nor unstable. They are con- 
sistent, as the priority-to-the-past rule does not per- 
mit the acceptance of propositions that conflict with 
propositions accepted earlier. They are stable, as 
propositions, once accepted, are not overruled by the 
priority-to-the-past rule; moreover, each separate de- 
cision along the path is itself binary. The constraints 
implemented by the priority-to-the-past rule may thus 
reduce the continual drift of collective outcomes that 
classical models predict in unconstrained settings. All 
this comes at a price—path-dependence: A group can 
generate consistent collective outcomes in a priority- 
to-the-past decision process; it can do so in a rel- 
atively democratic manner, by using an aggregation 
function such as majority voting; but the decision 
outcomes may be path-dependent and thus affected 
by the problems I have identified, such as strategic 
manipulation. 

One might speculate whether my results suggest 
another possible answer to Tullock’s (1981) question 
“Why so much stability?” Perhaps political outcomes 
are stabilized not only by constraints created by insti- 
tutional arrangements—as in a structure-induced equi- 
librium (Shepsle 1979}—but also by constraints created 
by prior decisions or commitments—as in a priority-to- 
the-past decision process. Based on my research here, 
however, it would be premature to propose a new equi- 
librium concept, but there are interesting avenues for 
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future research. For example, see Page’s (2003) work 
on path-dependent equilibria. 

How threatening is path-dependence? Some fac- 
tors might make it less of a threat. A decision’s 
subject-matter might itself single out the appropriate 
decision-path. Some propositions might be unambigu- 
ously “weightier than,” or “prior to,” others, and the 
order in which the propositions are to be considered 
might be uncontroversial. Alternatively, history or em- 
pirical contingencies might determine the decision- 
path, with few opportunities for intervention by agents 
with a strategic interest. But even then it is worth asking 
whether or not the given path makes a difference to the 
outcome. If there is no path-dependence, the legitimacy 
of an outcome will be under no threat, whether or not 
the decision-path is disputed; that path is simply irrele- 
vant. But, if there is path-dependence, a justification of 
the chosen path is crucial. Further, in such cases, even 
if agents agree on a decision-path, this will not rule out 
manipulation by the expression of untruthful views. As 
shown, the mere existence of an alternative decision- 
path that would change the outcome—although that 
path is not adopted—may create incentives for strate- 
gic expression of untruthful views. So, curiously, path- 
dependence may matter even when the decision-path 
is fixed. 

I have already mentioned several questions for fur- 
ther research. Others include how far the present re- 
sults can be generalized to conflict resolution rules 
other than the priority-to-the-past rule; under what 
restrictions on the set of admissible, or naturally oc- 
curring, decision-paths there is no path-dependence; 
and whether any domain restrictions such as unidimen- 
sional alignment are empirically realistic. 

Improving our understanding of path-dependence is 
an important challenge in the theory of democracy. 
Many democratic decision processes are sequential, 
and hence it is important to learn whether, and how, 
the decision-path matters and what the implications of 
path-dependence are. 


APPENDIX 
Proof of Lemma 1: Let (i) and (ii) denote the left and 
right sides of the biconditional. 


(D bmpies (H). Suppose that (i) holds. Then :={peEX: 
($) =1} is logically inconsistent. Let Y, be a maximal con- 
sistent subset of . Then Y4 # Ø, as X contains no contradic- 
tions; Y2 + ®, as ® is not consistent. Choose any ġ € O\ Ya. 
As W is a maximal consistent subset of ®, 4 U{¢} is not 
logically consistent; so Y2] = —¢. Let Y: = {¢}; Wis logically 
consistent, as ¢ is not a contradiction. Then Y}, and Y% have 
the properties required by (i1). 

(if) implies (i). Suppose that (ii) holds As W,|=@ and 
| =-4, the set Y U Y is logically inconsistent. But ® D> 
Y U Wy. So & is also logically inconsistent, and (i) holds. m 


Lemma 2. For any ¢€X, (i) there exists a dectsion-path Q 
such that ¢ € M(8, Q) if and only if (ui) there exists a logically 
consistent subset Y C X such that (¥)=1 and W|=¢. 


Proof of Lemma 2: (i) implies (i). Suppose that (i) holds. 


Let Q be a decision-path such that ¢ € M(8, Q). Choose t such 
that ¢ is accepted at time ¢ in M(4, 2). Let P={w EX: 
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(y ) =1 and x is accepted at some time s < t in M(8, Q)}. As 
¢ is accepted at time ż, either 8(¢)=1 or V|=¢. If 8(@) = 1, 
then {¢} has the properties required by (ii): {¢} is logically 
consistent, as œ 1s not a contradiction. If ¥|=¢, then Y has 
the properties required by (ii): Y is logically consistent, as 
W CM(8, Q), which is logically consistent. 

(if) implies (i). Suppose that (ii) holds. Define Q as follows. 
Let tr=|WU {¢}]. On {1, 2,..., t}, let Q be any one-to-one 
mapping from {1, 2,..., t} onto ¥ U {¢} such that Q() =¢. 
To continue the path R on {f+1,..., k}, let Q be any one-to- 
one mapping from {t+1,. 7 onto X\(Y U {¢}), where Í 

= |X]. Then Q has the properties required by (i). 


Proof of Theorem 1: Theorem 1 follows immediately 
from Lemmas 1 and 2. E 


Lemma 3. Suppose that M(8, Q) is invariant under changes 
of Q. Then, for every peX (and any decision-path Q), 
pe M(8, Q) if and only if 8(¢)=L 


Proof of Lemma 3: Let eX. Suppose that M(ô, Q) is 
path-independent. We can use any decision-path Q. For each 
ge X, define Ry such that Q,(1):=¢, and Q,(2):= 4; on 
{3, ..,k}; let Q, be any one-to-one mapping from {3,.. 

onto X \{¢, ~p}, where k = |X]. Suppose that 8(¢) =1. ee 
by the definition of M(8, R4), ¢€ M(8, Qy). Suppose, con- 
versely, that de M(3, Q4), but ($) =0. As Do does not entail 
$, $ ¢ dı and ¢ ¢ d2. But ¢ or —¢ do not occur elsewhere 
in Ny, s0 ¢ ¢M(3, N4), a contradiction. Hence 8(¢) = 1, as re- 
quired. | 


Proof of Theorem 3: Let (i) and (ii) denote the left and 
right sides of the biconditional. Suppose that 8: X — {0, 1} is 
strongly consistent. 

(i) implies (ii). Suppose that there exist Q; and R2 such that 
p e M(8, R1) and ¢ ¢M(4, R2), but 3 is deductively closed with 
respect to ¢. As å is strongly consistent, M(8, Q) is weakly 
but not strongly path-dependent. As ¢ and N; satisfy (i) in 
Lemma 2, there exists a logically consistent ® C X such that 
8(©) =1 and &|=¢. Then 6(¢) =1, by deductive closure of 
8. Choose t such that 22(t) =¢. Under path M, at time t, 
¢ is not accepted, as ® ¢ M(4, Q2). As 5(¢) =1, this requires 
,_1|=—#. By the definition of M(8, R2), ~p € M(8, 22); s0 
M(6, Q) is strongly path-dependent, a contradiction. Hence 
6 is not deductively closed with respect to ¢. 

(i) implies (i). Suppose that 8 violates deductive closure 
with respect to ¢ € X, but M(é Q) is path-independent with 
respect to ¢. There exist a logically consistent dC Xandg@eX 
such that 8(®) = 1, d| = ¢ but (¢) = 0. By Lemma 2, there ex- 
ists Q such that ¢ € M(8, Q). As M(8 Q) is path-independent, 
pEeM(8, Q) for every Q. Consider N4 as in the proof of 
Lemma 3. As 8(@)=0 and not Bo|=¢, ¢ ¢ Dı and ¢ ¢ ® 
for Qy. But ¢ or —¢ do not occur elsewhere in Rg, 80 ¢ ¢M 
(5, Qy), contradicting ¢ € M(8, Q) for every Q. Hence M(8, ge 
is weakly path-dependent with respect to ¢. 


Proof of Theorems 4 and 5: Let (i) and (ii) denote the 
left and right sides of the biconditional in Theorem 5. Theo- 
rem 4 follows from the proof that (i) implies (ii). 

(i) implies (i). Let 3 be an aggregation function such 
that, for every (8,)€U, M(8q), R) is decisive and path- 
independent. By Lemma 3, for every ¢ € X, ¢ € M(8y), Q) if 
and only if 84) ($) =1. As M(dq,), Q) is decisive and logically 
consistent, 54) 18 complete and strongly consistent, hence 
weakly consistent and deductively closed. So 6 induces, for 
every (4,) € U, a complete, weakly consistent, and deductively 
closed 84,. By Proposition 4, ô violates anonymity. More 
strongly, by Proposition 5, 8 is a dictatorship of some i € N. 
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(if) implies (1). Let 5 be a dictatorship of some i € N. Take 
any (8) € U. Then 34) =4,. Because ô, is complete, weakly 
consistent and deductively closed (hence strongly consistent), 
so 18 84). By Theorems 2 and 3, M (w), Q) is invariant under 
changes of the decision-path. The decisiveness of M(3y), 2 
follows from the completeness of 5,,). 


Proof of Theorem 6: Suppose that 8 is weakly mono- 
tonic, and M(8, Q) is path-independent for every (8,) € D. 
Take so ieN, any (8)e€D, and any & 44, (with 
(81,...,d%,...,4n) ED). I prove that b= MOS p») is at 
least as close to 8, as O*=M(Su,, vara) 2 D. By path- 
independence of M(8, Q) in D, Lemma 3 implies that, for 
every EX, [ġ € M(8, Q) if and only if 6(¢)=1] and [Pe M 
(8, Q) if and only if 6*(¢)=1], where d:=3y) and & := 
5u;, a, .&)- I must show that, for every ¢ € X, 5,(¢) =8*(¢) 
implies (9) 5), i.e., |8($) — AA < |8* ($) —8,(¢)|. Take 
any pEX. Let dı:= 3:(¢), dz = 82($),...,dn = 5,(9), 
and d: = ($). Then 3(¢) = 8 (1(¢),.-., ECP), -> 8n($)) = 
te » de) and 5°(4) = (81 (Ø), (O), -><> Se 
(¢))=8 ae Pis da): an weak monotonicity of & 
implies that |8(d,,. a d,) — d| < |8(di,...,a,... 

d») — d|, and thus I($) -5 (9) < < |8" ($) —5(4)I, as required 


Proof of Corollary of Theorem 6: Suppose that 8 is 

monotonic, and M(8,Q) is path-independent 
for all (8,)€D. Take Y iceN, any (3,)¢€D, and any 
& (#8) (with (8,...,8%,...,64)€D). I show OR, D", 
where P= M(8q), Q) and o* = M(Sy,, i, bn)» Q). By 
Theorem 6, ® is at least as close to å, as ®*. By Assumption Í, 
OR, D". 


Proof of Theorem 7: Assume Theorem 7’s conditions. 
Define Naccepre and Nreecrg a8 10 the section Relaxing 
the Universal Domain, above. Let 8 be supermajority 
voting with parameter (m—1)/m; & satisfies anonymity. 
Take any (8,)€U. Let O:={PEX : 8y)(¢)=1} be the 
set of propositions accepted under the aggregation 
function ô for (ô). I show that, for some :éWN, 
D C{peEX: 3,(¢)=1}. As each ô is weakly consistent, 
there is no ġexX such that @ and -—¢ both obtain 
(m—1)/m supermajorities, so dy) is weakly consistent. 
Partition ® into m* equivalence classes of logically 
equivalent propositions As $y) is weakly consistent, 
® contains at most one member of each proposition- 
negation pair, so m*<m<2m. Let i, r... Pme EP 
be representatives for these equivalence classes. Now 
INaccept-p |) Naccept#y|)-»->|Naccepeo.«|>n(m—1)/m. Then 
\Naccept- N Nacen l > "(m—1)/m—n/m = n(m—2) 
m=n(m — 3)/m. Continuing, |Noccepns, N Nace N -N 
Naccept-tys| > N(m-m*)/m. De n (m— m*)/m> 
0, hence Vn r AN |>0. So for 
some iEN, ieN, » N Naccept4u But 
every pe is ees to one of is $a,- me. 
As 4, is complete, weakly consistent, and deductively 
closed, it follows that, for every ge, 1 € Naccepe-¢ 
hence OC {pe X:8,(¢)=1}. As 4, is strongly ee 
so is Sig). 


Proof of Theorem 8: Let be majority voting. By Proposi- 
tion 6, ô satisfies anonymity and, for any (3,) € R, 3g, is almost 
complete, weakly consistent. and deductively closed, thus 
strongly consistent. Take any (6,) € R. By Theorem 3, M(éa,, 
Q) is path-independent. By Lemma 3, M(84, Q)={¢eEX: 
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3u)(¢)=1}. The almost-completeness of 84) implies the 
almost-decisiveness of M4(5,,), Q); (i) and (ii) follow from (i) 
and (ii) in Proposition 6. | 
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ender differences in vote choice, opinion, and party identification have become acommon feature 
of the American political landscape. We examine the nature and causes of gender differences 
in partisanship using a time series approach. We show that gender differences are pervasive— 
existing outside of the context of specific elections or issues—and that they are a product of the interaction 
of societal conditions and politics. We find that from 1979 to 2000, the partisan gender gap has grown when 
the political climate moved in a conservative direction, the economy deteriorated, and the percentage 
of economically vulnerable, single women increased. The gender gap is likely to be a continual feature 
of the American political landscape: one that shapes everything from elite political behavior to election 


outcomes. 


both converged and diverged at different times 

in history. The late 1970s, for example, saw a 
period in which men and women shared similar levels 
of Democratic and Republican partisan support. Since 
the early 1980s, however, partisan preferences have 
diverged. The divergences are fundamentally politi- 
cal, often deciding election outcomes, helping structure 
policy debates and election campaigns, and also con- 
tributing to policy outcomes. Each of these may in turn 
either fuel or diminish the original differences in the 
partisanship of men and women and lead to a politics 
more or less divided on the basis of gender. Politicians, 
for example, may be well served by reaching out and 
supporting issues designed to minimize differences in 
support from men and women, especially when the 
parties find themselves in rough parity. Electoral moti- 
vations may also lead politicians to support policy that 
exacerbates gender differences when gaps work in their 
favor, further contributing to them. To the extent that 
partisan differences across genders are persistent, these 
implications become more profound, characterizing 
the political landscape for long periods of time. Conver- 
gence in the partisan preferences of men and women 


T: partisan preferences of men and women have 
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also has political consequences. It indicates a lack of 
polarization of the parties on issues relevant by gender, 
suggesting an equality of consideration and influence 
that are often associated with “good” democracy. 

We know that the partisan gender gap matters for 
the conduct and outcomes of politics and that its po- 
tential for divisiveness is large. Media, politicians, and 
pundits have noted the differences, yet surprisingly the 
gender gap has not been systematically illustrated over 
time and little has been offered to explain the patterns 
of convergence and divergence in party identification 
by gender over time. Candidate- or election-based ex- 
planations alone cannot explain gender differences or 
their persistence outside of elections. And studies that 
focus on individual partisan and opinion differences 
within an election (or even a short panel) cannot ad- 
dress the sources of variation over time. 

We examine the role of social and demographic 
trends as well as the economic and political roots of the 
partisan gender gap from 1979 to 2000. In doing so we 
address a set of related but also more general questions 
than are typically addressed in the literature on the 
gender gap. We begin with two descriptive questions. 
First, how general are gender differences: Do they ex- 
ist outside of specific elections and policy issues? Sec- 
ond, how enduring a feature of the political landscape 
are gender differences? To answer these questions, we 
present over-time gender differences in partisanship 
on a quarterly basis using survey data from CBS News 
and The New York Times since the late 1970s and assess 
the persistence of gender differences over this period. 
Answers to these questions speak to the nature and 
significance of gender differences and their potential 
for affecting, perhaps structuring, the play of politics in 
the short and long run. Then we turn to explanations 
for these differences and ask: Why does the partisan 
gender gap fluctuate over time? We expand on existing 
explanations, specifically asking, What are their over- 
time implications? To test the hypotheses, we model the 
dynamics of the partisan gender gap and ask how 
the political, social, and economic environments affect 
the dynamics during periods of “routine” politics as 
well as during elections. An understanding of gender 
differences and their over-time dynamics alerts us to 
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FIGURE 1. 


Democratic Partisanship by Gender 


1977 1979 1981 1983 


Note. Data represent the 
Data are q 
1996 Q1, 1997 Q1, and 1997 Q2, Interpolated data were used. 


Democratc identhers of all 


the factors that promote a politics more or less divided 
by gender. To the extent that politicians and society 
at large value convergence, answers to these questions 
can ultimately help us to build an environment in which 
political debates and outcomes are transparent with 


respect to gender. 


THE PARTISAN GENDER GAP OVER TIME 


The gender gap has been variously referred to as the 
differences between men and women in political opin- 
ions, voting behavior, and partisan identification. We 
know, for example, that women are less likely than men 
to support any form of aggression; they oppose capital 
punishment and support stricter gun control laws, and 
they are less likely to support involvement in foreign 
wars (Conover and Sapiro 1993). Women have also 
been more supportive of government involvement in 
the economy to ensure good jobs and a certain stan- 
dard of living, to reduce income differences, and to 
increase government services in general (e.g., Chaney, 
Alvarez, and Nagler 1998, and Erie and Rein 1988). 
In contrast, there appear to be no significant differ- 
ences in opinions on “women’s issues”—issues related 
to the Equal Rights Amendment, abortion, and other 
feminist concerns (Cook and Wilcox 1991; Klein 1984; 
Seltzer, Newman, and Leighton 1997), nor are there 
differences in the electorate between gender-role atti- 
tudes by party (Sanbonmatsu 2002). 

There are also well-documented differences in the 
voting behavior of men and women (e.g., Kaufmann 
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identifiers using CBS News and New York Times surveys. 


from the first quarter (Q1) of 1977 through the fourth quarter of 2000. Since no surveys were available for 1881 Q4, 





and Petrocik 1999, Kenski 1988, and Miller 1991). Gen- 
der differences in the vote often headline election night 
news coverage and are analyzed by a score of journal- 
ists and political scientists. In the 1992 election, for 
example, 41% of the men, compared with 45% of the 
women, voted for Governor Bill Clinton. The media 
suggested that Clinton owed his victory to women, and 
in 1996, much was made of President Clinton’s ability 
to maintain his support among women (“Year of the 
Soccer Mom”). 

Finally, there are differences in the partisan identi- 
fications of men and women (e.g., Bendyna and Lake 
1994, Kanthak and Norrander 2002, and Norrander 
1997, 1999). Typically these differences are presented 
in parallel with vote differences in individual elections 
or using biennial NES data. Snapshot differences in 
partisan preferences for any given election illustrate 
the average differences between men and women at 
isolated points in time. Biennial data reveal changing 
preferences but belie the continuous nature of gender 
differences and their response to routine politics. They 
further focus efforts to explain the gender gap on the 
tole of issues within elections. 

We investigate the partisan gender gap over 
time. The partisan preferences of men and women, 
measured quarterly from 1977 to 2000, are presented 
in Figure 1.* Several things are readily apparent from 
the figure. First, the partisanship of men and women 


1 The data are from the CBS News and New York Tunes (CBS/NYT) 
surveys from 1977 through the fourth quarter of 2000. The surveys 
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behaves differently, both in their levels and their re- 
sponsiveness. Second, these differences are continuous, 
existing outside of elections. Third, the percentages of 
both males and females claiming to be Democrats de- 
clined over much of this time period, with Democratic 
support rebounding, perhaps, in the late 1990s. Males 
led this decline in Democratic loyalties most sharply, 
with levels of Democratic partisanship dropping fairly 
steadily, from about 65% in the late 1970s to well under 
50% in the early 1980s. Since that time, Democratic 
support among men has hovered below 50%. The ranks 
of the Democrats among females have also dwindled 
from similar but slightly higher levels of support but 
have done so to a lesser extent, with over half of all 
females claiming to be Democrats over this period. 
This is consistent with Wirls’s (1986) observation that 
the gender gap represents “unequal rates of defection” 
from the Democratic party by each gender. While both 
men and women moved toward the Republican Party, 
women did so at a slower pace (see also Cook and 
Wilcox 1995 and Norrander 1998, 1999).? In the current 
political environment the gap is maintained by small 
movements toward the Democratic party, in particular, 
by women. Finally, we note that from the mid 1980s 
onward, the two series appear to be farther apart than 
at any previous time in the data. This gap i in major party 
preferences is maintained through 2000.3 

We present the gender gap itseli—the difference 
between the partisanship of men and women— 
aggregated by quarter in Figure 2. The gender gap has 
ranged from nearly zero to just over 10% between 1977 


we use were all done by telephone and include samples from the 
population of adults 18 or older and living in the contmental United 
States. The telephone numbers for each survey are selected by a 
variant of random digit dialing where random groups of area codes 
and exchanges are selected, from which the last four digits are then 
randomly selected. Contacted households are asked two questions 
whose answers are used to randomly select the adult interviewed. 
These are: How many adults live in the household? and How many 
of these are women? While CBS/NYT occasionally conduct panel 
surveys, only those contacted in the initial wave were included in the 
data. Thus the surveys are independent draws from the population. 
CBS/NYT asks the traditional party identification question, “In 
general, do you usually consider yourself a Democrat, a Republi- 
can, an Independent, or what?” Macropartisanship is expressed as 
one group of Democratic identifiers’ percentage of all those who 
identify with a party This is the standard way of expressing the 
macropartisanship series. This definition does not take mto account 
the movement of Independents (see Norrander 1997). Independents 
cannot be separated from “don’t know” or “other” because of the 
way the responses are coded by CBSYNYT over the full pernod 
Thus we leave consideration of the flows to and from the partisan 
to independence categories for future research. There are four quar- 
ters without surveys; we use Stimson’s (1998) dimensional analysis 
algorithm to estimate the missing time points. For 1979—2000, the es- 
timation sample of our multivariate analysis, on average, 3,241 adults 
(1,336 males and 1,905 females), responded Democrat or Republican 
to the party identification question in any quarter 
2 Kaufmann and Petrocik (1999), however, contend that women’s 
overall identification did not change during 1952-1996, They base 
their argument on the NES surveys conducted in presidential election 
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the south. Disaggregation of the CBS/NYT data by region finds that 


partisan gender differences are no less prevalent among nonsouthern 
women; the gender gap exists in both the nonsouth and the south 
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and 2000. Women were about 5% more Republican 
than men in the late 1970s and were about 5%—10% 
more Democratic than were men by the late 1980s and 
through 2000. The gap does not appear to grow steadily 
but increases substantially in 1979 and keeps increasing 
at aslower rate. Since all surveys have a margin of error, 
we include 95% confidence intervals (i.e., estimate + 
margin of error) in Figure 2.‘ For 1979—2000, the esti- 
mation sample of our multivariate analysis, the average 
margin of error is 4.2 percentage points at the 95% 
confidence level and 3.5 percentage points at the 90% 
confidence level. Figure 2 shows a clear trend in the 
partisan gender gap in that the confidence intervals are 
below or nearly below the null line for some quarters 
in the late 1970s and climb increasingly higher than the 
null line from the 1980s on. 

While stark changes in the nature of the gender 
gap have stimulated individual level research explain- 
ing why men and women behave differently or have 
different preferences, the dynamics of the gender gap 
have gone largely unexplored. 5 It is clear that the gen- 
der gap is dynamic and, importantly, that these dy- 
namics exist outside of elections. The partisan gender 
gap has grown in a period of Republican strength but 
appears to be maintained equally as the balance of 
partisanship has shifted toward the Democrats. Par- 
allel and counter movements in men’s and women’s 
partisanship, different rates of decline and rebound, 
produce a gender gap that tells a more interesting 
story than that women identify more with Democrats 
than do men; men and women react differently to 
the political environment. But it is not clear what 


4 The margin of error on these observed values rs not the sum of 
the sampling error in the two subsamples; it is smaller than this sum. 
The margin of error formula for the difference of two proportions is 
given by 
Pall — Pr) 

Nm 


where Ny ih heise peer np gE 
flers interviewed in surveys conducted within a given quarter, and 

and P are the percentage of Democrats in each group (Le., ea 
and male Democratic macropartisanship). Note that this margin of 
error is significantly smaller than the sum of the margms of error 
associated with each proportion. The reduction can be as large as 
29% according to one of van Belle’s (2002, p. 39) “statistical rules of 
thumb”: “Confidence intervals associated with statistics can overlap 
as much as 29% and the statistics can still be significantly different.” 
If we use this simple formula to place confidence intervals on the 
gender gap, we can see that the gender gap is nonzero over much of 
this thme period. Note that if we take any four consecutive quarters 
and consider the joint probability that the gender gap 1s zero, the 
probability is very low. This 1s true even in quarters with small Ns 
and a small estimated gender gap. In 1980, a time when the gap 
is relatively small, the probability 1s 0.00000761! See the archived 
materials for further details 

5 See Klein 1984 for an early effort at building a account 
Recently Sapiro and Conover (1997) used the 1992 U.S. election as 
a case study to focus on how to understand the fact that the degree 
or even presence of gender differences varies across time and place, 
Le., context. We focus on the dynamics of the gender gap rather 
than the inception of the gender gap. Some of the measures used in 
this paper are not available until the late 1970s or are only available 
annually prior to this ime Moving to annual data would reduce 
the number of time pomts for analysis, making reliable time series 
analysis impossible. 


Pr 1- Pr) 
Ny j 
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FIGURE 2. 


The Partlsan Gender Gap with 95% Confidence Intervals 


- August 2004 








Partisan Gender Gap (%) 


1977 1979 1981 1983 1985 1987 


Year/Quarter 
Note. Data represent the percentage of women tdentfying with the Democrats minus the percentage of men Id with the 
Democrats using CBS News and New York Times surveys. Data are q from the first quarter (Q1) of 1977 through the fourth 


quarter of 2000. Since no surveys were avaiable for 1981 Q4, 1996 Q1, 1997 Q1, and 1997 Q2, interpolated data were used. 


motivates these dynamics. What moves the partisan 
gender gap? What about the macropolitical environ- 
ment has affected men and women differently? These 
are important questions both for improving our under- 
standing of gender differences and for understanding 
the interplay of politics and gender as macropolitical 
events, 

These are the questions we begin to answer with this 
analysis. We use time series data on the partisan gender 
gap to test implications drawn from theories offered to 
explain static gender differences. Specifically, we con- 
sider what theories about the differential experience 
and autonomy of men and women imply for the dy- 
namics of the gender gap, with the goal of enhancing 
our understanding of the gender gap. 


A NEW DESIGN FOR EXPLAINING 
THE GENDER GAP 


Examining the dynamics of the partisan gender gap 
gives us a unique perspective on the causes of gender 
differences and offers several advantages over cross- 
sectional analysis. We explain our three design choices 
in this section: Why time series analysis? Why the par- 
tisan gender gap? Why the gap itself, rather than men’s 
and women’s individual partisanship? 

We have argued that understanding changes in gen- 
der differences is important. The differences in par- 
tisanship illustrated above are not time bound. Time 
series analysis allows us to assess the role of the chang- 
ing environment in determining gender differences in 
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attitudes and behavior and to offer an explanation for 
this change. Specifically using time series data we can 
test the hypotheses that changes are due to the ef- 
fects of changes in the political environment, changes 
in the conditions or experiences of men and women, 
and changes in the economy. In so doing, we can bet- 
ter understand gender differences more generally. In 
contrast, cross-sectional analysis is prone to several 
pitfalls. First, the persistence of the gender gap can- 
not be assesed cross-sectionally. Second, explanations 
for the gender gap in cross-sectional analysis focus on 
features that discriminate among individuals, which 
are inherently less political than macro-level factors 
(Erikson, MacKuen, and Stimson 2002); political con- 
ditions are constant within a cross section and cannot 
help us understand variations in gender differences at 
any one point in time. Third, the factors that help us ex- 
plain variation in individual behavior do not work well 
over time; in fact, they are usually constant. We, how- 
ever, want to understand the impact of slow-moving 
macro processes, e.g., economic, social, and political 
processes. For example, the percentage of women with 
a college education moves very slowly over time and 
is constant for all individuals if using a cross-sectional 
approach. Finally, one cannot consider the evolution 
of the gender gap and associated counterfactuals over 
time. We want to be able to talk about the effect of a 
shock on the trajectory and sustainability of the gender 
gap. 

Why partisan differences? First, partisanship is both 
more enduring and more general than the vote, which is 
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dependent on the particular candidates, issue positions, 
and campaign context of an election. Second, biennial 
examination of gender differences required by an anal- 
ysis of the vote belies the continuous nature of the gap; 
dynamics exist outside of elections and the events of 
the campaigns and provide more information about 
the gender gap. Third, the partisan gap may drive the 
vote gap so that we need to explain the partisan gender 
gap as well as, and prior to, a vote gap if our goal is to 
explain gender differences more generally. 

Consider a typical individual-level study of the gen- 
der gap that focuses on the presidential vote choice 
as the. dependent variable. Party identification is the 
most important explanatory variable for vote choice, 
but the gender gap is intrinsic to both party identifi- 
cation and vote choice. Using the gender gap in party 
identification to explain the gender gap in vote choice 
is thus not a satisfactory explanation. There are at least 
three methodological and substantive issues here. First, 
the gender gap in party identification itself needs to 
be explained. Even though one might be content with 
looking into the gender differences in public opinion 
to explain the gender gap in both vote choice and party 
identification, one can hardly be satisfied treating the 
gender gap in party identification as an exogenous vari- 
able. Second, if party identification is endogenous, fac- 
tors not included in the vote equation will affect both 
partisanship and the vote so that estimates of these 
effects on the vote will be biased. Time series analysis 
has an advantage over cross-sectional designs when 
there are endogenous variables. Third, even if one is 
willing to accept the exogeneity of party identification, 
itis doubtful how much of the gender gap in vote choice 
is left to be explained after controlling for this variable. 
Cook and Wilcox (1995) report that while partisanship 
did not fully account for the gender gap in vote choice 
in 1972, 1980, and 1984, it did so in both 1988 and 1992 
(see also Kaufmann and Petrocik 1999). Thus, it is not 
clear whether the observed differences between men 
and women concerning the distribution and effects of 
other explanatory variables in the vote choice equation 
really explain the gender gap. 

The natural solution to these problems, and the obvi- 
ous method for assessing change in the gender gap over 
time, is an aggregate-level, longitudinal study of the 
gender gap. We consider the gender gap in macroparti- 
sanship, measured at each time point as the aggregate- 
level difference between the percentage of female and 
that of male party identifiers who claim to be Demo- 
crats. Thus our work follows the macropolitical focus 
that Erikson, MacKuen, and Stimson (2002) advocate. 

Why focus on the gender gap itself, rather than sep- 
arately analyzing the partisanship of men and women? 
We look at the gender gap itself foremost because the 
differences between men and women compose the the- 
oretical concept at the heart of our inquiry. We are less 
interested, for example, in whether men and women 
both penalize the incumbent party for poor economic 
performance than whether women punish them dispro- 
portionately more severely than do men. An analysis of 
the gender gap focuses our attention on, indeed explic- 
itly models, these differences. An analysis of the gender 
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gap will thus allow us to draw conclusions not about 
men and women, but about the differences between 
them and about the implications of the different ef- 
fects of the independent variables on men and women. 
There are other advantages to this approach as well. 
Analyzing the difference between female and male 
macropartisanship eliminates any nongender-specific 
trend, for example, the general trend of increased Re- 
publican support that occured during the eighties. The 
shared movement need not be explained. This is a 
distinct advantage over the separate analysis of each 
gender where we must explain this shared variation. 
This is particularly true because the general trend in 
Republican party support has proven difficult to ex- 
plain. The fluctuation of the resulting gap series reflects 
only the differential responsiveness of men and women 
to the political, economic, and structural environment. 
By analyzing the path of such a time series, we can 
explore the dynamics in gender differences for a long 
period, of which only a few time points can be sampled 
in a cross-sectional study of the gender gap in vote 
choice or in a panel study of the gender gap in party 
identification. That is, change can be the focus of study. 


EXPLAINING THE DYNAMICS 
OF THE GENDER GAP 


While identifying the gender gap is easy, separating out 
causal mechanisms to explain the differences between 
men and women has proven difficult. Previous expla- 
nations for the gender gap have been cast largely in 
static terms, yet many of these explanations have clear 
aggregate dynamic implications for gender differences 
and we draw these out. We assess the persistence of 
the gender gap because of its importance for thinking 
about shifts in aggregate partisanship. If gender differ- 
ence are persistent, an increase in the gender gap due 
to shocks in the political, economic, and structural en- 
vironment may last for years even if the shocks are just 
transient phenomena. In addition, theory predicts that 
gender differences will be persistent because they are a 
function of features of the political system that change 
very slowly. For example, it is difficult to move people 
off of welfare or change education levels, so gender 
differences will be persistent. Another important part 
of our theoretical contribution is the recognition and 
discussion of the differential impact characteristics of 
men and women have on their partisanship as a result 
of the changing nature of the macro polity. We offer 
a set of hypotheses based on the theoretical literature 
that predict the aggregate dynamics of the gender gap, 
taking care to highlight the dynamic implications of 
these theories. 


Theorles of Political Climate and Gender 
Differences 


Political explanations for gender differences in parti- 
sanship treat the political climate itself as static, but 
the political climate, specifically its ideological nature, 
has important implications for how citizens react to 


519 


Dynamics of the Partisan Gender Gap 


the political parties over time. As Abramowitz and 
Saunders (1988) note, ideological polarization can im- 
pact party loyalties in the electorate. We argue further 
that ideological distinctions may have a differential im- 
pact on men and women. We hypothesize that as the 
political climate becomes more conservative, the gen- 
der gap should increase. Drawing on theories that re- 
late to both compassion and self-interest, gender roles, 
and economic vulnerability, we argue that as the po- 
litical climate becomes more hostile to welfare (more 
conservative), women ought to react more negatively 
(less positively) than men, thus increasing the gap: Po- 
litical swings to the right may (or may not) find support 
from both sexes, but women should be less supportive 
than men either if they are more compassionate or if 
they disproportionately bear the brunt of conservative 
social policy, i.e., cuts or proposed cuts that predom- 
inate in a conservative political climate. During the 
time period of our analysis, women are more likely to 
support the Democratic party and less likely to support 
the Republican party. 

Measuring the political climate is a difficult task. 
The political climate encompasses party priorities and 
rhetoric, public opinions, and political events and is 
often tied to a particular president, his administration, 
or party. We know that there was an atmosphere of 
fiscal conservatism and attacks on the welfare state 
by the Republican party during much of this time 
period (1979—98), which were given broad support. 
This contributed to Republican gains among both men 
and women. Popular ideology highlighted conservative 
preferences and these preferences were associated first 
with President Reagan and ultimately with the Repub- 
lican party (Carroll 1988, 1999). This clear association 
highlighted the ideological divisions between the par- 
ties and gave voters opposed to the conservative envi- 
ronment a clear incentive to support the Democratic 
party (or declare themselves Independents). A delay 
in the effect of the political climate on the gender gap 
would not be unexpected because the effect must be 
mediated by elite perception and articulation. 

We measure the degree of conservatism that char- 
acterizes the political environment using public sup- 
port for conservatism in the electorate. Specifically, 
we aggregate the percentage of adults who identify 
themselves as conservative as opposed to moderate or 
liberal.’ Conservative preferences are generally twice 
as popular as liberal preferences. The correlation be- 
tween the gap series and conservative macroideology is 


6 Measures of ideological self-placement based on CBS/NYT surveys 
show that women have consistently been less likely to identify as 
conservatives than have men since the late 1980s. See Rohde 1991 
for a discussion of mcreasing party polarization and Wolbrecht 2000, 
an important work that provides a strong case for party polarmation 
on gender issues 

7 Ideology ıs compiled from the CBS/NYT surveys. Respondents 
are regularly asked if they “consider themselves liberal, moderate, 
or conservative.” We used Stimson’s (1998) dimensional analysis al- 
gorithm to estimate four missing quartera. An average of over 4000 
respondents indicated that they were eather liberal, moderate, or 
conservative in each quarter the question was asked. 
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positive and significant, r = 0.215, over the full sample 
range. However, it is easy to identify points at which 
the series move in opposite directions. 


Theories of National Economic Conditions 
and Gender Differences 


National economic conditions should also affect the 
gender gap. We know that more women than men are 
economically vulnerable and reliant on welfare bene- 
fits (Erie and Rein 1988; Piven 1985; Stark 1996). We 
also know that women are more pessimistic than men 
about the state of the national economy (e.g., May and 
Stephenson 1994 and Seltzer, Newman, and Leighton 
1997)—a pessimism that is certainly enhanced as the 
economy gets worse. Poor economic conditions thus 
hurt more women than men both financially and psy- 
chologically. In addition, if women are also more com- 
passionate than men, a real or perceived bad econ- 
omy may heighten feelings of compassion for the poor 
(Ruddick 1980; Schlozman et al. 1995; Welch and 
Hibbing 1992). Since the Democratic party is typi- 
cally seen as the party for the “welfare state” and 
the party of the downtrodden, poor economic condi- 
tions should interact with the economic and psycho- 
logical differences between the genders and motivate 
more women than men to turn toward (or stay with) 
the Democratic party. Thus we expect that, all else 
equal, the gender gap will increase when the economy 
is in decline, and it will decrease when the economy 
improves. 

Chaney, Alvarez, and Nagler (1998) make an in- 
teresting argument that, because women have con- 
sistently more negative assessments of the economy 
than do men, a part of the partisan gender gap in fa- 
vor of the Democratic party in the 1984, 1988, and 
1992 presidential elections was actually a result of eco- 
nomic voting against the incumbent Republican party. 
This theory suggests that it is important to distinguish 
the variation in the partisan gender gap that is due 
to the two parties’ long-standing cleavage in welfare 
policies and the variation that is due to economic 
voting aimed at the incumbent party. To incorporate 
economic voting in our model, we test whether na- 
tional economic conditions, in addition to their main 
effects, also interact with the presidential party to af- 
fect the gender gap. These interaction effects should 
be of the same sign as their respective main effects. 
Thus, under a Republican administration, the main ef- 
fects are reinforced: When the economy is in decline, 
the partisan gender gap increases not only because 
of the main effects but because of blame attribution, 
and when the economy is on the rise, the partisan 
gender gap decreases not only because of the main 
effects but because of credit attribution. Conversely, 
under a Democratic administration, the main effects 
of the economy on the partisan gender gap are offset 
by the attribution process characteristic of economic 
voting. 

We consider three standard measures of national 
economic conditions: unemployment, inflation, and 
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income growth. Data are aggregated from the 
monthly, seasonally adjusted time series for unemploy- 
ment rates in the civilian labor forces, the consumer 
price index, and per capita real disposable income 
(chained 1996 dollars).? 


Theories of Social Structure and Gender 
Differences: Economic Vulnerability 
and Economic Autonomy 


Social change is typically very slow, but marked change 
in attitudes and behaviors often follows social change. 
Consider the nature of social structure differences in 
the economic positions of men and women over the 
last several decades.!° Scholars have pointed out the 
rapidly increasing personal dependence of women on 
government welfare programs, the increase in divorce 
rates, and the number of female-headed families, as 
well as increases in the number of women who provide 
a disproportionate amount of the care for children and 
elderly, the increase in the number of these families 
that receive means-tested benefits (e.g., food stamps, 
Medicaid, and AFDC), and the increase in the propor- 
tion of women employed in the social welfare economy 
and public sector (Andersen 1999; Clark and Clark 
1996; Deitch 1988; Erie and Rein 1988; Piven 1985). 
These structural differences between the economic po- 
sition of men and that of women mean that women as 
a group are more economically vulnerable and likely 
to be dependent on and supportive of the welfare state 
than are men. 

Yet there are opposing trends, such as the increas- 
ing number of women with higher levels of education 
and better-paying jobs (though the level of women’s 
salaries is still lower than that of men). These trends 
minimize the differences in the relative economic posi- 
tions of men and women and emphasize the economic 
autonomy that these women may have from men. 
Socioeconomic changes such as these tend to evolve 
slowly, but they can have dramatic consequences for 
attitudes and behavior in the long run. 

We consider two specific structural trends: the in- 
creasing percentage of well-educated women holding 


8 We want objective, not attitudinal measures of the economy be- 
cause it is the effects of actual economic circumstances rather than 
perceptions that create economic inequalities. 
Unemployment and CPI data are available at the Bureau of Labor 
Statistics Web site (http:/Avww.bls.gov/home.htm). Income data are 
avaliable at the Bureau of Economic Analysis Web site (http//www 
bea.gov/). Chained dollars is a measure used to express real prices 
that have been adjusted to remove the effect of changes in the pur- 
chasing power of the dollar, relative to a reference year (Energy In- 
formation Administration 1999). Chained dollars was introduced by 
the Department of Commerce and 1s the recommended measure be- 
cause chained dollars capture the effect of changes in components of 
economic measures rather than only reflecting overall price inflation 


10 Our discussion of “structural change” refers to the variation over 
time of what we call “structural variables,” i.e., variables measuring 
the various aspects of social structure. Our structural hypotheses 
refer to the possibility that such vanation affects the gender gap, not 
that the hypothesized effects would undergo abrupt changes when 
such variation occurs 
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professional or managerial positions in the workforce 
and the increasing perrentage of single women (includ- 
ing divorced, separated, and widowed women). These 
two trends capture two related but distinct phenomena. 
As the number of well-educated and well-employed 
women increases, more women find themselves in an 
economically privileged class. These women, “regard- 
less of marital status, can be considered economically 
independent from men because their education and 
experience in relatively high-status occupations en- 
able them to provide for themselves” (Carroll 1988, 
243-44). This economic independence allows women 
to “become self-governing” (240) at the same time that 
their higher levels of education and experience in the 
workforce also allow them to see beyond tradition sex 
roles and to discern interests that may be different from 
those of men. This behavior translates into an expres- 
sion of preferences, partisan or otherwise, that may not 
be shared by men. According to Carroll, as the per- 
centage of women who are economically autonomous 
increases, the gender gap should increase. Manza and 
Brooks (1998) found that gender differences in voting 
were largely explained by women’s increasing labor 
force participation, i.e., that the preferences of men 
and women diverged as women entered the workforce 
in increasing numbers. “Paid employment,” they ar- 
gue, “directly exposes women to gender inequalities 
that they are less likely to experience as homemakers, 
while also providing them with a means of economic 
independence that may shape their political behavior.” 
(1243). 

Independence or autonomy, however, is a two-way 
street. In particular, as this trend evolves, women view 
the world from a vantage point that increasingly finds 
them well educated and in professional occupations 
alongside men. This not only increases their indepen- 
dence, but also increases the number of shared ex- 
periences of men and women, potentially leading to 
shared or converging preferences. It is thus not clear 
that autonomy must produce distinct preferences. 

Single women are by definition independent of men 
and thus are likely to exhibit independent preferences. 
“Unlike economically independent managerial and 
professional women, many of the unmarried women 
who are economically independent by virtue of their 
marital status are clustered near the bottom of the so- 
cioeconomic structure” (Carroll 1988, 244). If women 
are more needy than men, personal self-interest should 
drive them (relative to men) disproportionately to the 
Democratic party, the party of the welfare state (Erie 
and Rein 1988; Piven 1985; Stark 1996), resulting in an 
increase in the partisan gender gap. 

Data on these two indicators of structural change are 
available from the Outgoing Rotations of the Current 
Population Surveys (available monthly since 1979).1! 


U The Outgomg Rotations of the Current Population Surveys (CPS), 
conducted by the Bureau of the Census, 1s a survey of a 

50,000 households conducted monthly. Each household is surveyed 
for four consecutive months, then dropped from the survey for eight 
months, then resurveyed for four consecutive months. In the outgoing 
rotation month for each household, 50,000/4 respondents are asked 
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The percentage of professionally employed, college- 
educated women steadily grew from just under 13% 
to about 19% of the population, with the sharpest in- 
creases coming before 1990. The percentage of single 
women ranged from 38% to 42%, increasing in the 
early 1980s, leveling off untill the 1990s, and then de- 
creasing and rebounding since 1995. 

In summary, as the political and economic environ- 
ment changes, factors hypothesized to explain the static 
differences between men and women may be exacer- 
bated with predictable consequences for the dynamics 
of the gender gap. As politics becomes more conser- 
vative, as party priorities or popular ideology move 
right, women—whether as a result of disproportion- 
ate reliance on social programs or compassion—tend 
to be less supportive than men. In addition, national 
economic conditions should also affect party prefer- 
ences distinctly. A weak economy, we argue, will hurt 
women more than men for similar reasons and will 
trigger compassion as well. Further shifts in the struc- 
tural positions of women may affect the gender gap. 
Here we expect that increases in the percentage of sin- 
gle female-headed households will increase the gender 
gap, as these women rely more on social programs sup- 
ported by the Democrats. Increases in well-educated 
and well-employed women, however, lead us to ex- 
pect both more autonomy (economic and otherwise) 
and more shared experiences so that we are agnostic 
about the effects of this trend. In short, we expect that 
the gender gap will increase when women as a group 
become more economically vulnerable, the economy 
gets worse, the political climate becomes more conser- 
vative, and government services are threatened. 


ANALYSES 


We begin our analysis by asking, How enduring or per- 
sistent are the gender differences in partisanship? and 
How does the gender gap respond to changes in the 
political, economic, and structural environment? 


The Persistence of Gender Differences 


Are gender differences in partisanship persistent? This 
is an important question because, substantively, if gen- 
der differences are persistent, an increase in the gen- 
der gap due to shocks in the political, economic, and 
structural environment may last for years even if the 
shocks are just transient phenomena; otherwise, the 
increase dissipates quickly once external shocks cease 
to exist. Methodologically, it is critical that we correctly 
estimate the persistence of the gender gap time series 
to ensure valid inferences about the dynamics of polit- 
ical processes in our multivariate analyses (e.g., Lebo, 
Walker, and Clarke 2000; Maddala and Kim 1998, and 
Parke 1999). 

We expect gender differences in partisanship to be 
persistent. Granger and Joyeux (1980) show that ag- 


a battery of questions. The survey provides mformation as to the 
respondent’s level of education, age, occupation, marital status, state 
of residence, and other demographic information. 
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pregating heterogeneous dynamic microprocesses with 
autoregressive behaviour may result in a macroprocess 
that is fractionally integrated. While it is not substan- 
tively meaningful to speak of a micro-level “gender 
gap,” the gender gap macroprocess is the difference be- 
tween female macropartisanship and male macropar- 
tisanship, which are macroprocesses aggregated from 
individual-level microprocesses. If these micropro- 
cesses are heterogeneous, then macropartisanship will 
be fractionally integrated, which Box-Steffensmeier 
and Smith (1996) have shown to be the case. 

More formally, let x, be the party identification 
(1 for Democrat and 0 for Republican) of individual i 
at time t. A general autoregressive model of the micro- 
partisanship process is 


Xit = Hj Xy-1 + ÔU + Ex, 


where z; represents systematic factors whose effects 
are taken to be common across individuals and e,, are 
independent error terms. Granger and Joyeux’s notion 
of heterogeneity entails that the autoregressive coeffi- 
cient œ, be different from individual to individual. Be- 
cause the gender gap is a function of female and male 
macropartisanship, we expect that it is also fractionally 
integrated and, hence, persistent. 

We can measure the persistence of the gender gap 
by estimating the order of integration (d). The order 
of integration tells us how quickly the effects of shocks 
to a process die out over time. A weakly stationary 
series, with d=0, has no long-term memory, meaning 
that an increase or decrease in the process due to ex- 
ternal shocks dies out completely and quickly as soon 
as the shocks are over. Such series are mean reverting 
and covariance stationary with finite variance. Short- 
term memory can be characterized with combinations 
of stationary autoregressive and moving average terms 
[ARMA(p, q)]. In contrast, a series with d=1 is said 
to have permanent memory, i.e., an increase due to 
transient shocks will last forever. The first-order inte- 
grated series, ie., the random walk, is characterized 
by mean nonstationarity (if there is a drift), variance 
nonstationarity, and covariance nopstationarity. These 
processes are typically integer differenced before anal- 
ysis, leaving the possibility of short-term memory in 
the differenced series. A series with d falling between 
zero and one is “fractionally integrated” and exhibits 
various degrees of long-term memory or persistence. 
The closer d is to one, the longer the memory and 
the more persistent are the effects of shocks (Box- 
Steffensmeier and Smith 1998, 667). Fractionally in- 
tegrated processes are mean reverting. For values of 
d<(0.5, fractional are variance and covari- 
ance stationary, but for values of d > 0.5 they have time- 
dependent variances and are covariance nonstationary. 
In general, a time series process may exhibit both long- 
term, persistent memory, which is captured by the frac- 
tional parameter d, and short-term, transient memory, 


12 A senes with d < (1s stationary with short and negative memory 


(meaning that its response to a postive unit input will be nega- 
tive), and a series with d > 1 is nonstationary with long and explosive 


memory. 
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TABLE 1. 


Female Democratic 
Partisanship* 

Male Democratic 
Partisansh!p* 

Gender Gap* 


Conservative Macroideology” 
Unemployment* 

Inflatton* 

Growth? 

Professional and College- 


Educated Women* 
Single Women’ 


ARFIMA Model Estimates and #Ratios 


d=0 
4.448 


6.179 
4.429 
1.480 
4.178 
5.927 
0.697 
5.649 


7.368 


#Ratio for 


d=0.5 
—1.299 


0.226 
—0.333 
—3.422 

1.703 

2.897 
—4.921 

2.776 


1.621 





—8.324 
—0.772 
—0.133 
—10.539 
—0.098 


—4.126 
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(0,0) 


(0,0) 


(0,0) 
(1,0) 
(2,1) 
(0,0) 
(4,3) 


(0,0) 


Note: The Schwarz information criterion (SIC) was used to select the best model from ARFIMA(0,d,0) up to 
ARFIMA(4,d,4) with the estimated d< 1 In levels. Models with d>1 n levels were considered exploaive and 
not selected All modets were estmated with a constant term. 


which is captured by the stationary autoregressive and 
moving average parameters p and q. Such a process is 
called an autoregressive fractionally integrated mov- 
ing average process or ARFIMA(p, d, q). The combi- 
nation of long-term memory and short-term memory 
implies that once the process is differenced to get rid of 
the long memory, it will still have some short memory 
left. 

We estimate the order of integration using the full 
information modified profile likelihood (MPL)(An and 
Bloomfield 1993). The autoregressive fractionally in- 
tegrated moving average (ARFIMA) model estimate 
for the gender gap is (0, d, 0) with d=0.465, shown 
in Table 1.4 The series is fractionally integrated since 


13 The modified profile likelihood has been shown to have supe- 
rior small-sample properties over the exact maximum likelihood 
estimator derived by Hauser 1999 and Sowell 1992. In partc- 
ular, bias in estimates of d are, on average, smaller using the 
MPL. We use OX 3.30 Console and ARFIMA 1.01 to estimate d. 
ARFIMA 18 a specialized package for OX; both software packages 
are available for free download at Jurgen A. Doornik’s Web site at 
http /Awwouffox.ac.uk/Users/Doornik/index_html (October 2003). 
14 The technical aspects of estimation of the ARFIMA models in 
Tables 1 proceed as follows. The estimation routines of the ARFIMA 
package for OX require that the series be made stationary before 
estimation, the parameter space to (—5, 0.49999]. This 
means that if the order of integration for the levels process, d, is 
outside or near the upper bound of the parameter space, we must 
estimate on first differences whose order of integration, d — 1, will 
He in the parameter space. If we estimate on first differences, the 
estimate will be for d—1 rather than for d. Since what is of sub- 
stantive interest is levels, not first differences, we must add one back 
to the estimated d— 1 to get the estimate for d. This 

which takes advantage of a straightforward algebraic relationship, 
gives us the d values presented in Table 1. Among the nine series in 
Table 1, only conservative macroideology and income growth could 
be estimated from levels, the other seven series must be estimated 





the hypothesis that d=0 as well as the hypothesis that 
d=1can be rejected at conventional levels of statistical 
significance. 

A time series’ degree of persistence can be depicted 
more clearly by its impulse response function. The im- 
pulse response function of the partisan gender gap is 
provided in Figure 3. The aggregate gender gap ex- 
hibits persistence. Assume that there is a temporary, 
one-quarter downturn in the economy. Following this 
downturn, about 24% of the initial change in the gen- 
der gap remains one year after the shock, about 17% 
after two years, and about 12% after four years. The 
influence of the impulse input diminishes but has not 
vanished even years afterward. This finding is impor- 
tant because shocks can promote gender partisan dif- 
ferences that are not easily put aside—once differences 
are stimulated, they will disappear only over fairly long 
time periods or by a contrary stimulus. Thus if economic 
downturns affect the gender gap and are sustained for 
long periods or national preferences trend in one direc- 
tion for any length of time, it will be hard to overcome 
their effects on the gender gap in the short term. If 
structural differences, which take decades to change, 
are the cause of the gender gap, the effect will be 
around even longer. We move now to a multivariate 
analysis of the gender gap. 


from first differences. The steps required for estimation are purety 
technical and do not affect the final ARFIMA mode] selected or the 
substantive mterpretation of the permstence that characterizes the 
process (Doornik and Ooms 2003; Lebo and Clarke 2000). We use the 
Schwarz information criterion (SIC) and assess whether the estimate 
mnplies an explosive process in determining the best ARFIMA model 
(e.g., Box-Steffensmeder and Smith 1998, Lebo 2000, and Mills 1992). 
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FIGURE 3. 
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Impulse Response Function for the Partisan Gender Gap 





54 63 


Quarters After an Impulse Input 
Note: Based on the estimated ARFIMA(0, 0.465, 0) model for the partisan gender gap The Impulse response function shown here does 


not take Into account the effect of a dnft term (0.140). 


Multivariate Analysis 


We begin by recognizing the need to address auto- 
correlation in our multivariate analysis to avoid spu- 
rious correlation and erroneous statistical inferences. 
We transform our series into the white noise residuals 
of their respective ARFIMA models, i.e., “prewhiten,” 
to purge autocorrelation and ensure stationarity be- 
fore examining the relationship among the series (e.g., 
Haugh and Box 1977).) That is, we remove the part of 
the series that can be explained by itself before assess- 
ing the multivariate relationships. 

Table 1 presents the estimates of the univariate 
models. It also shows, via t-tests, that conservative 
macroideology and income growth are the only clearly 
stationary time series with 0 < d < 0.5. All other series 
are nonstationary because d = 0 can be statistically re- 
jected, while either d=0.5 or d=1 cannot be rejected. 
These models are used to transform the time series into 
stationary time series. Table 2 provides a heuristic guide 
to facilitate substantive interpretation of relationships 
based on the innovation or filtered series. The first col- 
umn of numbers in Table 2 are correlations between 
innovation series and their respective levels series, and 
the second column shows correlations between innova- 
tion series and first differences of respective levels. For 
conservative macroideology and income growth, which 


15 The procedure can be considered a modification to the Box— 
Jenkins (1976) transfer function model, which is based on transform- 
ing both the input and the output series using the ARIMA model for 
the input senes See Appendix A (online) for more details 
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TABLE 2. Correlations of Prewhitened Series 
with Levels and First-Differenced Series 





Female Democratic 


Single Women 0.359 0.939 

a ARFIMA(p,d,@ models kdentified and estimated from levels. 
All the other senes were identified and estimated from first dif- 
ferences. See Table 1. 





are stationary in levels, the innovation series are highly 
correlated with levels. For the series that are nonsta- 
tionary in levels, the innovation series are not even 
moderately correlated with levels but are moderately 
to highly correlated with first differences, Thus, for ex- 
ample, the inflation mnovation series is much closer 
to the (quarterly) changes in inflation rates than to 
the levels of inflation rates. In contrast, the innovation 
income growth series is very close to income growth in 
levels. Depending on whether a levels series is station- 
ary or nonstationary—whether d<0.5 or d>0.5—its 
innovation series is substantively proximate to either 
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levels or changes. We discuss the results using levels 
and/or changes as reference points when a substantive 
interpretation is useful. 

Our multivariate analysis starts with the assumption 
that, in terms of prewhitened variables or innovations, 
both female and male Democratic partisanship are lin- 
ear functions of a set of explanatory variables measur- 
ing political climate, national economic conditions, and 
changes in social structure. In matrix notation, 


Female Democratic partisanship = XB + u, 
Male Democratic partisanship = Xy + v, 


where matrix X contains observations of the explana- 
tory variables. B and y are vectors of coefficients, and 
u and v are vectors of disturbances. Subtracting the 
second equation from the first one gives: 


Female Democratic partisanship 


— Male Democratic partisanship 
=X(B — y)+(a— Y). 


We focus on estimating this equation because our the- 
oretical hypotheses are about the differential effects 
of X on female and male Democratic partisanship. 
This equation allows us to test the set of null hy- 
potheses Ho: 6 — y = 0. Estimating the gender-specific 
equations would not be sufficient because to test Ho 
from the separate estimates of 6 and y one needs to 
know not only the estimates’ respective variances but 
also their covariance, which cannot be computed when 
the gender-specific equations are estimated separately. 
In fact, it is quite possible that both £ and y are not 
statistically significant, while their difference, £ — y, is 
statistically significant, in which case to draw inference 
about £ — y from the estimates of the gender-specific 
equations would be misleading.’’ For these reasons we 
present only the estimates of the gender gap equation. 
We present the results of our multivariate model 
of the partisan gender gap in Table 3. By modeling 
the innovations series, we do not expect strong statis- 
tical significance and results will err on the side of null 
findings. Overall, however, we find support for all hy- 
potheses that were tested. For the sample period 1979- 
2000, the partisan gender gap is affected by changes in 
political, economic, and social structures. Specifically, 
the partisan gender gap increases with (a) increases in 
the percentage of adults who claim to be conservative, 
(b) economic deterioration, and (c) increases in the 
percentage of women who are single/unmarried.'® 


16 While this is inexact, the direction of a relationship between 
prewhitened series will be consistent with that of the relationship 
between appropriate levels and/or changes because the correlations 
between a prewhitened series and its appropriate levels/changes are 
all positive. 

17 Even with two independent statistics, the fact that their respective 
confidence intervals overlap does not imply that their difference ts 
statistically insignificant. See van Belle (2002, 39). 

18 Since our dependent variable is the difference between female 
Democratic partisanship and male Democratic partisanship, we con- 
sider both ts endogenous. Following Box-Steffensmeier 
and De Boef (2001), we also consider conservative macroideology 
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TABLE 3. OLS Estimates of the Equation for 
the (Democratic) Partisan Gender Gap, 
1979-2000 
Dependent Variable: Female Democratic 
Partisanship; — Male Democratic Partisanship, 
Independent Varlable OLS Estimate (SE) 
Female Democratic Partisanship: 0.055 o 119) 
Female Democratic Partisanship; 
Male Democratic Partisanship.: 
Male Democratic Partisanship;_2 
Conservative Macroldeology;_; 
Conservative Macroldeology;2 
Unemployment; x Presidential 
Party; 
Unemployment, 
Inflation, x Presidential Party; 
Inflation; 
Income Growth: x Presidential 


0.122 
0.193 


1.451 (1.328 

—0.504 (0.188 
0.302 (0.185 
0.134 (0.082 


Party; 
Income Growth, 
Professional and College- 
Educated Women, 
Single Women, 
Constant 


—0.247 (0.084) 
—1.076 (1.282) 


1.276 (0.719)* 
0.069 (0.257) 
Adjusted R? 0.148 
Standard error of estimate (SEE) 2.357 
Durbin—Watson 2.061 
N of observations 87 
Note: AH time sertes (except Presidential Party) were filtered 
according to their respective ARFIMA models to purge autocor- 
relations and ensure no spurious N In the regres- 
*p<.10 





Conservative macroideology appears to lead the par- 
tisan gender gap by two quarters. That is, when the 
political environment becomes more conservative, the 
gender gap increases about a half a year later. When 
conservative political rhetoric and policy enticed in- 
creasing numbers of citizens to claim conservative iden- 
tification, they also polarized gender differences in par- 
tisan preferences. Whether because women felt more 
compassionate or were more likely to need services 
attacked in a politically conservative environment than 
men, this effect produced increases in the gender gap, 
supporting the political effects hypothesis. The two- 
quarter delay may reflect the time it takes for partisan 
changes to take place in response to movements in 


endogenous. We include m our model two lags of the endogenous 
vanables as m vector a Willams, and Lin 
1989), although our model is not strictly VAR. Here, lagged male 
and female partisanship capture gender-specific ics. We stop at 
two lags because additional lags do not improve the fit of the model. 
Our results show that female Democratic partisanship is somewhat 
significant at lag 2, with a negative coeffiaent This means that, con- 
trolling for all other variables, the gender gap process has a negative 
feedback mechanism in itself: Increasing female partisanship will 
lead to a decrease in the gender gap two quarters later. This does not 
mean that the gender gap should not be persistent. It suggests that 
the persistency in the gender gap is not intrinsic to itself but due to 
the environmental vanables that affect the gender gap. 
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mass support for conservative policy and the ensuing 
elite perception and articulation.’9 

As expected, income growth has a negative main 
effect on changes in the gender gap, but neither un- 
employment nor inflation has a statistically significant 
main effect.” The finding that unemployment has no 
significant main effect once income growth and infla- 
tion are controlled for is consistent with findings in the 
economic voting literature (Fair 1978, 1982; Kramer 
1971; Stigler 1973). Unemployment affects only a small 
percentage of the population, and not all of the unem- 
ployed would shift their party loyalty (see Stigler 1973, 
162). Inflation affects a far broader segment of society, 
in terms of real income, debtor/creditor status, fixed 
incomes, etc. When used in conjunction with changes 
in real income (vis-a-vis Monetary income), however, 
inflation is often found to have an insignificant effect 
on votes (Fair 1978; Kramer 1971). Presumably this is 
because real income is already adjusted for inflation. 
Our analysis shows that an increase in inflation does 
lead to an increase in the gender gap, but the main 
effect of inflation has a p value of .108 (two-tailed test). 
Since the prewhitened economic variables do not ex- 
hibit significant collinearity, income growth is the only 
economic condition that clearly drives the gender gap. 

Our model allows us to test whether the effects 
of economic conditions on the partisan gender gap 
are different under different party administrations, a 
hypothesis inspired by Chaney, Alvarez, and Nagler 
1998. By interacting the economic variables with a 
dummy variable representing the presidential party 
(1 for Republicans and —1 for Democrats), we sep- 
arate administration-specific effects from their respec- 
tive main effects.” Our estimates show that only infla- 
tion has a statistically significant interaction effect. The 
best way to assess the effect of inflation on the gen- 
der gap under different administrations, however, is to 
calculate its total effect with respect to Party = 1 (Re- 
publican control) and Party = —1 (Democratic con- 
trol). Mathematically, the total effect is: 


a(GenderGap) ; 
———— = MainFffect 
a (Inflation) 


+ InteractionEffect x Party 


The statistical significance of this total effect can be 
established by conducting a t-test (or an F-test for 
linear restrictions) after substituting Party with the ap- 
propriate value. (Friedrich 1982). Our tests show that 
the total effect of inflation is positive and statistically 
significant under Democratic administrations but not 


19 This findmg is consistent with Stmson, MacKuen, and Erikson 
(1995), who demonstrate that policy outputs of government respond 
to public opinion. 

2 Our empirical analysis showed that income growth needed to be 
lagged once, while unemployment and inflation did not. This makes 
sense because the former is in levels, while the latter two variables 
are first differenced. 

21 Party is not included as a “stand-alone” variable because our re- 
gression involves mnovatons, and any arbitrary origm ın levels has 
already been filtered out. If we included party directly, we would 
have a stationary left-hand side and a nonstationary mght-hand side. 
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under Republican administrations. During our sample 
period (1979—2000), there were only two Democratic 
presidents: Carter and Clinton. While inflation was 
consistently low under the Clinton administration, it 
rose sharply under President Carter. Thus, the positive 
total effect of inflation under Democratic administra- 
tions appears to be specific to the Carter administration 
and the large economic slump at that time. This effect, 
however, was obviously not due to women’s overly 
pessimistic reaction to the way the incumbent party 
handled the economy. If it was, the gender gap would 
have decreased rather than increased. The traditional 
partisan cleavage on welfare policies seems to be rel- 
evant as economic voting here. If so, the implication 
is that women will be more likely to tolerate inflation 
under Democratic presidents because of the party’s 
position on social welfare. Thus, in times of rising in- 
flation such as the late 1970s, women may have stuck 
with the Democratic Party while men turned to the 
Republicans, causing the gender gap to increase. Under 
Republican presidents, if men and women are equally 
likely to punish the administration for increasing infla- 
tion rates, inflation will exhibit no total effect on the 
gender gap. 

The results show that not all social structure related 
trends work together. We find that as the percentage 
of women who head households (whether single, di- 
vorced, separated, or widowed) increased, the gap has 
also increased. As Carroll (1988) puts it, single women 
tend to be near the bottom of the socioeconomic struc- 
ture, making them more reliant on social programs 
favored by the Democrats In contrast, we did not 
find support for the hypothesis that as college-educated 
women enter the professional labor force their prefer- 
ences are more likely to diverge from those of men. 

These results indicate that the assumption that shed- 
ding stereotypical gender roles and becoming more 
independent will automatically lead to distinct pre- 
ferences is not accurate. Specifically, it may be that 
shedding traditional gender roles, in which women are 
the nurturers and men the providers, frees women to 
develop preferences independently from men but to 
do so in ways that are sometime like men and oth- 
ers different from men. Further, given the economic 
independence that comes with increased participation 
in the labor force and higher levels of education on, this 
trend works against the role of need-based arguments 
for gender differences. More needs to be done (at the 
individual level) to disentangle these effects. 

Taken together, these results suggest that economic 
vulnerability may best explain the effects of social 
changes on gender differences. Where women are 
most likely to be economically independent from 


Z Observers have noticed a dramatic “marriage gap” over the last 
few presidential elections. When the electorate 1s broken down by 
gender and marital status, it 1s clear that unmarried women provided 
the strongest support for the Democrats. For example, a New York 
Times/CBS News poll conducted shortly before the 2000 election 
shows that while married women preferred Bush over Gore by a 
margin of eight percentage points, unmarried women preferred Gore 
to Bush by an overwhelming margin of 35 percentage points (Seelye 
2000). 
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men—holding better and, on average, higher-paying 
jobs—they find themselves more like men but also freer 
to develop their own perspectives, which may be similar 
to or different from those of men. At the same time, 
as women heading households are more likely to be 
economically vulnerable, they are likely to hold both 
different preferences than men and also more liberal 
preferences, leading them to support the Democratic 
party at higher rates than men and producing a larger 
Democratic gender gap. 

In understanding these results it is important to recall 
that while innovation series are used in the multivari- 
ate analysis, the gender gap in levels is a persistent time 
series process. Thus movements in the gender gap stim- 
ulated by concerted significant shocks in the political, 
economic, and structural environment in the late 1970s 
and early 1980s have been sustained throughout the 
1980s and into the 1990s. This does not mean that the 
partisan gender gap we observe today is nothing but a 
residual phenomenon. New shocks are constantly stim- 
ulating gender differences. It does suggest that even 
when the environment is relatively quiet, the gender 
gap will not necessarily vanish. 


CONCLUSION 


The gender gap is a pervasive feature of the political 
landscape, behaving in similar ways both between and 
within election years. The generality of partisan gender 
differences implies that their effects are felt outside of 
elections. They affect the kinds of issues on the political 
agenda, the nature of ensuing policy debates, and the 
nomination process, as well as elections themselves. 
The persistence of gender differences also implies that 
the differential voting behavior of men and women 
in a given election is not exclusively a product of the 
election itself It grows out of something bigger and is, in 
the short term, an aspect of the political landscape that 
represents the “givens” of a campaign and, thus, must 
be directly grappled with by savvy politicians hoping 
to win elections. The persistence of gender differences 
means that they become a part of the features of the 
political landscape and that they imply a presence and 
a pervasiveness that politicians and policy makers need 
be wary to ignore. 

Importantly, while gender differences in partisanship 
are persistent, they need not become a permanent fea- 
ture of the social and political landscape. As our anal- 
ysis demonstrates, they are created by a unique mix of 
social changes, economic opportunities, political actors, 
and government priorities. While these circumstances 
tend to change slowly, they do change in important 
ways even in the medium term. Further, changes are 
not random, nor do they inevitably move in a direc- 
tion that exacerbates gender differences. In the last few 
decades of the twentieth century social changes ran a 
fast pace. In particular, traditional family structures ev- 
idenced disintegration, with the percentage of women 
who head households increasing drastically. Accompa- 
nying these trends were economic and political con- 
ditions that, while more variable, often interacted to 
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produce more economic vulnerability among women in 
combination with less redress from government. These 
changes have acted to uniquely shape the experiences 
of men and women, producing a gender gap in partisan- 
ship characterized by disproportionately more support 
for the Democratic party among women and for the 
Republican party among men. The consequences for 
attitudes and for politics continue to evolve. 

Much has been made of the gender gap in American 
politics. Politicians talk about it, the media point to 
its significance, and the sense is that it is important. Its 
salience, we argue, is warranted. It is warranted because 
the gender gap both is persistent and is a product of the 
interaction of societal conditions and politics. Whether 
the gender gap continues to grow or diminishes will 
depend greatly on the behavior of strategic politicians, 
on the continuation of trends in the nature of families, 
and on economic conditions. Either way, it is likely to be 
a continual feature of the American political landscape, 
one that shapes everything from elite political behavior 
to election outcomes and public policy. 
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Revolutionary America 

Lee Ward 

$90 00 Hardbeck 0-521-82745-0 


Votes and Violence 
Electoral Competition and Ethnic Riots In india 
Steven Wilkinson 


Cambndge Studies in Comparatre Polrcs 
$75 00 Herdbeck 0-521-82916-X 


Medla and the Path to Peace 
Gadi Wolfsfeld 
Communication, Soosty and Polrbcs 


$70 00 Hardback 0-521-83136-9 
$25 00 Paperback 0-521-53862-9 


On Capito! Hill 


The Struggle to Reform Congress and Its Consequences, 
1948-2000 


Jullan E Zelizer 
$30 00 Hardback 0-521-80161-3 
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Political Science fom YALE 


New in paper 





WHY THE ELECTORAL COLLEGE 
Is Bap FOR AMERICA THe MODEREN Proxrce HELLFIgE NATION 





George C. Edwards III ; What Leaders Need to The Politics of Sin in American History 
Foreword by Neal R. Peirce ° Know Now James A. Morone TETI 
“Dispensing keen polit- ` Carpes: kord “Hellfire Nation [places] pepi 
ical insights as he goes, | “A tour de force. ... much of our public life in 





George Edwards punc- | Lord’s understanding of its proper soul-searching 

tures the fallacious BAD FOF the workings of govem- context—and its careful 

arguments used by sup- | AMECA ment, both ancient and anatomy of the hand-in- 

porters of the electoral - modem, is profound, glove relations between 

college whenever criticism of that and his ability to assimilate the two, the American state and 

splendid anachronism grows hot, as it makes The Modem Prince indispensable. the American faithful is both welcome and 
once again has.”—Jack Rakove $26.00 | ...As a handbook for leaders it flkuminating.”—Chris Lehmann, 


deserves to become an instant classic.” Washington Post Book World 
—Brian M. Camey, Wall Street journal 43 illus. $18.00 paperback 


$17.00 paperback 


New in paper 
50 Yrans or 





DISSENT 25 THe Morar FOUNDATIONS OF 
Edited by Nicolaus Mis © = = How Crass Worxs ^ Pouimics 

and Michael Walzer = = fee ee ee lan Shapiro 

With an introduction by a Stanley Aronowitz “In The Moral 

“| find this volume of jetofeocalden Shapiro reaffirms his 

essays impressive not only In Pace. ject of social class : Ea place as one of the very 

ty but also in their surpnsing relevance squarely on the intellec- clearest and most res- 

to political life today.”—Robert Dah! tual agenda—though In a new, Inclusive, jute, and most solidly 


$25.00 rback and dynamic form. Like his influential 
pis False Promises, How Class Works is both deans plate tion.” 


intellectually exciting and morally chal- Adolph Reed, Jr. $18.00 paperback 





Can Gop AND lenging.”“—Barbara Ehrenreich 

Caxsar COEXIST? $17.00 paperback I P 

& International Law Why Do They Fight? Can They Stop? 
Robert F. Drinan, S.J., DEMOCRACY BY DECREE Second Edition 

recipient of the 2003 Franklin What Happens When Bernard Wasserstein 

and Eleanor Roosevelt Courts Run Government ; 

Insttute’s Freedom of Worship Ross Sandler and “fa philosopher-king 

Meda! David Schoenbrod tis sch tale 

“No one Is bett | ] than Robert “A brilliant, Palestinian ict, 
Drinan to tackle the Intersection and brave account of jee 


between religious freedom and the pub- how federal courts have 
lic world. In this book he does that Inci- distorted our political sys- : 
sively, making complex issues accessible | tem by taking control of conet institu- 





—Fric Silver, jewish $ 
Chronicle $16.00 paperback 





and abstract problems personal.” tions like schools and pnsons—sometimes AFTER 
—William F. Schulz $30.00 for decades—instead of enforcing rights, pererecna 
i THE KHMER Rover A 
which is their proper domain. Inside the Politics of t 
F —Diane Ravitch $18.00 paperback Nation Building 
Warlords and Rebels in AN ; Evan R. Gottesman 
the Balkans kw REPUBLIC x 
John Phillips A History of the United oe ere ut 
` States in the Twentieth nuanced account of mul- 
A cogent, judicious ... Century tilayered backroom efforts 
report on an important, John Lukacs to rebulld Cambodia after [Vietnam] over- 
woefully under-covered threw the Khmer Rouge in 1979. The les- 
conflict. Phillips does e el ola ea x sons for the United States in Iraq are 
well in capturing the Nie at what has become the many. . . . Washington should take note: 
political, moral, communal—of the American democracy in the tw entieth this book is a sober and valuable warming 


various sides."—Strobe Talbott $30.00 century, one of our most respected histori- 
ans offers a major statement on the nature 


of our political system. $19.95 paperback 
YALE Unrversity Press (te) yalebooks.com 


of how difficult that struggle can be.” 
—Eric Pape, Newsweek $22.50 paperback 








MY LIFE 

by Bill Cinton 

President Bill Clinton’s My Life ıs the strikingly candid portrait of a global leader who 
decided earty in life to devote his intellectual and political gifts, and his extraordinary 
capacity for hard work, to serving the public. Filled with fascinating moments and insights, 


1t is told openly, directly, in President Clinton’s own completely recognizable voice. 
Knopf | Cloth | 957 pages | $35.00 
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SAME-SEX MARRIAGE: => 
PRO AND CON 


Edited by Andrew Sulitoan 

With same-sex marriage igniting a firestorm 
of controversy ın the press and ın the courts, 
1n legislative chambers and ın Irving rooms, 
Andrew Sullivan, a pioneering voice in the 
debate, has brought together two thousand 
years of argument in an anthology of historic 







Mv Lite 
Ball Clinton 


inclusiveness and evenhandedness. Same- THE ROADS TO MODERNITY = 
Sex Marriage ıs essential to a balanced The British, French, and American 

understanding of the most pressing cultural Enkghtenments 

question we face today. HG le Hi Nar’ 


Vintage | Paper | 416 pages | $14.00 


“To see nepotism as a natural human im- 
pulse, a force in the advancement of crvilzza- 
tion, and an enemy and frend of democracy 
and free markets was all a revelation. And 
Bellow’s description of a benevolent and 


inclusive nepotism is a strikingly onginal | arraian [| IMPERIAL AMERICA j 
idea that will make this book a landmark.” The Bush Assault on tha World Order 
ee by Joba Newhouse 
Anchor | Paper | 576 pages | $16.00 With a New Prefs 
“A lucid and accessible account of how 
FROM OSLO TO IRAQ AND [Newhouse] says the administration has 
THE ROAD MAP done more to imperil the United States than 
Essays to enhance ıts security.” 
by Ed | W. Sard —The New York Timer Book Review 


Said reveals information that never finds its 
way into the American media, thus provid- 
ing a real context for a better understanding 
of the Middle East. Fiercely uncompro- 
musing, written with clarity and elegance, 
From Oslo to Iraq and the Road Map provides 
an essential and unique voice that 1s more 
important now than ever before. 

Pantheon | Cloth | 352 pages | $25.95 








From one of the most distinguished in- 
tellectual historians comes this brilliant 





IN PRAISE OF NEPOTISM reconsideration of the Enlightenment that 
A History of Famıly Enterprise from reclaims its origin for the British intellectual 
King David to George W Bush tradition from the French to whom scholars 
have lo iven preeminence Himmelfarb 
NEPOTISM]  4y-4dam Bellow Aare 


also devotes chapters to the French and 
American Enlightenments in this remark- 
able exercise in the history of ideas. 
Knopf | Cloth | 304 pages | $25.00 


“John Newhouse 1s one of the foreign policy 
establishment's more thoughtful observers.” 
—The Boston Globe 


Vintage | Paper | 208 pages | $13 00 











Revised Edition 
NOW ON SALE 





yle 
NEMUA 


for Political Science 


APSA is pleased to announce the release of the new 
edition of the Style Manual for Political Science. 
Recently revised, this edition reflects the current 
style of the American Political Science Review and 
contains updated information on documenting 
electronic and online sources. It also includes 
submission guidelines for APSR and a new foreword 
by Lee Sigelman, editor of APSR. 


Single copy rate: $3 members; 
$6 non-members 


Order APSA Publications online at 
www.apsanet.org/pubs or call 202.483.2512. 

For more information about these or other APSA 
publications, visit the online catalog at 
www.apsanet.org/pubs/catalog.cfm 








EXUALITY RESEARCH FELLOWSHIP PROGRAM 












oF recognition of its roth and final year 
of providing fellowship support, the 
to outstanding scholars and other professionals 
working in a variety of settings, whose research 
in sexuality focuses on policy analyns, policy 
development and/or implementation that ıs rele- 
vant to local, state, or national concems. No dis- 
sertation fellowships will be awarded. 

The 2005-2006 competition will target more 
advanced sexuality scholars and practitioners as 
demonstrated by academic/adminustrative posi- 
tion, publication record and research experience. 
More junior scholars demonstrating sufficient 
research experience are eligible to apply. Prior 
direct policy experience is not required. 

Fellows should submit applications in parmer- 
ship with another scholar/professional who will 
function in tbe advisory capacity of Research 
Consultant. The Research Consultant will pro- 
vide substantive experuse to the Fellow in specif- 
ic areas of research methodology/design and/or 
policy, and provide overall assistance and advice. 


ELIGIBILITY 


The competition is open to scholars who already 
hold the Ph.D. or its equivalent from an accred- 
ited United States or foreign university at the 
time of the application deadline. In order to 
ensure a diverse pool, applicants must have 
received the Ph.D. degree no more than 10 years 
prior to the date of the deadline submission and 
cannot hold a tenured faculty position. 
Policymakers, advocates, service providers, 
and program administrators who conduct 


Postdoctoral Fellowships on Sexuality & Policy 






research in various settings—such as business, 
federal/state government, and community set- 
tings—are also invited to apply. 


STIPEND 


Fellowship awards will be given to six 
Postdoctoral Fellows for a one-year period and 
will include stipends based on the awardee’s pre- 
vious year's salary and professional standing, for 
a minimum of $50,000 and up to $60,000 to 
cover research costs and living expenses. For 
each fellowship awarded, an additional $5,000 
will be awarded to the Fellow’s Research 
Consultant, in compensation for his/her conti- 
bution to the research partnership. 

Applications must be received on or before 
December 15, 2004. Applications will be 
available online at www-serc.org/fellowships/ 
sexuality. 


CONTACT THE SFRP 


Social Science Research Council 
810 Seventh Avenue, 31st Floor 
New York, New York 10019 
TELEPHONE: 212.377.2700 
E-MAIL: stfp@zsrc.org 

In the administration of tts fellowship and grant 
programs, the Social Science Research Council 
does not discriminate on the basis of age, color, 
creed, disability, marital status, 
national origin, sexual orientation, 
race, gender, or any other charac- 
teristic protected by applicable laws. 
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More Damned Lies 
and Statistics 


How Numbers Confuse Public Issues 
Joel Best 
“Through hls devastating work on common 
myths about social problems, Joel Best has 
established himself as a brilliant observer of 
our national fads and scares. In this latest 
book, Best confronts yet more of the pseudo- 
statistics by which we are bamboozled day 
by day.” 

—Prup Jeunes, author of The New Ant-Catholictsm 
$19.95 cloth 


American Gulag 


Inside U.S. Immigration Prisons 

Mark Dow 

“American Gulag tells the horttfying story of 

men, women and children detained Indefinitety 

by U.S. Immigration offictals as It has never 

been told before. It sounds an alarm for us all.” 
—Aarwony Lines, author of Gadeon’s Trumpet 

$27.50 cloth 


Chechnya 

Ufe In a War-Tom Soclety 

Valery Tishkov 

Foreword by Mikhail S. Gorbachev 

“A most valuable book on an Important subject 
about which Americans know little. | recom- 
mend thts book to anyone Interested In the alt 
Important U.S./Russla relationship as well as 
to general students of International affairs.” 


—THomas GaaHas Ji, former U.S. Ambassador and 
author of Disarmament Sketches 


Callfornda Series In Public Anthropology 
$50.00 cloth, $19.95 paper 


A Tibetan Revolutionary 
The Political Life and Times of 

Bapa Phintso Wangye 

Melvyn C. Goldstein, Dawei Sherap, and 
Willlam R. Slebenschuh 

“These extraordinary memolrs.are essential 
reading... The founder of the Tibetan 
Communist Party recalls vividly hls personal 
role In the epic struggle of the Tibetan people 
over tradition and modemlty, and the hopes, 
betrayals and tragedies that have marked It.” 


—jonm L Hory, President, National Committee on 
U.S.-China Relations 


$24.95 doth 


Veering Right 
How the Bush Administratlon Subverts 
the Law for Conservative Causes 
Charles Tiefer 
“A brillant exposé of abuse of power at the 
highest levels—..This book lays bare the ques- 
tlonable legal foundation on which Bush II 
constructed his signature domestic and for- 
elgn policies. Veering Right should outrage 
anyone who cares about the rule of law and 
the principle of open government.” 
Rrra Hooy Waan 
$27.50 cioth 


The History of 
Human Rights 


From Anclent Times to the 

Globalizatlon Era 

Micheline R. Ishey 

“This well-written book, chock-full of knowt 
edge, presents a history of the Idea, or Kieas, of 
human rights through the prism of the author's 
thoughtful views on key controversies that 
bedevil human rights discourse to thts day.” 


—S$m NeeL Rooury, Chalr, Unrversity of Essex Human 
Rights Centre; Member, (UN) Human Rights 
Committee 


$60.00 cloth, $24.95 paper 


Shadows of War 


Violence, Power, and Intemational 
Profiteering In the Twenty-First Century 
Carolyn Nordstrom 
“Nordstrom, a pioneer In warzone ethnogra- 
phy, gives us an up-close view of the shad- 
owy worlds of wartime economics. Money 
laundering, blood diamonds, gun running— 
Nordstrom puts faces on each of these... 
This Is an Innovattve and Important book.” 
—Crmmaa Emos, author of Maneuvers 
Calffornra Series In Public Anthropology 
$50.00 cloth, $19.95 paper 
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NEW IN PAPERBACK 


Second Front 
Censorship and Propaganda 
In the ‘1991 Gulf War 

John R. MacArthur 
Foreword by Ben H. Bagdikfan 
UPDATED WITH A NEW PREFACE 
$19.95 paper 


Safe Food 

Bacterla, Biotechnology, and Bloterrorism 
Marion Nestle 

$15.95 paper 


Marching on Washington 
The Forging of an American Political 
Tradition 

Lucy G. Barber 

UPDATED WITH A NEW PREFACE 

$19.95 paper 


Smallpox 

The Fight to Eradicate a Global Scourge 
David A. Koplow 

$14.95 paper 


Global Community 

The Rote of International Organizations 
In the Making of the Contemporary World 
Akira Irtye 

$16.95 paper 


Talk About Sex 


The Battles over Sex Education 
In the United States 

Janice M. Irvine 

UPDATED WITH A NEW PREFACE 
$16.95 paper 


Paradise Lost 

Callfornla’s Experience, America’s Future 
Peter Schrag 

UPDATED WITH A NEW PREFACE 

$18.95 paper 















Our Political Systems 
A new book by: Dr. A. H. Krieg 
Specifications: Pages 268 4 
ISBN: 1-93085-991-0 

Publisher: Elderberry Press 
The Party System was a book written by Cecil 
Chesterton and published in 1911. Our Political 
Systems finds that neither the arguments nor the 
realities have changed much in over 100 years. It 
pinpoints the astounding similarities between 20° 
century England and 21" century America, Canada, 
& England. It proves the endemic and inherent 
corruption of two party politics, and recommends 
solutions. 
Price $ 19.95 From: all booksellers Amazon.com 
a2zPublications.com or: 1-800-431-1579 
Wholesale: Ingrams 
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A history of the place and 
the people whe made ii 


FOR THE MANY OR THE Few 

The Initiative, Public Policy, 

and American Democracy 

John G. Matsusaka 

“A remarkable achievement. This is the first 
comprehensive scienufic examination of how 
vorer inicanves affect public policy in the 
United States. This extremely well writen 
book deserves attention not only from acedem- 
xcs but also from anyone who cares about 
direct democracy’s impact on public policy. It 
is essennal reading for those who want to bat- 
de long-standing conjectures about the inina- 
uve process with systematic evidence.” 
—~Archur Lupia, author of Stealing the 
Initeatree 

Com $29.00 


DOWNTOWN AMERICA 

A History of the Place and the 
People Who Made It 

Alison Isenberg 

“A moet impressive achievement. Ir ıs refresh- 
ingly orginal in key respects, nduding ins 
ingenious analyms of untapped written and 
visual sources, rs keen attention ro the mpor- 
tance of race and gender, and its inrerdiscipl- 
nary scope It is also one of those rare books 
thar reintegreres and remterprecs familiar and 
novel elements of public history and sumulares 
personal reflecnon.”—John F Kesson, author 
of Rudeness and Crvlity 
HISTORICAL STUDIES oF URMANI AMERICA 

Com $32.59 


RICHARD A. EPSTEIN 


SKEPTICISM 


a CLASSICAL 


i mai LIBERARSE 


, Beethoven's Ninth 


VP diliead History 


UST 


ections 


REHEATING A FAI 
LECTORAL PROCES 


NEW FROM CHICAGO 














a 
we! “ft fate’ 


aft a wlutioit 








Now in paper 


SKEPTICISM AND FREEDOM 

A Modem Case for Classical Liberalism 
Richard A. Epstein 

“A luad and vigorous . . . account of classical 
liberalism that goes a significant way toward 
sansfying our need for thoughtful reflection on 
the foundations of the liberal democratic 
order "—Daniel J. Mahoney, First Things 
Stupoes m Law ap Ecososucs 

Param $20.00 


BEETHOVEN’S NINTH 
A Political History 
Esteban Buch 
Transtated by Richard Milter 
“Whar is it about Beethoven's last symphony 
that admits such wildly drvergent ınterprera- 
tions end appropnations, and how did it man- 
age to overcome its associations with Nexsam 
to become the anthem of pan-European 
democracy? . . In examining the symbiouc 
relarionship between music and polrtics, Mr. 
Buch offers a new view of the convoluted route 
by which the composer's music, and the ninth 
symphony ın particular, came to form the 
soundtrack of the aspiranons of the European 
dream." — The Ecsnsmist 
Par $1700 
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JUST ELECTIONS 

Creating a Fair Electoral Process 

In the United States 

Dennis F. Thompson 

“Much more than a quick reaction to a particu- 
lar event . . . Just Elections reflects years of care- 
ful thought and evaluation of electoral prac- 
uces in the United States .. The real value of 
Thompeon's book is found in the questions it 
presents and in the way it forces the reader to 
think hard abour the application of important 
principles to the electoral process.” 
—Thomas G. Walker, Lew and Politics 

Book Remew 

Parse $15.00 


THE OLD REGIME AND THE 
REVOLUTION, VOLUME I 
The Complete Text 


Alexis de Tocqueville 
Edited with an Introduction and Critical Apparatus 


by François Furet and Francoise Méonto 
Transtated by Alan S. Kahan 


“A most remarkable book—exploratory, 
longnghted, on occesion brilliantly witty, and 
quite the equal in weight to Demecracy sm 
America.”"—P.N Furbank, New York 
Renew of Books 

Papar $20.00 
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ths change, we might reverse rt 





shapes Congress as m mnsunrnon 
$44 86 Cioth - 0-8081-3458-0 
$29 95 Paper - 0-8061-3406-8 
448 pages 
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If you teach, study, work or have an interest 
in international relations, you must have 


lob 
enda 


Issues before the 59th General Assembly 
of the United Nations 





Available August 2004 for fall courses and 
the opening of the U.N. General Assembly— 
the 2004-2005 edition of A Global Agenda. 


Published by the United Nations Association of the United States of America 
Approx. 336 pages; $28.96 [ISHN: 1-880632-69-1] 





Theda Skocpol addremes the dechne m avic mrvolve- 
ment, which has not ahwrys been the case mn thes coun- 
try md contends thar by undentndmg the causes of 


$24 06 Paper - 0-8061-3827-8 384 pages 


Women Transformng Congress 

Edted by Candy Sumon Rosenthal 

cara that aanne the mereng mfinence of women 
on Congress and the ways m winch gender defines and 


In The Demonax Century, Seymour Marun Lipet and 
Jeon M Lekm combene socal, cukura, economnec, sod 
msutotane! analyses to explem why democracy has sic- — 
ceeded m some countes and fled m others 

$34 96 Cloth -0-8081-3018-0 480 pages 








COVERED in the 2004-2005 volume: 
° The Iraq War 
+ Weapons of Mass Destruction 


e Global Health—with a Special Prologue 
by World Health Organization Director 
General Dr. Jong-Wook Lee 


- U.N. Efforts to Advance Human Rights 
* The Millennium Development Goals 
* Regional Peacekeeping Efforts by the U.N. 


+ Legal Issues and Cases throughout the 
World 


e NAFTA’s First 10 Years 
* The Current State of U.N. Reform 
- And more... 


PLUS: 


+ What to Expect notes regarding General 
Assembly plans for 2004-2005 


« Essays and Expert Commentaries 
- A handy Reference Tool section 


HOOORDER: Call toll-free [-S60-225-700 Tor loo omo vv Ub dusi ore, 








TWO NOVELS 





FOR OUR TIMES. 


159051131X 
Hardcover 
$18.00 
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Homeland 


A Novel 
R.H. Weber 


“Frighttemng and powerful.. " 
—Nadine Strossen, President, American Civi! Liberties Union 


“Homeland is an excellent novel, an amazing fichonalzed look at the 
year 2008, when the fear of color-coded threats and the U.S. govem- 
ment's utter disregard for the Constitubon in the name of fighting terror- 
ism has become routine ... Read this book and fear for our country” 


—WMichae! Ratner, President, Center for Constitutional Rights 


Three characters’ cunousty interwoven Ives are wrevocably altered by 
regulations Imposed by the U S. goverment In the name of security and 
patnotsm. At once a politcal thniler, a literary novel, and an indictment 
of the overreaching govemment, Hornetand ts truly a book for our times. 





Meritocracy 


A Love Story 
Jeffrey Lewis 


“Yale graduates, one of whom has brought along his charming, young 
wife, meet for a class reunion on a Maine Island. The men, assembled 
by Jeffrey Lewis, are not types, but credible Indmiduals, each In his own 
skin. They live in contemporary history, with memones of the Chil Rights 
Movement and the threat or acceptance of service in Vietnam. Malne, 
a landscape of quet beauty, comes alve with its cold nights, dangerous 
fogs, deer on the road, the cause of car wrecks, or a handy excuse ... 
Merfocracy s a dramabe, mveting novel of our times.” 

—Elizabeth Hardwick 


A group of ‘66 college grads, from Yale's pottical Inner circle, struggle 
with ther fnend’s decision to fight in Vietnam Lews tells the story of a 
generation when It was young, caught at the moment when history 
armed to exact a tragic and Inevitable pnce. 


| 
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With over 400 titles 
issued each year, 
United Nations 
Publications, covers the 
range of subjects being 
tackled by the UN, its 
agendes, and partners. 
Whether it is peace 
and security, human 
rights or development 
the subject is most - 
likely to be covered by 
UN Publications, which, 
since the first printing 
of the UN Charter. have 
been providing valued 

. information to those in 
government, academia 
and business. _ 


UNITED NATIONS PUBLICATIONS - 
Sales and 


SHATTERED VOICES 

Language, Violence, and the Work of Truth 
Commissions 

Teresa Godwin Phelps 

“Hf you want peace, you must work for justice. Teresa 
Phelps presents challenging and provocatree ideas of 
justice and explains what truth commissions can and 
cannot do as vital perts of the justice process “ 
—Theodore M Hesburgh 

Pennesytwense Studees an Huren 

2004 | 192 pages | Goth | $39.95 


THE VARIETIES OF POLITICAL 
EXPERIENCE IN EIGHTEENTH-CENTURY 
AMERICA 

Richard R Beeman 

Ranging from Virgrrua, Massachusetts, New York, 
South Carolina, and Penmsyheanta to the backcountry 
regions of the South, the Mid-Atlantic, and northern 
New England, this book offers an ambrtrous overview 
of polrtca! Irfe in pre-Revolutonary Amertca 

Barty Amencen Studies 

2004 | 378 pages | 25 dius | Cloth | $30 96 


Sales #: E04//LA2 
ISBN: 9280810944 


ing Section, Room DC2-0853, 
Tel. 900-253-9646, 212-963-8302; Fax 212-963-3489 - 


United Nations Publication 


Public 
A511, New York, N.Y. 10017 
E-Mail: publications@un.ong; 


Sales #: E.04JILA.8 
ISBN: 9280811010 


Sales #: E.03JI.G.79 
ISBN: 9211214017 
Pages: 368pp. 
Price: £20.00 


Visit us at: www.un.org/publica 
VISA, MC and AMEX accepted. 


UNIVERSITY OF |PENNSYLVANIA 


HUMAN RIGHTS, THE RULE OF LAW, 
AND DEVELOPMENT IN AFRICA 
Edited by Poul Tlyambe Zeleza 

and Philip J McConnaughoy 

This volume makes a sgnificant contnbution to the 
debate about the connections betereen the protection 
of human nghts and the pursult of economic develop- 
ment m Africa 

Penmpytvenss Stuches m Humen Paghts 

2004 | 312 pages | Cloth | $49 95 


I'M THE TEACHER, 
YOU'RE THE STUDENT 

A Semester In the University Classroom 
Patrrck Allitt 

The book offers background and gurdance to those 
concerned with the state of higher education today, 
from educators to general readers, from young 
faculty facing the classroom for the first time to 
parents whose children are heading off to college. 
2004 | 256 pages | Cloth | $59 85 | Paper | $19 96 


AVAILABLE FROM BOOKSTORES OR CALL 


UNDERSTANDING TERROR NETWORKS 
Marc Sageman 

"Marc Sageman advances our understanding of al 
Qeeda as only the best-known part of the global 
Salafi jihad. A mayor contrbuton to network analysts 
In rts own nght, thts rs a very powerful book ° 
—Randall Collins 

2004 | 232 pages | Goth | $29 96 


EVERYDAY POLITICS 

Reconnecting Crtzens and Public Life 

Harry C Boyte 

“Through a combmation of expenence, polrocal 
analyss, and philosophy, [Boyte] shows us how 
‘everyday polrics’ can actually improve people's Irves ” 
—Jennifer Hochschild, Harvard University 

Sept 2004 | 284 pages | Goth | $29 96 


KARAOKE FASCISM 

Burma and the Politics of Fear 

Monique Skrdmore 

The first in-depth ethnography of the fascrstic regune 
m Burma 

The Ethnography of Polrbea! Violence 

2004 | 284 pages | 22 ilius | Goth | $68 96 | Paper | $22 60 


1-800-537-5487 





www upenn.edu/pennpress 





What Parents Owe Thelr Children 
and What Society Owes Parents 


ANNE L ALSTOTT 


“Alstott succeeds in making an extremely 
compelling case: public policy can do a much 
better job rewarding and supporting modem 
day mothers and fathers who provide indls- 
pensable care for chikdren A powerful and 
timely book "—Sytvta Aaa Hewlett, author of 
Creating A Lite 

2004 $2750 


Polttical Education and 
Liberal Democracy 


EAMONN CALLAN 


“Callan łs to be commended for applying a 
sophisticated Uberal polttical philosophy to 
the difficult and neglected [ssues of citizen- 
ship education... Even when readers dis- 
agree with Callan's conclusions, they will 
benefit from engaging with his argument and 
analysis "— The Rewiew of Pollties 


The Erosion of Political Support In 
Advanced Industrial Democracies 


RUSSELL J. DALTON 


This book uses an unprecedented array of 
cross-national public opinion surveys to docu- 
Mant the erosion of polrical support in virtually 
all Western damocracies These trends are 
making governing more difficult, but also fuet- 
Ing demands for polttical reform that may lead 
to a further expansion of the democratic process. 
(Comparatrve Polrics) 


Rellgtous Thought and Polttical 
Practice In Africa 


STEPHEN ELLIS and 
GERRIE TER HAAR 


“Power in the matenal world, most Africans 
continue to believe, cannot be separated from 
tts source In the splirttual.... The clanty and 
accuracy of this analytical tens makes Worids 
of Power one of the most important books on 
African religion—and, Indeed, on African pol- 
tcs—to appear In many years " 


—Prefessor R. Scott Appleby, Untrersity ef Notre Dame 


2004 paper $1995 doth $60 00 


ONFORD | 2° 


PRIVEE RSTEY PRISS 


` trade between the peoples and the powers of 
this crucial waterway.'—Ortando Figes 


` pensable guide to the history, cultures, and 





. TERRORISM AND THE 


7 COLP RUBIN 


"| [A] fine documentary reader It also offers 
a sampling of Amencan, Middle Eastern, and 


+ chronology of Middle East-connected terror- 





È. THE POLITICS OF 
PUBLIC HOUSING 


’ RHONDA WILLIAMS 


: and beyond, polttcalty expedient labels to 


; poor black women In urban America 
’ Drawing on dozens of Interviews, Williams 
challenges the notion that low-income hous- 





y LANGUAGES OF 


. Using local language sources and every 


‘Impact on the polrtical culture of the Kashmir 
Valley, there !s nothing mevitable or definite 
` about the ‘polrtical regionalism’ and ‘Istamic 





"A masterful account of the ever-changing 


Based on extenstve research in multipte lan- 
guages, this book fs an Innovative and Indis- 


polrtcs of the Black Sea area and rts future at 
the heart of Europe and Eurasia 
2004 $2995 


New In Paperback! 
ANTI-AMERICAN 


A Documentary Reader 
Edited by BARRY RUBIN and JUDITH 


world reactions to the event. With a detailed 





tsm against Amencans and helpful glossanes, 
this book rs a useful reference work."—Foreign Affairs 
2002 (paper 2004) paper $16 95 


Black Women’s Struggles against 


Urban Inequality 


This collectve biography takes us behind, 
provide an inctsrve and intmate portralt of 





Ing was a resounding failure that doomed 
three consecutive generations of post-war 
Amencans to entrenched poverty 
(Transgresang Boundanes Studees in Black Polrtcs and Black Communrbes) 
2004 $2995 


the Making of Kashmir 
CHITRALEKHA ZUTSHI 


important archrve, this books shows that 
while historical changes had a profound 


particulansm’ of thls area today. 
2004 $3500 








Fellowships 


The Radcliffe Institute for Advanced Study at 
Harvard University awards approximately 30 
fully funded fellowshlps each year. Radcliffe 
Institute fellowships are designed to support 
post-doctoral scholars and sclentists, as well as 
artists and writers of exceptional promise and 
demonstrated accomplishment who wish to pur- 
sue work in academic and professional fields 
and In the creative arts. Applicants must have 
recelved thelr doctorate or appropriate terminal 
degree by December 2003 In the area of the pro- 
posed project. Please check the website for 
more Information. 
































The stipend amount Is $50,000. Fellows recelve 
office space and access to libraries and other 
resources of Harvard University. During the fellow- 
ship year, which extends from early September 
2005 through June 30, 2006, residence in the 
Boston area Is required as Is particlpation In the 
Institute community. Fellows are expected to 
present thelr works-In-progress and to attend 
other fellows’ events. 











For more information, visit our Web site at 
www.radcliffe.edu. For an application, contact: 
Radcliffe Application Office 

34 Concord Avenue 

Cambridge, MA 02138 

phone: 617-496-1324 

fax: 617-495-8136 

email: fellowships@radcliffe.edu 


Applications must be postmarked by October 1, 









HIGH RISK and 
bts Wb l ! 
{ High Risk and 
E Ambition 
A cops W. Bush 
aes “4 Schier 














“Places the Bush Revolution, and that's what H Is, in context, 
grving the reader the tools for making sense of the momentous 
changes taking piace In the Unfted States and around the world.” 

—Nacheel A. Genovese, author of The Power af the Amencan Presidency 


“We need this book as an axploration of the legitimacy of the 
George W. Bush presidency. His admirustratton has the potential to be 
a pivotal point in our history, eminently worthy of the anatyses thes 
book brings."—G. Calvin Mackenze, Colby College 


Paper $19.95 
ISBN 0822958503 


—MIOVWICUUE 
hare it, wit 





| The Limits of Protectionism 
Building Coalitions for Free Trade 
Michael Lusztig 





=A tour die forca for thoss who wast 
to understand why many countries, 
it not the entire world, have embraced 
of a moet essential Ingredient in this 
Profound sea change: the conversson of 
business from protecbonism to free 
markets.” —Guy Pottras, Trinity Unversity 


“An important contribution to potica! 
economy. its theory, blending rational 
choice and political cutture, ts novel and 
compelling. And the sweep of historical 
evidence . . . ls breathtaking.” 

—fFic Ustaner, University of Maryland 
Paper $27.95 

ISBN 0822958430 














UNIVERSITY OF PITTSBURGH 


“An entertaining and informative 
framework for the discussion of methods 
for nominating presidents from the 
beginning of the republic onward.” 
—Neison W Poisby, University 

of Californea, Barkeley 

"In or out of the acadenuc world, Emmett 
Buel and Wiliam Mayer are two of 
the best people writing today about the 
presideatial nominating process. 
Thew latest collaboration ts a unique 
cotechon of commentary and writings on 
Its evolution.” —Rhodes Cook, author of 
Race for the Presidency 














HOC S lesignec 





Endunng Controversies in 
Presidential Nominating Politics 
Edited by 

Emmett H. Buell Jr. and Willam G. Mayer . 









or class use 


“A considerable feat of scholarship combining genuine innovation with 
real insight Into Enrope's complex politics of migration and 
ethmicity. It will be required reading for anyone with an Interest In 
these Important subjects."—Andrew Geddes, University of Liverpool 
“tretand has done a masterful job in addressing a number of Interretated 
kssuss: the entttiements of clizensittp; socloeconomuc mtegration and 
cultural diversity; and the future of the welfare state in face of the 
pressures of globalizatton."—Willlam Safran, University of Colorado 
Paper $27.95 

ISBN 08229658457 


Becoming Europe 
Immi Kael irae Oe 
the Welfare State 
Patrick Ireland 







a Yar 






















For an examlnation copy, or to order a copy for personal use, 
visit www.pitt.edu/~press/orderform or call 773-702-7000 
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PROFILES IN POWER 
TWENTIETH-CENTURY TEXANS 

IN WasHINGTON * New EDITION 

Epirep sy Kennetu E. Henpricxrson Jr., 
Micuaert L Co tins, AND Patricr Cox 


Profiles tn Power offers once beans of 
fourteen twrentieth-century Texans who wielded 


ificant political and influence in 
PDC Em lished in 1993 by 
Harlan it has revised and updated 


with new chapters on John Nance Gamer and 


Henry Gonzalez and expanded cha on Tadon 
Johnson, Barbara Jordan, Ralph Taah, ım 
Wright, and John Tower 


STRADDLING THE BORDER 
Inam1GRATION Poxicy AND THE INS 


By Lisa L Macafa 


of | tmplementation 


ANTHROPOLOGISTS IN 
THE PUBLIC SPHERE 
Speaxino Out on War, Pracer, 
AND AMERICAN PowER 

EpiTep sy Roserto J. GONZÁLEZ 


“This book demonstrates that bave a 
great deal to contribute to political and that the 


public spirit of anthropology may be rekindling.” 
Untvensty of Monroe Tren Chee 


FATAL FUTURE? 

TRANSNATIONAL TERRORISM AND 

THE New GLoBAL DISORDER 

By RicHarp M. PEARLSTEIN 

of terrorism in the context of the new global 
national (1.e., privately funded and internationally 
thorough attention to the two most virulent 
ethnoterrorism and radical Islamic terrorism. He 
also discusses the terronst race for ns of 
mass destruction and the types of 

including cyberterrorism, that are likely to occur in 
coming years. 

ED $5 paper; By 00 Clack 


ED $5 paper, 155 00 cath 


SADDAM’S WAR OF WORDS 
Poxitics, RELIGION, AND 

THE IRAQI Invasion or Kuwait 
By Jerry M. Lone 


This landmark book offers the first in-depth 
investigation of how Saddam Hussein used a potent 
blend of Islam and Arab nationalism to legitimate 
hus invasion of Kuwait in the eyes of fellow Muslims 
U.S.-led coalition and rts Arab members. 

ED & papat; By 00 cloth 


CINEMA AND THE SANDINISTAS 
FILMMAKING IN 

RevoLuTIONARY NICARAGUA 

By JONATHAN BUCHSBAUM 

and, as such, tf is i nat only to film 
students, but also to thera world studies and cultural 


Teras Palm ond Madea Studs Serras, Thomas Schotz, E dator 
5 bitte phates, 539 95 paper, 570 00 deth 


CONTEMPORARY THEATRE 

IN MAYAN MEXICO 

Dratu-Deryine Acts 

By Tamara L. UNDERINER 

“This is an exciting, original contribution to 
both Mayen studies and theatre studies 
Each of the bodies of work ecamrned is fascinating, 
extremely timely, and almost unstudied by scholars.” 
—Cynthia Steele, Professor of Comperatrre Literature, 


2! 95 paper, B55 00 cloth 


MAKING ECUADORIAN HISTORIES 
Four CENTURIES oF DEFINING Power 
By O Hueco Brnavipes 


and most ancient 1s in fact intimately inked to the 


production of power.” 
—Amala Pallares, Assistant Professor of Polincal 
Science and Latin American Studies, 


850 00 cloth 


LATIN AMERICAN LAW 

A History or Private Law AND 
INSTITUTIONS IN SPANISH AMERICA 
By M C. Mirow 


“The impresseve scope of this book makes it a major 


—Lanren Benton, New York Uni n author of 
Law and Cudturer 
Legal Regimes iu World History, 1400-1900 


L 00 cleth 


ISOCRATES AND 

CIVIC EDUCATION 

Epitrp sy Taxis PouLakos 

AND Davip Derew 

In this volume, ten leading scholars of Classics, 
rhetoric, and p hy offer a i 
interdisciplinary of Isocrates as a civic 
educator. Their essays are grouped into sections 
that i igate Isocrates’ program in civic 
education in general and in comparison to the 
Sophists, Plato, Anstotle, and contemporary 
views about civic education. 

sD 00 cerh 


UNIVERSITY OF 
TEXAS PRESS 


800-252-3206 
WWW.UTEXAS.EDU/UTPRESS 
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New from 
Cornell 
University 
Press— 


Dams and Development 
Transnational Struggles for Water and Power 
Sanjeev Khagram 

“Khagram brilliantly navigates the 
complex histones of mega-project 
development and never loses the plot: 
the changeable trajectorles of these 
efforts across the decades and the 
capacity of the powerless to contest thls 
kind of development. The analysis shows 
that ‘history’ can be undone, no matter 
how powerful those who script It.” 
—Saskla Sassen, author of Globalization 
and Its Discontents. $45 00 cloth, 

$21.95 paper 


Mixed Signals 

US. Human Rights Policy and Latin Amenca 
Kathryn Sikkink 

“Sikkink argues that the centralrty of 
human nghts in the U.S represents an 
‘Identity shift’ In the natlonal conception 
of tts interests In the world She does an 
excellent job of showing how the 
creation of a bureaucratic apparatus 
focused on human nghts within the 
government played a key role in thls 
Identity shrft."—Willlam LeoGrande, 
Amencan Universtty, author of Our Own 
Back Yard. The Unrted States in Central 
America, 1977-1992. $29.95 


CHtizens without Shelter 
Homelessness, Democracy, and Polrtical 
Exclusion 

Leonard C. Feldman 

“Citizens without Shelter engages an 
important soaal problem in the most 
theoretically sophisticated and morally 
engaged manner This ts a rare combi- 
nation. In addrtlon, Ht 1s a very good 
read that shows what first-rate 
interdisclplinary scholarship ıs all 
about.”—Austin Sarat, William Nelson 
Cromwell Professor of Junsprudence and 
Polrtical Science, Amherst College. $35.00 
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State-Bullding 
Governance and World Order in the 2tst Certtury 
Frands Fukuyama 

“Fukuyama ıs a wonderful synthesizer of 
grand subjects, an adventurer who doesn’t 
mind summing up the history of develop- 
ment theory in one chapter and the history 
of organizational theory tn the next. He pulls 
this off with minimal resort to jargon, and he 
pulls the reader along with him.” 
—Washington Post Book World. $21.00 


Balancing Risks 

Great Power Intervention in the Periphery 
Jeffrey W. Tallaferro 

“Tallaferro argues that prospect theory 
explalns why great powers persist In 
otherwise counterproductive interventions 
This book has much to recommend tt: It ıs 
historically rich, methodologically well 
designed, and Is well written.” 

—Michael Desch, Director, Patterson School 
of Diplomacy and International Commerce, 
University of Kentucky Coenen STUDIES IN 
SECURITY AFFAIRS. $39.95 


Crude Awakenings 


Global Oll Securtty and American 
Foreign Policy 

Steve A. Yetiv 

“This Is an excellent book that goes 
against the grain of much of today’s 
thinking. It ıs a rare example of superb 
integration of domestic politics, 
geopolttics, international politics, and 
market economics. Yetlv sheds light on 
an important subject that pertains to 
the largest single sector of global trade 
and the locus of two very large wars 
fought by the United States over the 
past dozen years.”—Edward L Morse, 
former Deputy Assistant Secretary of 
State for International Energy Policy. 
$35.00 


Cornell University Press www.comellpress.cornell.edu At bookstores, or call (607) 277-221 


The Remnants of War 

John Mueller 

“In thls book Mueller charts the 
continuing dedine of one of the oldest 
and most important of all human 
practices. The Remnants of Waris a 
powerful and provocative account of the 
fate of war In our time” 

—Michael Mandelbaum, The Johns 
Hopkins School of Advanced Interna- 
tional Studles, author of The ideas That 
Conquered the World Cogne STUDIES IN 
Security AFFAIRS $29 95 


Healing Identities 

Black Feminist Thought and the Polrtics 
of Groups 

Cynthia Burack 

“Healing Identrties is a thought- 
provoking application of psychoanalysis 
to Identity group discourse and an 
insightful overview of black feminist 
thought. I Is simultaneously scholarly 
and polrtical. Stimulating!” 

—Man Jo Buhle, author of Feminism and 
Its Discontents. A Century of Struggle 
with Psychoanalysis. PSYCHOANALYSIS AND 
Socu THEORY. $42.50 cloth, $19.95 paper 


Virginla Woolf as Feminist 

Naomi Black ; 

“Black provides a richly detalled account 
of how Woolf's most controversial 
feminist book, Three Guineas, was 
concerved, constructed, and recetved 
Black’s powers of historical recovery 
illuminate the central place of that work 
In Woolf's career and tts continuing 
challenge to new generations of 
readers.” —Alex Zwerdling, University of 
Caltfornia, Berkeley. $47.50 cloth, 

$19.95 paper 


THE RIVER RUNS BLACK 
Bere a 





The River Runs Black 


The Environmental Challenge to 

China’ Future 

Elizabeth C. Economy 

“Economy’s book Is a clear and 
compelling reminder that no 
engagement with China can afford to 
Ignore China’s vast environmental 
dilemmas. ... The case for International 
engagement with China emerges even 
more strongly from thls book; the case 
for ‘irrational exuberance’ ıs dashed to 
smithereens "—Robert A. Kapp, 
President, US-China Business Counal. 
A COUNCIL ON FOREIGN RELATIONS Book. 
$29.95 


Corruption and Market In 
Contemporary China 

Yan Sun 

“Sun's timely analysis of corruption In 
China brings an enormous amount of 
empirical data to bear on the question, 
puts the Issue In comparative perspec- 
tive, and offers sharp and perceptive 
explanations A must read."—Joseph 
Fewsmith, Boston University. $49.95 
doth, $21.00 paper 


Building Democracy In 
Contemporary Russia 

Western Support for Grassroots 
Organizations 

Sarah L Henderson 

“Henderson has produced a well- 
written, superbly researched book on a 
topıc of central Importance to both 
scholars and policy makers. Through her 
theoretical rigor and practical expertise, 
she aptly reveals the pathologies and 
promise of International attempts to 
bulld civil society In the postcommunist 
world.”—Jullet Johnson, McGIll 
Unrversrty. $32.50 





4%. In the Shadow 
of Just Wars’ 


In the Shadow of “Just Wars” 

Violence, Polttics, and Humanitarian Action 
Médecins sans Frontiéres 

Edited by Fabrice Welssman 

“The contributors . propose that [at times] 
humanitarian actors, mindful of donor 
money, access to target populations, or 
polttics, forget thelr duties to those on the 
‘wrong’ side in... confilcts.” 

—The Village Voice. $52.50 cloth, $23 95 paper 


Domestic Budgets In a United Europe 


Fiscal Governance from the End of Bretton 
Woods to EMU 


Mark Hallerberg 

“Domestic Budgets in a United Europe 
contributes greatly to the literature on the 
polttics of fiscal ‘retrenchment’ In compara- 
tive politcal economic research. In exemplary 
fashion Hallerberg combines deducttve 
theory building based on formal analysis, 
macro-quantrtative empirical comparison, 
and process tracing through detailed case 
studies Illustrating the diversity of mechanisms 
and outcomes In fiscal policy making.” 
—Herbert Kitschelt, Duke University $39.95 


Democracy’s Voices 


Social Ties and the Qualrty of Public Life 
In Spain 

Robert M. Ashman 

“Democracy Voices Is brilliantly 
conceived, carefully researched, elegantly 
written, and captivating 1n Its conclu- 
slons. tt will establish Fishman as one of 
the major sociologists of democraty in 
our generation... This book will be of 
interest to scholars of democracy 
throughout the social sclences, and rt 
deserves a wide readership beyond the 
walls of the academy as well” 

—an Shapiro, Yale Untverstty. $29.95 





LEADERS 





Leaders and Their Followers In 

a Dangerous World 

The Psychology of Polttical Behavior 
Jerrold M. Post 

Foreword byAlexander L George 

“Leaders and Their Followers in a 
Dangerous World ıs a highly sophisticated 
psychoanalytic exploration of famous 
polttical leaders as well as lowly assassins 
and terronsts. The nch collection of 
examples makes extraordinarily 
stimulating reading, Illustrating the 
vanity of psychic factors that drive the 
people who shape history.” —Fred C Iklé, 
Center for Strategic and International 
Studles, and former Undersecretary of 
Defense for Policy PsycHoanarysts AND SOCAL 
THEORY $29.95 


NEW IN PAPER— 


The Purpose of Intervention 


Back-Alley Banking 
Private Entrepreneurs In China 
Kellee S. Tsal 


$19 95 


Cornell University Press www.comelipress.comell.edu At bookstores, or call (607) 277-221 ( à 
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To see more graphs, go to www.stata.com/stata8/graphics.html 
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on race in the social sciences. It provides a forum fo 


and increased understanding of race and society from at 
disciplines, including but not limited to economics, political 


sociology, anthropology, law, communications, public polic 
and history. Each issue contains an editorial overview, 


essays, original research papers, and review essays covering 


books, controversies, and research threads. 


The first volume will include such noted authors as Clai 
on “The Psychology of Race”, William Darity on “B 


Global Economy”, Mary Waters on “Immigrants and the Civ 
Revolution”, Cathy Cohen on “Gender, Sexuality, and Po 
Claire Kim on “Race and Politics”, and review essays by É 
Bonilla-Silva, Glenn Loury, Alice O'Connor, Adolph Reed, Jr, 


Smith, and Barbara Ransby. 


VOLUME 1, 2004 


Subscriptions: 

$135.00/£85 institutions print plus online 
$120.00/£75 institutions print only 
$115.00/£75 institutions online only 
$65.00/£40 individuals print only 


To subscribe in the U.S.A., Canada, or Mexico, please cont 
Cambridge University Press by phone at 800-872-7423, 
via fax at 845-353-4141, or by email to journals_subscripti 


To subscribe from elsewhere, please contact Cambridge niv 
by phone at +44 (0) 1223 326070, via fax at +44 (0) 1223 325 


or by email to journals@cambridge.org 





EDITORS 


LAWRENCE BOBO 
Harvard University, USA 


MICHAEL DAWSON 
Harvard University, USA 


Published on behalf of the 
W.E.B. Du Bois Institute for African 
and African American Research, 
Harvard University 





A CAMBRIDGE 


ta) UNIVERSITY PRESS 


A NEW 
JOURNAL 
IN 2005 


Editors: 


Karen Beckwith 
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Lisa Baldez 
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litics & Gender is an intellectually 
gorous, methodologically sophisticated, 
yenda-setting journal that publishes the 
ghest quality scholarship on gender and 
olitics and on women and politics. It aims 
3 represent the full range of questions, 
ies, and approaches on gender and 
men across the major subfields of 
ical science, including comparative 
s, international relations, political 
, and U.S. politics. The Editors 
icome studies that address fundamental 
ons in politics and political science 
1 the perspective of gender difference, 
ell as those that interrogate and 
ige standard analytical categories 
conventional methodologies. Each 
ə features book reviews and a 
ebates” section in which scholars from 
lifferent fields within political science 
ent on contentious political and 
odological issues. 
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